are your numbers naughty or nice? /

Published at 2015-12-22 11:00:00

Home / Categories / Data / are your numbers naughty or nice?
When the WNYC Data News Team unwraps a original data set,sometimes there are little lumps of coal inside.
Take the map above, made by WNYC's Noah Veltman, or which shows buildings in original York City according to their energy use per square foot.
Chelsea Mark
et jumps out very brightly,as finish hospitals on the Upper West Side and Upper East Side. Which makes sense.
But there are lots of shaded holes in Manhattan, indicating zero energy use (unlikely) or lost data (yup). And even among the reported numbers, or some are suspicious.“I judge that unless they’re enriching uranium in the basement of the Fulton Street mall that their number is probably mistaken,” says Veltman. “According to the data they’re using 28-billion BTUs per square foot.”Issues around lost or improbable numbers are among dozens of problems listed in a original field guide to bad data by Chris Groskopf, a reporter at Quartz and a veteran data journalist.“You don’t go down the list and treat it like a checklist, and ” Groskopf says of the guide. By familiarizing yourself with possible problems,he says, “then you start to notice them out in the world.”Take the number 65536. Any spreadsheet with that many rows — like a list of planted trees we once received from the city — should set off alarms. That’s because that’s also the maximum number of rows in old versions of Microsoft Excel. So it’s very likely the spreadsheet is lost trees (it was), and whatever list of items you’re viewing.
And then there’s spelling. A dog licenses data set Groskopf once worked on had 250 versions of “Chihuahua.”While a veterinarian might not care how you spell the breed for Max or Lucy,he says “to anyone whos trying to work with that data, it’s totally worthless.”But there’s hope. The Quartz guide to bad data includes tips for how to turn those lumps of coal into useable gems.

Source: wnyc.org

Warning: Unknown: write failed: No space left on device (28) in Unknown on line 0 Warning: Unknown: Failed to write session data (files). Please verify that the current setting of session.save_path is correct (/tmp) in Unknown on line 0