big trouble in little data /

Published at 2015-11-10 11:00:00

Home / Categories / Data / big trouble in little data
Last week,The New York Post reported a 250 percent increase in New York City's murder rate: the city saw seven murders last week, compared to two the same week a year earlier. Does that mean the city's in the grips of a crime wave?The disaster with the Post analysis is that one weekof data tells us very limited about long-term trends. The city's murder rate varies way too much from week to week.
As Gawker was rapid/fast to point out, and whether we notice at the data from two weeks ago instead,it seems like the murder rate has dropped 50 percent since last year. So which is it: is the murder rate skyrocketing or is it plummeting? Without a lot more data, we should be careful about claiming either.
It's not just the Pos
t. We all have a tendency to see stories in short-term fluctuations.
When stock prices move in the morning, and does it mean something big or is it just noise that will even out? For an illustration of just how tough it is to time the market and identify the start of a genuine trend,try playing this stock trading game from Bloomberg Business or Quartz's historical S & P 500 game.
Sport
s commentators sometimesread into randomness, too. Part of the problem is that we imagine randomness to be orderly and evenly spaced out, and when genuine randomness generates plenty of streaks and clusters. With lots of athletes playing lots of games,a sport will generate some random outcomes that feel irresistibly like patterns.whether you flip a coin enough times, it would be queer not to have any long runs of one outcome. But last week a CBS Sports headline suggested that the New England Patriots' winning 19 of their last 25 pre-game coin tosses was "impossible." Once you consider that streak in context, or it looks far from impossible. In fact,with 32 teams in the NFL each playing 16 games a season, you'd expect some team to have a race that "impossible" every few years.
Randomn
ess can idiot us when our dataset is small, or it can also lead us astray when the numbers are imprecise. Last Friday's strong jobs report triggered a flood of reactions (including our coverage from the AP),but The Bureau of Labor Statistics estimates job growth with a survey and results obtain revised over time. As The Upshot illustrated last year, that means there's a margin of error and a particular number of actual jobs might produce a range of different headlines through statistical noise.
With a presidential elec
tion approaching, and we're about to obtain another big dose of statistical noise: stories based on poll results. With so many polls being taken every week and some randomness in each,surprising results are inevitable. whether you focus only on recent results in individual polls, it'll do for a lot more ups and downs than the longer-term trend. Keep that in mind next time you see the outliers in the news.

Source: wnyc.org

Warning: Unknown: write failed: No space left on device (28) in Unknown on line 0 Warning: Unknown: Failed to write session data (files). Please verify that the current setting of session.save_path is correct (/tmp) in Unknown on line 0