The Signal and the Noise: Why So Many Predictions Fail-but Some Don't
Author: Nate Silver
Rating: 4/5
Date Read: 2012/12/29
Pages: 545
3.5 stars, but I'm rounding up because I like stats. The first half of this book is fantastic: it outlines the issues that cause people to make terrible predictions. Across many fields, people are not so good at prediction, for a number of reasons. Silver fights the idea that having enough data means that predictions will be great. Data is noisy, and just adding more noisy data isn't going to allow computers to magically find signal. There's a reason why statisticians say "Garbage in, garbage out."
The second half of the book, which focuses on the so-called solution, is where everything falls apart. According to Silver, the answer is Bayes' theorem. I don't take much issue with this: Bayes' theorem is elegant and useful, particularly when it comes to making inferences about the world. There's a reason why you can't go to a conference on cognition without hearing a talk on Bayesian learning: it's a good idea, and it seems to work.
The problem is that Silver seems to confuse Bayesian statistics with "thinking probabilistically" when the two don't really mean the same thing. I think probabilistically when I get on a flight (or walk past the lottery ticket counter without buying one). The probability of a safe flight is close to 100%, whereas the probability of winning the lottery is close to 0%. This is not Bayesian, it's just a simple computation of some event of interest divided by total events (e.g., safe flights / all flights (safe and... well, not so good)).
Bayes' theorem is different. Most simply, it's stated as follows: P(A|B) = [P(B|A) * P(A)] / P(B). As you may or may not be able to discern from the equation, it allows you to compute the probability of some event A occurring given that some event B has occurred, using the probability that B occurs given A, the probability of A on its own, and the probability of B. In simple terms, imagine that I'm waiting on the elevated platform for my train. When I get to the platform, I don't have much reason to believe that my train will go express and skip my stop. However, I wait for a while, longer than I should have to, and the train doesn't come. I can compute the probability that the long wait signals that I'll get skipped using Bayes' theorem, and it's possible that my brain has been doing something Bayesian throughout my many morning commutes. Give it a little input, and Bayes will let me know if I'm better off taking a cab.
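To make the train example concrete, here is a minimal Python sketch of the calculation. Every number in it is invented for illustration (I have no real data on how often trains skip stops), and the function name `bayes_posterior` is my own. Here A is "my train goes express and skips my stop" and B is "I've waited longer than I should have to." P(B) is expanded using the law of total probability.

```python
def bayes_posterior(prior, likelihood, likelihood_given_not):
    """Return P(A|B) = P(B|A) * P(A) / P(B).

    P(B) is computed as P(B|A) * P(A) + P(B|not A) * P(not A).
    """
    evidence = likelihood * prior + likelihood_given_not * (1.0 - prior)
    return likelihood * prior / evidence


# Hypothetical inputs, chosen only to illustrate the arithmetic.
p_skip = 0.05                # P(A): prior chance the train skips my stop
p_wait_given_skip = 0.90     # P(B|A): long wait if the train is skipping
p_wait_given_no_skip = 0.20  # P(B|not A): long wait on an ordinary morning

posterior = bayes_posterior(p_skip, p_wait_given_skip, p_wait_given_no_skip)
print(f"P(skip | long wait) = {posterior:.3f}")
```

With these made-up inputs, the long wait raises my belief that I'm being skipped from 5% to roughly 19%: more worrying than before, but not yet a sure thing. Different inputs would of course give a different answer, which is the whole point of writing the prior down.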
Throughout The Signal and the Noise, Silver will profile someone who's making a lot of money betting on sports, or making a lot of money playing poker. Then, he'll say "And this is Bayesian! This proves that we should all be using Bayesian stats all the time!" He doesn't really explain how any of these people are applying Bayesian stats, or profile anyone making good predictions using a non-Bayesian approach. I like Bayes, but Silver is going to have to do more than say "Look! Bayes!" to convince me that it's the panacea for prediction.
Silver also points out that conventional methods for hypothesis testing produce far more false positives (usually called Type I errors) than they should. This is true, and most scientists are aware that they need to make some changes in the way they analyze data. Silver, of course, thinks that we should all be using Bayes, although he doesn't speak about any of the other methods that can reduce Type I errors (including simple methods, like reporting the effect size). At one point, Silver equates science with Bayesian thinking, while completely missing the utility of a well-designed experiment.
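One common way to make the false-positive worry concrete is to ask what share of "significant" results reflect real effects. The sketch below is my own illustration, not necessarily the argument Silver makes, and the inputs are assumptions: that 10% of tested hypotheses are true, that studies have 80% power, and that the significance threshold is 0.05. The function name `share_of_true_positives` is hypothetical.

```python
def share_of_true_positives(prior_true, power, alpha):
    """Return P(effect is real | test is significant)."""
    true_positives = power * prior_true           # real effects that get detected
    false_positives = alpha * (1.0 - prior_true)  # null effects flagged anyway
    return true_positives / (true_positives + false_positives)


# Assumed inputs, for illustration only.
ppv = share_of_true_positives(prior_true=0.10, power=0.80, alpha=0.05)
print(f"Share of significant results that are real: {ppv:.2f}")
print(f"Share that are false positives: {1.0 - ppv:.2f}")
```

Under these assumptions, about a third of significant findings would be false positives even though each individual test holds its Type I error rate at 5%. The result depends heavily on the assumed share of true hypotheses, which nobody really knows.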
I do recommend Silver's book, particularly for non-scientists who are interested in statistics. However, it's best taken as a series of interesting stories about prediction, and not as a fully developed theory for how most predictions should be developed.