Conditional Probability

POLS 3220: How to Predict the Future

Today’s Agenda

  • Learn how to estimate conditional probability

  • Introduce a common approach used by successful forecasters: balancing Inside View and Outside View.

  • Show how all this connects to a central problem in statistics: the bias-variance tradeoff.

Motivating Problem

Will the movie Roofman be “Certified Fresh” by Rotten Tomatoes?

How would you go about this prediction? What information would you research first?

Two Different Approaches

  • Inside View: Focus on the details of the case at hand.

    • What characteristics of Roofman suggest that it will be Certified Fresh?
  • Outside View: Ignore the details of the case at hand. Focus on what happened previously with similar cases.

    • How often does any film achieve a Certified Fresh rating?
  • Most people naturally gravitate towards the Inside View (Kahneman and Lovallo 1993), but the best forecasters combine both approaches!

Outside View

Tomatometer Total %
Certified-Fresh 3,259 18.4
Fresh 6,844 38.7
Rotten 7,565 42.8

Outside View

  • This base rate is a useful starting point.

    • Achieving a Certified Fresh rating is historically very rare. Serves as an “anchor” for our prediction.
  • What’s the problem with this approach? Why not stop there?

    • The pure Outside View yields a biased prediction, because it doesn’t incorporate any information about the case in question.

    • More details about the movie Roofman should help us refine our prediction.

Unconditional Probability

18.5%

81.5%

All Movies

Certified Fresh

Not CF

Conditional Probability

36%

64%

22%

78%

16.5%

83.5%

All Movies

Rated R

Not Rated R

Certified Fresh

Not CF

Certified Fresh

Not CF

Conditional Probability

Conditional Probability

36%

64%

63.4%

36.6%

25.5%

74.5%

All Movies

Rated R

Not Rated R

Released After 2000

Released Before 2000

Certified Fresh

Not CF

Conditional Probability

Conditional Probability

Conditional Probability

Conditional Probability

How far should we take this???

36%

64%

63.4%

36.6%

0.05%

99.95%

100%

0%

All Movies

Rated R

Not Rated R

Released After 2000

Released Before 2000

Directed by Cianfrance

Not Cianfrance

Certified Fresh

Not CF

Inside View

  • The “Inside View” is very confident about Roofman. R-movies directed by Derek Cianfrance are always Certified Fresh.

  • What’s the problem with this approach?

Inside View

  • Well, there are only 3 films in the entire database directed by Derek Cianfrance.

Practice

Is this strong evidence that Derek Cianfrance is an above-average director? \(P(\text{Certified Fresh}) > 0.25\)?

  1. If Cianfrance was just an average director, what is the probability that all three of his films would be Certified Fresh?
  2. If Cianfrance was just an average director, what is the probability that at least 2 out of his 3 films would be Certified Fresh?

Inside View

  • With only three data points, we cannot claim with confidence that Cianfrance’s next movie has a \(\frac{2}{3}\) probability of achieving Certified Freshness.

  • It could be that his movies are better-than-average.

  • But he also could have gotten lucky.

Bias-Variance Tradeoff

  • On one end of the spectrum (pure Outside View), we have a biased but precise prediction.

    • 18.5% of all movies are Certified Fresh.
  • On the opposite end (pure Inside View), we have an less-biased but highly uncertain prediction.

    • 66.6% of Derek Cianfrance’s movies are Certified Fresh.
  • The truth probably lies somewhere in the middle.

  • The “art” of forecasting is learning how to balance this tradeoff. How much should you weigh the Outside View vs. the Inside View?

Next Time

Bayes Rule as a method for computing conditional probability, and adjusting your predictions in light of available evidence.

References

Kahneman, Daniel, and Dan Lovallo. 1993. “Timid Choices and Bold Forecasts: A Cognitive Perspective on Risk Taking.” Management Science 39 (1): 17–31. https://www.jstor.org/stable/2661517.