What is Regression to the Mean?

The statistical tendency for extreme observations to be followed by less extreme ones closer to the population average.

TL;DR

Regression to the mean: extreme performances are typically followed by less extreme ones. Most apparent hot streaks contain a large luck component that fades.

Full explanation

Francis Galton named the effect in the 1880s after noticing that tall parents tended to have children shorter than themselves, and short parents children taller than themselves. The underlying math is universal. Any observed performance is the sum of a stable true-talent component and a noisy luck component. When you observe an extreme result — the noise was probably high. The next observation will, in expectation, contain less of that noise and come in closer to the true talent.

The implications for sports modeling are constant. A hitter posting a .380 batting average in April will, almost certainly, not maintain that pace. The .380 reflects true talent — but the gap between the .380 and his career line is mostly noise. The forward projection is closer to his career line than to the hot start.

Quantifying the regression is the harder problem. How much weight should you put on the new data versus the prior? The answer comes from Bayesian shrinkage: the size of the new sample, the variance of the underlying process, and the strength of the prior all enter the formula. In practice, sports models tune a regression weight per metric — pitcher strikeout rate stabilizes after fewer batters faced than pitcher hit rate, so the regression weight on strikeouts is lower.

The professional discipline around regression is to apply it before you act, not after. Anyone can look back at a hot streak and observe that it ended. The work is anticipating which currently-hot streaks are mostly luck and which reflect a real talent shift. The gap between observed performance and underlying advanced metrics — wOBA minus xwOBA, ERA minus xERA, shooting percentage minus expected — is the most direct signal that regression is overdue.

Formula

Bayesian shrinkage: projected_value = (prior_weight × prior + observed_weight × observed) / (prior_weight + observed_weight). Prior weight is tuned per metric based on how quickly the metric stabilizes.

Why it matters in our model

Every player rate stat in our system is regressed toward a prior. The shrinkage weight is tuned per metric — strikeout rate stabilizes fast, BABIP stabilizes slowly — and the prior is updated nightly. Without regression, our projections would be whipsawed by small-sample noise.

Frequently asked

How fast does a stat regress?

It depends on the stat. Strikeout rate stabilizes after ~70 batters faced; BABIP needs 800+ balls in play. Use stabilization research to set the weight.

Is regression to the mean the same as the gambler's fallacy?

No. The gambler's fallacy is the belief that past results affect independent future outcomes. Regression to the mean reflects the noise component of the observation, not the outcome itself.

Does regression mean I should always bet against hot streaks?

No — it means hot streaks are statistically likely to cool, but the market often prices them correctly. Edge comes from finding cases where the market hasn't fully regressed.

Related terms

← Back to glossary