Question 1

What is regression to the mean?

Accepted Answer

Regression to the mean is the statistical phenomenon where extreme observations on one measurement tend to be less extreme on a subsequent measurement. A student who scores in the 99th percentile on a test will likely score lower (though still high) on a retest. This happens because extreme scores are partly due to skill and partly due to luck — and the luck component is unlikely to repeat.

Question 2

Why does regression to the mean fool people?

Accepted Answer

People naturally attribute the change to whatever happened between measurements. A coach punishes a player after a bad game, then the player improves — the coach credits the punishment, but regression to the mean explains the improvement. A CEO implements a new policy after record profits, then profits fall — critics blame the policy, but regression explains the decline.

Question 3

How is regression to the mean related to correlation?

Accepted Answer

The amount of regression is directly determined by the correlation between measurements. With perfect correlation (r=1), there is no regression. With zero correlation (r=0), extreme scores regress completely to the mean. The expected second score is: μ + r × (first_score - μ). This formula was discovered by Francis Galton in the 1880s.

Question 4

Can regression to the mean be prevented or corrected?

Accepted Answer

It cannot be prevented because it is a mathematical consequence of imperfect correlation, not a bias that can be removed. However, it can be accounted for in study design by using control groups. Any change in the treatment group that also appears in the control group is likely regression to the mean, not a treatment effect.

Regression to the Mean: Why Extremes Don't Last

Formula

The Invisible Force That Fools Everyone

Why Extremes Regress

Real-World Consequences

Galton's Original Discovery

FAQ

Sources

Embed

Regression to the Mean: Why Extremes Don't Last

Formula

The Invisible Force That Fools Everyone

Why Extremes Regress

Real-World Consequences

Galton's Original Discovery

FAQ

Sources

Other simulations: Statistics & Inference

Analysis of Variance (ANOVA)

Chi-Squared Test for Independence

Confidence Intervals & Sample Size

Hypothesis Testing & P-Values

Embed