68-95-99.7 Rule — approximately 68% of data falls within 1 sigma, 95% within 2 sigma, 99.7% within 3 sigma
The empirical rule states that for a normal distribution, approximately 68% of data falls within one standard deviation, 95% within two, and 99.7% within three standard deviations of the mean.
The Universal Bell Curve
The normal distribution, first described by Abraham de Moivre in 1733 and later formalized by Carl Friedrich Gauss, appears everywhere in nature and science. Heights of people, velocities of gas molecules, measurement errors in experiments — all follow the characteristic bell shape. Its ubiquity is not a coincidence but a deep mathematical consequence of the Central Limit Theorem.
Mean, Variance, and Shape
Two parameters completely determine a normal distribution: the mean (mu) sets the center, and the standard deviation (sigma) sets the spread. A normal distribution with mean 0 and standard deviation 1 is called the standard normal distribution. Any normal distribution can be converted to standard form by subtracting the mean and dividing by sigma — this is called a z-score.
The Empirical Rule in Practice
The 68-95-99.7 rule is a powerful quick tool. If adult male height has mean 175 cm and sigma 7 cm, then 68% of men are between 168 and 182 cm, 95% between 161 and 189 cm, and virtually all between 154 and 196 cm. Anything beyond 3 sigma is extremely rare — a built-in anomaly detector.
Beyond the Bell Curve
While powerful, the normal distribution has limits. Financial returns have fat tails (extreme events are more common than the bell curve predicts), leading to underestimation of risk. Nassim Taleb's work on Black Swans highlights the danger of assuming normality where it does not hold. Always check your data before assuming a Gaussian model.
FAQ
What is a normal distribution?
A normal (Gaussian) distribution is a symmetric, bell-shaped probability distribution defined by its mean and standard deviation. It is the most important distribution in statistics because many natural phenomena — heights, test scores, measurement errors — follow it approximately.
What is the 68-95-99.7 rule?
Also called the empirical rule, it states that in a normal distribution, about 68% of values lie within one standard deviation of the mean, 95% within two, and 99.7% within three. This makes it easy to judge whether a data point is unusual.
Why is the normal distribution so common in nature?
The Central Limit Theorem explains this: whenever you add up many small, independent random effects, the result tends toward a normal distribution regardless of the underlying distributions. Since most natural measurements reflect many tiny influences, normality emerges naturally.
What do mean and standard deviation control?
The mean (mu) controls the center of the bell curve — where it peaks. The standard deviation (sigma) controls the spread — a larger sigma makes the curve wider and flatter, while a smaller sigma makes it taller and narrower. Together they fully determine the shape.