The chaos becomes order
Roll a single die — the outcomes are flat, not bell-shaped at all. But sum 30 dice rolls? The result looks approximately Normal. This works for any starting distribution.
This is the Central Limit Theorem.
Statement of the CLT
Let be i.i.d. random variables with mean and variance . Then as : Equivalently, the sum is approximately for large .
If you standardize the sample mean, it becomes approximately standard Normal, regardless of the original distribution.
Requirements: The must be independent, identically distributed, with finite mean and variance. If the variance is infinite (e.g., Cauchy distribution), the CLT does not apply.
See it yourself
Draw from a crazy distribution — exponential, uniform, or even bimodal. Watch the sum distribution converge to a bell curve as you increase :
Why does it work? Intuition
The mathematical proof uses moment-generating functions: the MGF of the standardized sum converges to , which is the MGF of .
The CLT in practice
How large must be?
| Starting Distribution | Minimum for good approximation |
|---|---|
| Symmetric (e.g., Uniform) | |
| Mildly skewed | |
| Heavily skewed (e.g., Exponential) | |
| Very heavy tails |
The more "non-Normal" the original distribution, the more samples you need.
Polling and elections
The CLT connects everything
The CLT explains why the Normal distribution appears everywhere. Test scores are the sum of many small factors (preparation, sleep, luck). Measurement errors accumulate from many tiny perturbations. Stock returns aggregate many independent trades. Heights are shaped by many independent genetic and environmental factors.
The Normal distribution is the attractor: the shape everything converges to when you add enough independent things together.
Continuity correction
When approximating a discrete distribution (like Binomial) with the Normal:
The is the continuity correction — it accounts for the fact that the Normal is continuous but the Binomial is discrete.
Practice problems
CLT vs LLN
| Law of Large Numbers | Central Limit Theorem | |
|---|---|---|
| Says | is approximately Normal | |
| Type | Convergence in probability | Convergence in distribution |
| Tells you | Where the average goes | How it fluctuates around there |
| Key formula |
The LLN says the average converges. The CLT says how fast and in what shape it converges. The LLN is qualitative; the CLT is quantitative.
Summary
| Concept | Key Formula |
|---|---|
| CLT (standardized) | |
| Sum version | |
| Standard error | |
| Continuity correction | Replace with for discrete→continuous |
Test your understanding
What's next
With the CLT in hand, we're ready to tackle Markov chains, where the future depends only on the present.