More than mean and spread
A city reports its average household income is $80,000. But most households earn far less — a handful of millionaires pull the average up. The mean and variance alone can't capture this asymmetry.
The moment hierarchy
We've seen the first two moments: the mean (1st moment) tells us where the distribution is centered, and the variance (2nd moment) tells us how spread out it is. The third and fourth moments describe the distribution's shape.
Skewness: the 3rd moment
The skewness of is the standardized third central moment: It measures the asymmetry of the distribution around its mean.
| Skewness | Meaning | Example |
|---|---|---|
| Symmetric | Normal distribution | |
| Right-skewed (long right tail) | Income, waiting times | |
| Left-skewed (long left tail) | Exam scores (near-perfect clustering) |
Why the cube? Squaring (variance) treats left and right deviations equally. Cubing preserves the sign — large positive deviations contribute positively, large negative deviations contribute negatively.
Explore how skewness affects distribution shape — watch mean, median, and mode diverge:
Kurtosis: the 4th moment
The kurtosis of is the standardized fourth central moment: It measures the heaviness of the tails relative to the Normal distribution (kurtosis = 3).
The Normal distribution has kurtosis = 3. Excess kurtosis = kurtosis :
| Excess Kurtosis | Type | Meaning |
|---|---|---|
| Mesokurtic | Normal-like tails | |
| Leptokurtic | Heavier tails, more outliers | |
| Platykurtic | Lighter tails, fewer outliers |
High kurtosis doesn't mean "peaky." It means heavy tails, with more probability mass in extreme values. A distribution can be flat and still have high kurtosis if it has heavy tails.
Mold the distribution
Switch between distributions to see how skewness and kurtosis change. The exponential is right-skewed (skewness = 2); the uniform has light tails (kurtosis < 3):
Common distributions compared
| Distribution | Mean | Variance | Skewness | Kurtosis |
|---|---|---|---|---|
| Normal() | 0 | 3 | ||
| Exponential() | 2 | 9 | ||
| Uniform() | 0 | 1.8 | ||
| Bernoulli() |
Why do higher moments matter?
Practical interpretation
Positive skewness means the mean is pulled above the median because the right tail dominates. Negative skewness means the mean is pulled below the median because the left tail dominates. High kurtosis means rare events happen more often than a Normal model would predict.
Practice problems
Summary
| Moment | Measures | Formula |
|---|---|---|
| 1st: Mean | Center | |
| 2nd: Variance | Spread | |
| 3rd: Skewness | Asymmetry | |
| 4th: Kurtosis | Tail weight |
Mean tells you where a distribution sits, variance tells you how spread out it is, skewness tells you which way it leans, and kurtosis tells you how extreme the tails are.
Test your understanding
What's next
Now that we can fully describe a distribution's shape, we'll meet the moment-generating function, a single formula that encodes all moments at once.