How long until...?
How many times do you need to flip a coin before you get Heads? How many emails before you get a job offer? How many die rolls before you see a 6?
This "waiting time for the first success" shows up everywhere. Here's the formal definition.
The geometric distribution
If each trial succeeds with probability (independently), and = number of trials until the first success, then follows a Geometric distribution with parameter : We write .
The logic: to succeed on trial , you need failures (each with probability ) followed by one success (probability ).
Explore how the PMF shape changes as you adjust the success probability:
The PMF shape
The geometric PMF always decreases:
- The most likely outcome is (immediate success)
- Each additional failure makes the outcome less likely
- The PMF has a "tail" that stretches to infinity
Key properties
If :
| Probability | Expected trials | Interpretation |
|---|---|---|
| 0.5 | 2 | Fair coin → 2 flips |
| 1/6 | 6 | Fair die → 6 rolls to see a 6 |
| 0.01 | 100 | 1% chance → 100 attempts on average |
| 0.001 | 1000 | Rare event → needs many trials |
The expected wait is always , regardless of what happened before. This is the memoryless property.
The CDF: tail probabilities
The probability of needing more than trials:
This is the complement: you need more than trials exactly when the first all fail.
The memoryless property
This is what makes the geometric distribution special:
If , then for any :
In words: given that you've already waited trials without success, the probability of waiting at least more is the same as starting fresh.
The coin has no memory. If you've flipped 10 Tails in a row, the 11th flip still has P(H) = p. The past doesn't matter.
Proof (it's short!)
The geometric distribution is the only discrete distribution with the memoryless property.
Explore the memoryless property visually — watch how the conditional survival curve perfectly overlaps the original:
The curves overlap!
P(X > 3+2 | X > 3) = 0.4900 = P(X > 2) = 0.4900
Real-world applications
How many job applications?
If each application has a 5% chance of resulting in an offer: applications.
Server requests until failure
If each request has a 0.1% chance of crashing the server: requests on average.
Geometric thinking in everyday life
"How many tries until I find parking?" "How many dates until I meet the right person?" Whenever success is random and memoryless, the geometric appears.
Connecting to the Binomial
The Binomial and Geometric are close relatives:
| Property | Binomial | Geometric |
|---|---|---|
| Question | "How many successes in trials?" | "How many trials until first success?" |
| Fixed | Number of trials () | Success probability () |
| Random | Number of successes | Number of trials |
The Binomial counts successes in a fixed window. The Geometric counts time until the first success.
Summary
| Property | Formula |
|---|---|
| PMF | |
| CDF | |
| Expected value | |
| Variance | |
| Memoryless |
If each trial is like a coin flip that doesn't remember its past, and you're counting "how many until the first success," you've got a Geometric distribution.
Test your understanding
What's next
From counting "how many trials until a success" to counting how many events in an interval — the Poisson distribution, the workhorse of rare-event modeling.