Counting the uncountable
How many emails will you receive in the next hour? How many typos on a page? How many cars pass a checkpoint in 10 minutes?
From Binomial to Poisson
The Poisson distribution arises naturally from the Binomial. Imagine dividing time into tiny slices:
- In 1 minute, the probability of a call is high
- In each second (60 slices), the probability per slice is small
- In each millisecond (60,000 slices), the probability per slice is tiny
As we slice finer and finer, keeping constant, the Binomial converges to the Poisson.
The Poisson is the limit case: infinitely many trials, each with an infinitesimally small probability, summing to a finite rate .
The Poisson distribution
A random variable follows a Poisson distribution with rate if: We write .
The PMF has a clear structure:
- ensures the probabilities sum to 1
- grows with (more events becomes more likely up to a point)
- in the denominator eventually dominates (very high counts become rare)
Explore how the Poisson PMF shape changes with λ — notice how mean always equals variance:
Key properties
If :
Mean equals variance — this is the signature of the Poisson. If your data has mean ≈ variance, a Poisson model may be appropriate. If variance ≫ mean, look elsewhere.
| Rate | Use case |
|---|---|
| 0.5 | Earthquakes per year in a city |
| 2 | Typos per page |
| 5 | Calls per minute |
| 100 | Website hits per second |
Computing Poisson probabilities
When to use the Poisson
The Poisson is the right model when:
- Events occur independently: One email arriving doesn't affect the next
- Constant rate: The average rate doesn't change over time
- No simultaneous events: In a small enough time window, at most one event occurs
- Counting events in an interval: Time, space, or any "exposure" measure
Generate random events on a timeline at rate λ:
Classic examples:
- Number of accidents at an intersection per month
- Number of mutations in a stretch of DNA
- Number of photons hitting a detector per second
- Number of customers entering a store per hour
The Poisson approximation to the Binomial
When is large, is small, and is moderate:
This is incredibly useful for computation. Instead of calculating , just use .
Rule of thumb: The approximation works well when and .
See the convergence in action — fix λ and increase n to watch the Binomial PMF morph into the Poisson:
Poisson sums
The sum of independent Poissons is Poisson.
If and are independent, then:
If Store A gets 3 customers/hour and Store B gets 5 customers/hour (independently), the combined total is Poisson with rate 8/hour.
Poisson vs. other distributions
| Distribution | Question | Parameters |
|---|---|---|
| Binomial | How many successes in trials? | , |
| Geometric | How many trials until first success? | |
| Poisson | How many events in an interval? |
The Poisson stands out because it models counts in continuous time or space, not a fixed number of discrete trials.
Summary
| Property | Formula |
|---|---|
| PMF | |
| Mean | |
| Variance | |
| Key feature | Mean = Variance |
| Limit of | as , , |
| Sum property |
The Poisson is for rare events at a constant rate. Think: "how many times does something happen in a fixed window of time (or space)?"
Test your understanding
What's next
We've now covered the main discrete distributions. Next, we cross into the continuous world — where outcomes are real numbers and probabilities become areas under curves.