ABA∩BS
P(Aᶜ) = 1 − P(A)
P(A∪B) = P(A) + P(B) − P(A∩B)

What can probability do?

What do you think?
You roll a fair die. A = {1, 2, 3, 4} and B = {3, 4, 5, 6}. What is P(A or B)?

Naive probability told us what probability is: P(A)=A/SP(A) = |A|/|S| when outcomes are equally likely. But it didn't tell us how probabilities combine. What's P(not A)P(\text{not } A)? What's P(A or B)P(A \text{ or } B)?

Three axioms answer every such question.

The three axioms

In 1933, Andrey Kolmogorov wrote down three rules. Every fact about probability follows from these.

Kolmogorov's axioms

For a sample space SS and any events A,BSA, B \subseteq S:

  1. Non-negativity: P(A)0P(A) \geq 0
  2. Normalization: P(S)=1P(S) = 1
  3. Additivity: If AA and BB are disjoint (AB=A \cap B = \emptyset), then P(AB)=P(A)+P(B)P(A \cup B) = P(A) + P(B)

That's it. Three rules. Adjust the die weights below and watch all three axioms respond in real time.

ABA∩BS
P(Aᶜ) = 1 − P(A)
P(A∪B) = P(A) + P(B) − P(A∩B)
Kolmogorov's Axioms in Action
0.167
0.167
0.167
0.167
0.167
0.167
Probability distribution
0.17
0.17
0.17
0.17
0.17
0.17
Axiom 1:P(A) ≥ 0 for every event A
Axiom 2:P(S) = 1 — sum = 1.000
Axiom 3:Disjoint events add: each outcome is a separate “slice”
What do you think?
Why does axiom 3 require A and B to be DISJOINT? What goes wrong if they overlap?

Set notation for events

Before using the axioms, we need vocabulary for combining events. Events are sets, so set operations apply.

Set operations on events
  • Union ABA \cup B: outcomes in A or B (or both) — "A or B happens"
  • Intersection ABA \cap B: outcomes in both A and B — "A and B both happen"
  • Complement AcA^c: outcomes not in A — "A does not happen"
  • Disjoint (mutually exclusive): AB=A \cap B = \emptyset — A and B cannot both happen

Build each operation and see the result highlighted:

Set Operation Builder
A = even faces
123456
B = low faces
123456
A ∪ B = outcomes in A or B (or both)
123456
P(A ∪ B) = 5/6 = 0.833
Die roll: A = {1,2,3}, B = {2,3,4}. List A ∩ B. (e.g. {1,2})
1/3

The complement rule

The first payoff from the axioms. Since AA and AcA^c are disjoint and together cover all of SS:

Deriving the complement rule
AAc=SA \cup A^c = S
Every outcome is either in A or not in A. Together they cover everything.
Step 1 of 6
Complement rule

P(Ac)=1P(A)P(A^c) = 1 - P(A)

The probability that AA does not happen equals 11 minus the probability that it does.

Roll dice and watch P(A)+P(Ac)=1P(A) + P(A^c) = 1 emerge empirically:

Complement Rule Simulator
A = {even face} = {2, 4, 6}. Aᶜ = {odd face} = {1, 3, 5}.
1
2
3
4
5
6
A (even)Aᶜ (odd)
P̂(A)
P̂(Aᶜ)
P̂(A) + P̂(Aᶜ)
What do you think?
The probability of rain tomorrow is 0.3. What's the probability of no rain?
decimal, e.g. 0.7

The complement rule creates a powerful problem-solving strategy: when "at least one" is hard to calculate directly, compute "none" and subtract from 1. You'll see this trick repeatedly in the birthday paradox and quality control lessons.

The addition rule

What about P(AB)P(A \cup B) when A and B overlap?

What do you think?
Die roll: A = {2,4,6} (even), B = {1,2,3} (low). P(A) = 3/6, P(B) = 3/6. What is P(A ∪ B)?

The fix: subtract the overlap.

The addition rule (inclusion-exclusion)
AB=(AB)(AB)(BA)A \cup B = (A \setminus B) \cup (A \cap B) \cup (B \setminus A)
Split the union into three disjoint pieces: A only, both, B only.
Step 1 of 4
Addition rule (inclusion-exclusion)

P(AB)=P(A)+P(B)P(AB)P(A \cup B) = P(A) + P(B) - P(A \cap B)

When AB=A \cap B = \emptyset (disjoint): P(AB)=P(A)+P(B)P(A \cup B) = P(A) + P(B) — which is just axiom 3.

Drag the sliders to adjust the Venn diagram and watch the formulas update. Try making the events disjoint (set "A ∩ B" to 0) to see the special case.

Addition Rule Explorer
A only15Both10B only25Neither: 50ABS (total: 100)
A only15
B only25
A ∩ B10
Neither50
Complement: P(Aᶜ) = 1 − P(A) = 1 − 0.250 = 0.750
Addition: P(A∪B) = P(A) + P(B) − P(A∩B) = 0.250 + 0.3500.100 = 0.500
Intersection: P(A∩B) = 0.100
P(A) = 0.4, P(B) = 0.5, P(A ∩ B) = 0.2. What is P(A ∪ B)? (decimal, e.g. 0.7)
P(A) = 0.3, P(B) = 0.6, A and B are disjoint. What is P(A ∪ B)? (decimal, e.g. 0.9)
P(A) = 0.7, P(B) = 0.5, P(A ∪ B) = 0.9. What is P(A ∩ B)? (decimal, e.g. 0.3)

Consequences

These two rules — complement and addition — cascade into everything else.

What do you think?
Can you prove that P(∅) = 0 using the axioms?

Two more quick consequences:

Monotonicity: If ABA \subseteq B, then P(A)P(B)P(A) \leq P(B). (B has everything A has, plus maybe more.)

Bounds: For any event AA: 0P(A)10 \leq P(A) \leq 1.

If A ⊆ B, then B = A ∪ (B\A) with A and B\A disjoint. Use additivity to show P(A) ≤ P(B).
1/2

Test your understanding

P(A) = 0.6. What is P(Aᶜ)? (decimal)
P(A) = 0.5, P(B) = 0.4, P(A ∩ B) = 0.1. What is P(A ∪ B)? (decimal)
True or False: If P(A ∪ B) = P(A) + P(B), then A and B must be disjoint. (true or false)
Die roll: A = {1,2}, B = {3,4}, C = {5,6}. What is P(A) + P(B) + P(C)? (decimal or fraction)
A deck has 52 cards. P(red) = 26/52 = 1/2. P(face card) = 12/52. P(red face card) = 6/52. What is P(red or face card)? (decimal to 3 places)

What's next

We have the rules for combining probabilities. The next lesson gives us the tools to count outcomes fast: the multiplication rule turns "how many outfits?" into a simple product.