MHT-CET Maths · Probability Distribution

Conditional Probability, Independence and Bayes' Theorem

Restrict the sample space to compute P(A|B), chain events with the multiplication rule, exploit independence for 'at least one / exactly one' shortcuts, and reverse the conditioning with total probability and Bayes' theorem.

Why this matters

This is the densest subtopic in the chapter: 23 PYQs sit here (4 HARD, 15 MODERATE, 4 EASY). MHT-CET tests the whole conditional-probability chain — the definition P(A|B) = P(A∩B)/P(B), sequential draws without replacement, the independence identity P(A∩B) = P(A)P(B), the 1 − P(none) shortcut for 'the target is hit / the problem is solved', and Bayes' theorem for bag/box/disease posteriors. The recurring traps are all here too: confusing 'exactly one' with 'at least one', forgetting P(A'|B) = P(A') only when A and B are independent, and swapping priors with likelihoods in the Bayes ratio.

Concept 1 of 7

Conditional Probability — Restricting the Sample Space

Intuition

Once you are told that event B has happened, B becomes your new, smaller universe. The conditional probability P(A|B) asks: within that shrunken world of B-outcomes, what fraction also lie in A? You are simply re-measuring A's chance against B instead of against the whole sample space.

Definition

For events A and B with P(B)>0P(B) > 0, the conditional probability of A given B is

  • P(AB)=P(AB)P(B)P(A\mid B) = \dfrac{P(A\cap B)}{P(B)} — the share of B's probability that also lies in A.
  • Equivalently, rearranged, P(AB)=P(B)P(AB)=P(A)P(BA)P(A\cap B) = P(B)\,P(A\mid B) = P(A)\,P(B\mid A) — the multiplication rule.
  • When outcomes are equally likely, this reduces to counting: P(AB)=n(AB)n(B)P(A\mid B) = \dfrac{n(A\cap B)}{n(B)}.

Definition of conditional probability

P(AB)=P(AB)P(B),P(B)>0P(A\mid B) = \dfrac{P(A\cap B)}{P(B)},\qquad P(B) > 0
  • P(A\cap B)probability that both A and B occur
  • P(B)probability of the conditioning (given) event — the new universe
  • P(A\mid B)probability of A once B is known to have occurred

Diagram · P(A | B) restricts the world to B

SABA∩B

Once B is given, only the amber region counts — it's the new whole. P(A | B) is the slice of B that also lies in A: P(A | B) = P(A∩B) / P(B). Dividing by P(B) is exactly "rescale B to be the new 100%".

Worked example

A fair die is rolled. Given that the outcome is even, find the probability that it is greater than 3.
  1. Conditioning event B={2,4,6}B = \{2,4,6\}, so P(B)=36=12P(B) = \tfrac36 = \tfrac12.
  2. Event A={4,5,6}A = \{4,5,6\}; the overlap AB={4,6}A\cap B = \{4,6\}, so P(AB)=26=13P(A\cap B) = \tfrac26 = \tfrac13.
  3. Restrict: P(AB)=P(AB)P(B)=1/31/2=23P(A\mid B) = \dfrac{P(A\cap B)}{P(B)} = \dfrac{1/3}{1/2} = \dfrac{2}{3}.
  4. Check by counting inside B: of the three even outcomes {2,4,6}\{2,4,6\}, two exceed 3, giving 23\tfrac23.
Answer:P(AB)=23P(A\mid B) = \dfrac{2}{3}

P(A|B) and P(B|A) are not the same number

The condition goes in the denominator: P(AB)=P(AB)P(B)P(A\mid B) = \dfrac{P(A\cap B)}{P(B)} divides by P(B)P(B), while P(BA)=P(AB)P(A)P(B\mid A) = \dfrac{P(A\cap B)}{P(A)} divides by P(A)P(A). The numerator P(AB)P(A\cap B) is shared, but swapping which event you condition on gives a different answer unless P(A)=P(B)P(A)=P(B).

Divide by the GIVEN event's probability, not by 1

A conditional probability is measured against the reduced universe B, so P(AB)P(A\mid B) can be much larger than P(A)P(A). Forgetting to divide by P(B)P(B) — reporting just P(AB)P(A\cap B) — is the most common conditional-probability slip.

Concept 2 of 7

Multiplication Rule and Sequential Draws Without Replacement

Intuition

When events happen in sequence — draw a ticket, then another, then another — chain them with the multiplication rule, updating the pool after every draw. Each factor is the conditional probability given everything drawn so far. If the draws come from separate independent sources (one ball from each of several urns), the factors just multiply and no updating is needed.

Definition

The general multiplication rule for a chain of events:

  • P(E1E2E3)=P(E1)P(E2E1)P(E3E1E2)P(E_1\cap E_2\cap E_3) = P(E_1)\,P(E_2\mid E_1)\,P(E_3\mid E_1\cap E_2).
  • Without replacement: after each draw the counts shrink, so denominators drop by 1 and the relevant numerator drops by 1 as well. Drawing tickets one at a time is exactly this.
  • One item from each of several independent sources: the joint probability is just the product of each source's single-draw probability; add over all the ways a target composition can occur.

Chain rule for a sequence of dependent draws

P(E1E2E3)=P(E1)P(E2E1)P(E3E1E2)P(E_1\cap E_2\cap E_3) = P(E_1)\,P(E_2\mid E_1)\,P(E_3\mid E_1\cap E_2)

Visualization · total probability & Bayes tree

P(B₁)=0.6P(B₂)=0.4P(A|B₁)=0.2P(A|B₂)=0.5B₁B₂A: 0.12A: 0.2not Anot A
P(A) = 0.12 + 0.2 = 0.32P(B₁|A) = 0.12/0.32 = 0.375

Each leaf is a route product P(Bᵢ)·P(A|Bᵢ). Total probability adds the two leaves that end in A; Bayes' theorem divides one of those leaves by that total to flip the conditioning.

Worked example

A box has 4 red and 3 green tickets. Three tickets are drawn one at a time without replacement. Find the probability that they come out in the order red, green, red.
  1. First draw red: 47\dfrac{4}{7}.
  2. Given a red is gone, draw green: 36\dfrac{3}{6} (6 tickets left, 3 green).
  3. Given red and green gone, draw red: 35\dfrac{3}{5} (5 left, 3 red remain).
  4. Multiply the chain: 473635=36210=635\dfrac{4}{7}\cdot\dfrac{3}{6}\cdot\dfrac{3}{5} = \dfrac{36}{210} = \dfrac{6}{35}.
Answer:P(R, G, R)=635P(\text{R, G, R}) = \dfrac{6}{35}
Practice this conceptself-check · 4 quick reps

Try it yourself

A box contains 9 tickets numbered 1 to 9. Three tickets are drawn one at a time without replacement. Find the probability that they come out alternately as odd, even, odd.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    From 5 red and 3 blue balls, two are drawn without replacement. Probability both red?
  2. 2.
    Urn 1 has 2 white of 5; Urn 2 has 3 white of 5. Draw one from each — probability both white?
  3. 3.
    9 tickets 1–9, draw 3 without replacement. P(even, odd, even)?
  4. 4.
    In a chain of dependent draws, what does the third factor condition on?

From the bank · past-year question

Example 2Probability DistributionMODERATE
Three urns respectively contain 2 white and 3 black, 3 white and 2 black and 1 white and 4 black balls. If one ball is drawn from each urn, then the probability that the selection contains 1 black and 2 white balls is

[Q143 · 22 April Shift II · 2025]

Without replacement: shrink BOTH the numerator and the denominator

After drawing an odd ticket from 9, the next 'another odd' probability is 48\dfrac{4}{8} (one fewer odd, one fewer total) — not 58\dfrac{5}{8}. Freezing the count at its original value is the classic sequential-draw error.

Add over all favourable orderings for a composition

For '1 black and 2 white, one ball from each of three urns', the black can come from urn 1, 2, or 3 — so sum the three products P(B1W2W3)+P(W1B2W3)+P(W1W2B3)P(B_1W_2W_3)+P(W_1B_2W_3)+P(W_1W_2B_3). Computing only one ordering undercounts.

'Alternately O,E,O OR E,O,E' means add both patterns

Two disjoint arrangements satisfy the requirement, so compute each chain separately and add: P(O,E,O)+P(E,O,E)P(\text{O,E,O}) + P(\text{E,O,E}). Treating it as a single pattern halves the answer.

Concept 3 of 7

Computing P(A|B) by Restriction — Distributions, Counting and Composite Events

Intuition

Many MHT-CET conditional questions are just the definition applied carefully: build the conditioning event B, find the overlap A∩B, and divide. B may be a range of a random variable, a family of tickets sharing a property, or a birth-order event. The whole skill is identifying A∩B correctly and dividing by P(B), never by the full sample space.

Definition

Apply P(AB)=P(AB)P(B)P(A\mid B) = \dfrac{P(A\cap B)}{P(B)} to composite events:

  • Distribution table: P(aX<bXc)P(a\le X<b\mid X\le c) is the sum of the P(X) values in the numerator range that also satisfy XcX\le c, divided by P(Xc)P(X\le c).
  • 'At least one' condition: for family/coin problems, P(all girlsat least one girl)=P(all girls)P(at least one girl)P(\text{all girls}\mid \text{at least one girl}) = \dfrac{P(\text{all girls})}{P(\text{at least one girl})}, and 'at least one' is best found as 1P(none)1 - P(\text{none}).
  • Counting-based: P(AB)=n(AB)n(B)P(A\mid B) = \dfrac{n(A\cap B)}{n(B)} — list the outcomes in B, then count how many also lie in A.

Restriction form for composite conditioning

P(AB)=P(AB)P(B)=n(AB)n(B) (equally likely)P(A\mid B) = \dfrac{P(A\cap B)}{P(B)} = \dfrac{n(A\cap B)}{n(B)}\ \text{(equally likely)}

Worked example

A random variable X takes values 0,1,2,3 with P(X) equal to 2k,3k,4k,6k2k, 3k, 4k, 6k respectively. Find P(X2X1)P(X\ge 2\mid X\ge 1).
  1. Total probability is 1: 2k+3k+4k+6k=15k=12k+3k+4k+6k = 15k = 1, so k=115k = \tfrac{1}{15} (not needed as k cancels).
  2. Numerator P(X2X1)=P(X2)=P(2)+P(3)=4k+6k=10kP(X\ge 2\cap X\ge 1) = P(X\ge 2) = P(2)+P(3) = 4k+6k = 10k.
  3. Denominator P(X1)=3k+4k+6k=13kP(X\ge 1) = 3k+4k+6k = 13k.
  4. Divide: 10k13k=1013\dfrac{10k}{13k} = \dfrac{10}{13}.
Answer:P(X2X1)=1013P(X\ge 2\mid X\ge 1) = \dfrac{10}{13}
Practice this conceptself-check · 4 quick reps

Try it yourself

A family has 3 children. Find the probability that all three are girls, given that at least one of them is a girl.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    For X with P(X=1)=2k, P(X=2)=4k, P(X=0)=k, find P(1≤X≤2 | X≤2).
  2. 2.
    A die is rolled. P(prime | odd)?
  3. 3.
    'At least one girl' in 3 children — probability?
  4. 4.
    Given digit-sum 8 among tickets 00–49, how many tickets qualify?

From the bank · past-year question

Example 3Probability DistributionMODERATE
A random variable XX has the following probability distribution ------------------------------------------------------
X:X:
0 1 2 3 4 ----------- ------- -------- -------- -------- -------
P(X):P(X):
kk
2k2k
4k4k
2k2k
kk
------------------------------------------------------ then the value of P(1X<4X2)=P(1 \leqslant X< 4 \mid X\leqslant 2) =

[Q120 · 20 April Shift I · 2025]

The overlap A∩B is measured inside B, not over the whole space

For P(1X<4X2)P(1\le X<4\mid X\le 2) the numerator is only P(X=1)+P(X=2)P(X=1)+P(X=2) — the values that satisfy BOTH conditions (X=3X=3 is excluded by X2X\le 2). Summing the full range 1X<41\le X<4 in the numerator over-counts.

Compute 'at least one' as the complement

P(at least one girl)P(\text{at least one girl}) in 3 children is 1P(no girls)=118=781 - P(\text{no girls}) = 1 - \tfrac18 = \tfrac78, not 38\tfrac38 (that is exactly-one) and not 12\tfrac12. The at-least-one event is large; its complement 'none' is the single easy case.

Watch for a 'None of these' answer when your value is not listed

A carefully computed conditional probability may not appear among the four numeric options; MHT-CET occasionally makes the correct choice 'None of these'. Recount the conditioning set before assuming an arithmetic slip.

Concept 4 of 7

Independence and Event Algebra with Unions

Intuition

Two events are independent when knowing one occurred does not change the other's probability — formally P(A∩B) = P(A)P(B). This single identity powers a whole family of algebra problems: given a union probability and one event, solve for the other; combine complements; or evaluate P(A'|B), which collapses to P(A') when A and B are independent. Odds 'in favour a:b' convert to probability a/(a+b) first.

Definition

The independence identity and its consequences:

  • P(AB)=P(A)P(B)P(A\cap B) = P(A)\,P(B) (definition); then P(AB)=P(A)+P(B)P(A)P(B)P(A\cup B) = P(A)+P(B)-P(A)P(B).
  • If A, B are independent, so are the pairs (A,B)(A',B), (A,B)(A,B'), (A,B)(A',B'); hence P(AB)=P(A)P(A'\mid B) = P(A') and P(BA)=P(B)P(B'\mid A') = P(B').
  • Odds: 'odds in favour a:ba:b' means P=aa+bP = \dfrac{a}{a+b}; 'odds against a:ba:b' means P=ba+bP = \dfrac{b}{a+b}.
  • The complement-conditional sum P(AB)+P(BA)=P(AB)P(B)+P(AB)P(A)P(A'\mid B') + P(B'\mid A') = \dfrac{P(A'\cap B')}{P(B')} + \dfrac{P(A'\cap B')}{P(A')}, using P(AB)=1P(AB)P(A'\cap B') = 1 - P(A\cup B).

Independence and the union it produces

P(AB)=P(A)P(B),P(AB)=P(A)+P(B)P(A)P(B)P(A\cap B) = P(A)\,P(B),\qquad P(A\cup B) = P(A)+P(B)-P(A)\,P(B)
  • P(A)P(B)the product form — holds ONLY for independent A, B
  • P(A'\mid B)equals P(A') when A, B are independent

Diagram · mutually exclusive ≠ independent

ABMutually exclusiveP(A∩B) = 0ABIndependentP(A∩B) = P(A)·P(B)

Mutually exclusive events can't both happen, so they don't overlap and P(A∩B) = 0. Independent events do overlap — one happening doesn't change the other, so P(A∩B) = P(A)·P(B). Disjoint events with non-zero probability are therefore never independent.

Worked example

A and B are independent events with P(A)=13P(A) = \tfrac13 and P(AB)=35P(A\cup B) = \tfrac35. Find P(B)P(B).
  1. Independence: P(AB)=P(A)+P(B)P(A)P(B)P(A\cup B) = P(A) + P(B) - P(A)P(B).
  2. Substitute: 35=13+P(B)13P(B)=13+23P(B)\tfrac35 = \tfrac13 + P(B) - \tfrac13 P(B) = \tfrac13 + \tfrac23 P(B).
  3. So 23P(B)=3513=9515=415\tfrac23 P(B) = \tfrac35 - \tfrac13 = \tfrac{9-5}{15} = \tfrac{4}{15}.
  4. Hence P(B)=41532=25P(B) = \tfrac{4}{15}\cdot\tfrac32 = \tfrac{2}{5}.
Answer:P(B)=25P(B) = \dfrac{2}{5}
Practice this conceptself-check · 5 quick reps

Try it yourself

A and B are independent events with P(A)=310P(A) = \tfrac{3}{10} and P(B)=25P(B) = \tfrac{2}{5}. Find P(AB)P(A'\cup B).

Practice — Level 1 (5 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    A, B independent, P(A)=1/4, P(B)=2/5. Find P(A∩B).
  2. 2.
    Odds in favour of an event are 2:5. Find its probability.
  3. 3.
    A, B independent. Simplify P(A'|B).
  4. 4.
    If P(A∪B)=1/3, find P(A'∩B').
  5. 5.
    P(A')=0.75, so P(A)=?

From the bank · past-year question

Example 4Probability DistributionMODERATE
Let A and B are independent events with P(B)=25P(B) = \frac{2}{5}, P(AB)=1120P(A \cup B) = \frac{11}{20}, then P(AB)P(A'|B) is root of the equation

[Shift || · 2025]

P(A'|B) = P(A') needs INDEPENDENCE

For independent A and B, P(AB)=P(A)P(A'\mid B) = P(A') — so if P(A)=14P(A) = \tfrac14 then P(AB)=34P(A'\mid B) = \tfrac34. This collapse is FALSE for dependent events; there you must use P(AB)=P(AB)P(B)P(A'\mid B) = \dfrac{P(A'\cap B)}{P(B)}.

The union formula loses its cross-term only when independent

P(AB)=P(A)+P(B)P(AB)P(A\cup B) = P(A) + P(B) - P(A\cap B) always holds; replacing P(AB)P(A\cap B) by P(A)P(B)P(A)P(B) is legal ONLY under independence. Do not use the product form for mutually exclusive or unspecified events.

Convert odds to probability before plugging in

'Odds in favour a:ba:b' is P=aa+bP = \dfrac{a}{a+b}, not ab\dfrac{a}{b}. Odds 2:52:5 give P=27P = \tfrac27; using 25\tfrac25 is the standard odds-vs-fraction mistake in ship/selection problems.

Independent is not the same as mutually exclusive

Mutually exclusive means P(AB)=0P(A\cap B) = 0; independent means P(AB)=P(A)P(B)P(A\cap B) = P(A)P(B). Two events with nonzero probabilities cannot be both — if they never co-occur, knowing one occurred forces the other out, which is maximal dependence.

Concept 5 of 7

At Least One and Exactly One for Independent Trials

Intuition

'The problem is solved' or 'the target is hit' means at least one of several independent attempts succeeds — always compute this as 1 minus the probability that they ALL fail. 'Exactly one is selected' is different: sum the products where one succeeds and the rest fail. Getting these two apart is the single most-tested distinction in this subtopic.

Definition

For independent events A1,,AnA_1,\dots,A_n with success probabilities pip_i:

  • At least one succeeds: P(at least one)=1i(1pi)P(\text{at least one}) = 1 - \prod_i (1-p_i) — one minus 'all fail'.
  • Exactly one (of two): P=P(A)P(B)+P(A)P(B)P = P(A)P(B') + P(A')P(B).
  • A composite pattern like 'hit by P or Q but not R' is a sum of the independent triple-products matching that description, e.g. PQR+PQR+PQRP Q' R' + P' Q R' + P Q R' (all with R failing).

At-least-one and exactly-one

P(at least one)=1i(1pi),P(exactly one of A,B)=P(A)P(B)+P(A)P(B)P(\text{at least one}) = 1 - \prod_{i}(1-p_i),\qquad P(\text{exactly one of }A,B) = P(A)P(B') + P(A')P(B)

Diagram · two coin tosses → 2² = 4 outcomes

H · ½T · ½½½½½HH ¼HT ¼TH ¼TT ¼

Each toss branches into H or T with probability ½, and the branches multiply: every leaf is ½ × ½ = ¼. Tossing n coins gives 2ⁿ equally likely outcomes — so "at least one head" is easiest via the complement, 1 − P(all tails).

Worked example

Three shooters hit a target independently with probabilities 13,14,15\tfrac13, \tfrac14, \tfrac15. Find the probability that the target is hit.
  1. 'Target is hit' means at least one shooter hits — use the complement.
  2. P(all miss)=(113)(114)(115)=233445P(\text{all miss}) = (1-\tfrac13)(1-\tfrac14)(1-\tfrac15) = \tfrac23\cdot\tfrac34\cdot\tfrac45.
  3. =2460=25= \dfrac{24}{60} = \dfrac25.
  4. P(hit)=125=35P(\text{hit}) = 1 - \tfrac25 = \tfrac35.
Answer:P(target hit)=35P(\text{target hit}) = \dfrac{3}{5}
Practice this conceptself-check · 4 quick reps

Try it yourself

Two candidates A and B are selected independently with probabilities 25\tfrac25 and 47\tfrac47. Find the probability that exactly one of them is selected.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Four people hit independently with p = 1/2, 1/3, 1/4, 1/5. P(target hit)?
  2. 2.
    A, B, C solve a problem independently with p = 1/2, 1/3, 1/4. P(problem solved)?
  3. 3.
    Husband selected 1/7, wife 1/5, independent. P(only one selected)?
  4. 4.
    State the shortcut for 'at least one of many independent successes'.

From the bank · past-year question

Example 5Probability DistributionEASY
A problem in statistics is given to three students A, B and C. Their probabilities of solving the problem are 12,13\frac{1}{2},\frac{1}{3} and 14\frac{1}{4} respectively. If all of them try independently, then the probability that the problem is solved is

[Q142 · 9th May Shift 2 · 2024]

'At least one' is 1 − P(none), NOT the sum of individual probabilities

Adding p1+p2+p3p_1 + p_2 + p_3 over-counts overlaps and can exceed 1. For 'the target is hit / the problem is solved', always use 1(1pi)1 - \prod(1-p_i). E.g. shooters 12,13,14\tfrac12,\tfrac13,\tfrac14 give 114=341 - \tfrac14 = \tfrac34, not 12+13+14\tfrac12+\tfrac13+\tfrac14.

Exactly one ≠ at least one

'One of them is selected' asking for EXACTLY one is P(A)P(B)+P(A)P(B)P(A)P(B') + P(A')P(B). Using P(AB)=P(A)+P(B)P(A)P(B)P(A\cup B) = P(A)+P(B)-P(A)P(B) gives AT LEAST one — a bigger number. Read whether the problem excludes the both-succeed case.

Complement each event correctly inside a composite pattern

'Hit by P or Q but not R' forces R to FAIL in every favourable term, so each product carries P(R)=1P(R)P(R') = 1 - P(R). Sum PQR+PQR+PQRPQ'R' + P'QR' + PQR'. Forgetting to put RR' in each term, or dropping a valid case, is the usual error.

Concept 6 of 7

Total Probability Theorem

Intuition

When an event can happen through several mutually exclusive, exhaustive 'routes' (which bag, which first draw, which machine), its overall probability is the weighted average of the route-conditional probabilities — each weighted by how likely that route is. Draw a two-stage tree: branch to the routes, then to the event, and add the branch products.

Definition

If H1,,HnH_1,\dots,H_n are mutually exclusive and exhaustive (they partition the sample space) with P(Hi)>0P(H_i) > 0, then for any event E:

  • P(E)=i=1nP(Hi)P(EHi)P(E) = \displaystyle\sum_{i=1}^{n} P(H_i)\,P(E\mid H_i).
  • Each term is one route's prior times its conditional; the routes' priors sum to 1.
  • Polya-urn / draw-then-add problems fit here: the first draw's colour defines the partition, and the second-draw probability is conditional on the updated bag.

Total probability theorem

P(E)=i=1nP(Hi)P(EHi)P(E) = \sum_{i=1}^{n} P(H_i)\,P(E\mid H_i)
  • H_ithe partition (mutually exclusive, exhaustive routes)
  • P(H_i)prior probability of route i
  • P(E\mid H_i)probability of E along route i

Visualization · total probability & Bayes tree

P(B₁)=0.6P(B₂)=0.4P(A|B₁)=0.2P(A|B₂)=0.5B₁B₂A: 0.12A: 0.2not Anot A
P(A) = 0.12 + 0.2 = 0.32P(B₁|A) = 0.12/0.32 = 0.375

Each leaf is a route product P(Bᵢ)·P(A|Bᵢ). Total probability adds the two leaves that end in A; Bayes' theorem divides one of those leaves by that total to flip the conditioning.

Worked example

Bag I has 2 red and 3 white balls; Bag II has 4 red and 1 white. A bag is chosen at random and one ball is drawn. Find the probability the ball is red.
  1. Routes: P(Bag I)=P(Bag II)=12P(\text{Bag I}) = P(\text{Bag II}) = \tfrac12.
  2. Conditionals: P(RI)=25P(R\mid \text{I}) = \tfrac25, P(RII)=45P(R\mid \text{II}) = \tfrac45.
  3. Total probability: P(R)=1225+1245=210+410=610P(R) = \tfrac12\cdot\tfrac25 + \tfrac12\cdot\tfrac45 = \tfrac{2}{10} + \tfrac{4}{10} = \tfrac{6}{10}.
  4. Simplify: P(R)=35P(R) = \tfrac35.
Answer:P(R)=35P(R) = \dfrac{3}{5}
Practice this conceptself-check · 4 quick reps

Try it yourself

A bag has 4 red and 6 black balls. A ball is drawn, its colour noted, and it plus 3 more of the same colour are returned. A second ball is then drawn. Find the probability the second ball is red.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Two machines make 60% and 40% of items; defect rates 2% and 5%. P(item defective)?
  2. 2.
    What two properties must the routes H_i have for total probability?
  3. 3.
    Bag chosen 1/2 each; P(R|I)=2/5, P(R|II)=3/8. P(R)?
  4. 4.
    Do the priors P(H_i) sum to 1?

From the bank · past-year question

Example 6Probability DistributionHARD
A bag contains 4 Red and 6 Black balls. A ball is drawn at random from the bag, its colour is observed and this ball along with 3 additional balls of the same colour are returned to the bag. If now a ball is drawn at random from the bag, then the probability that this drawn ball is red is

[Q119 · 2nd May Shift 1 · 2023]

The routes must partition the space — exclusive AND exhaustive

Total probability P(E)=P(Hi)P(EHi)P(E) = \sum P(H_i)P(E\mid H_i) is valid only when the HiH_i are mutually exclusive (no overlap) and exhaustive (cover everything). If your routes leave a gap or overlap, the weighted sum is wrong.

In draw-then-add problems, update the bag before the conditional

After returning the drawn ball plus 3 of its colour, the bag size grows to 10+3=1310 + 3 = 13 and the matching colour count grows by 4. Use P(R2R1)=713P(R_2\mid R_1) = \tfrac{7}{13}, not the original 410\tfrac{4}{10}.

Concept 7 of 7

Bayes' Theorem — Reversing the Conditioning

Intuition

Bayes' theorem answers the reverse question: given that the effect E was observed, which cause H_i was most likely responsible? It reweights each route's prior by how strongly that route predicts the observed evidence (its likelihood), then normalises by the total probability of the evidence. It is total probability run backwards.

Definition

For a partition H1,,HnH_1,\dots,H_n and observed event E with P(E)>0P(E) > 0:

  • P(HkE)=P(Hk)P(EHk)iP(Hi)P(EHi)P(H_k\mid E) = \dfrac{P(H_k)\,P(E\mid H_k)}{\displaystyle\sum_{i} P(H_i)\,P(E\mid H_i)}.
  • Numerator = the chosen route's prior × likelihood; denominator = total probability of E (the sum of ALL routes' prior × likelihood).
  • With equal priors P(Hi)=1nP(H_i) = \tfrac1n, the priors cancel and the posterior is just P(EHk)iP(EHi)\dfrac{P(E\mid H_k)}{\sum_i P(E\mid H_i)} — a ratio of likelihoods.

Bayes' theorem (posterior from priors and likelihoods)

P(HkE)=P(Hk)P(EHk)iP(Hi)P(EHi)P(H_k\mid E) = \dfrac{P(H_k)\,P(E\mid H_k)}{\sum_{i} P(H_i)\,P(E\mid H_i)}
  • P(H_k)prior — probability of cause k before the evidence
  • P(E\mid H_k)likelihood — how well cause k predicts the evidence E
  • P(H_k\mid E)posterior — probability of cause k after seeing E

Visualization · total probability & Bayes tree

P(B₁)=0.6P(B₂)=0.4P(A|B₁)=0.2P(A|B₂)=0.5B₁B₂A: 0.12A: 0.2not Anot A
P(A) = 0.12 + 0.2 = 0.32P(B₁|A) = 0.12/0.32 = 0.375

Each leaf is a route product P(Bᵢ)·P(A|Bᵢ). Total probability adds the two leaves that end in A; Bayes' theorem divides one of those leaves by that total to flip the conditioning.

Worked example

Bag I has 3 red and 1 white ball; Bag II has 1 red and 3 white. A bag is chosen at random and a red ball is drawn. Find the probability it came from Bag I.
  1. Priors: P(I)=P(II)=12P(\text{I}) = P(\text{II}) = \tfrac12. Likelihoods: P(RI)=34P(R\mid \text{I}) = \tfrac34, P(RII)=14P(R\mid \text{II}) = \tfrac14.
  2. Total probability of red: P(R)=1234+1214=38+18=48=12P(R) = \tfrac12\cdot\tfrac34 + \tfrac12\cdot\tfrac14 = \tfrac{3}{8} + \tfrac18 = \tfrac48 = \tfrac12.
  3. Bayes: P(IR)=123412=3/81/2P(\text{I}\mid R) = \dfrac{\tfrac12\cdot\tfrac34}{\tfrac12} = \dfrac{3/8}{1/2}.
  4. =34= \dfrac34.
Answer:P(Bag IR)=34P(\text{Bag I}\mid R) = \dfrac{3}{4}
Practice this conceptself-check · 4 quick reps

Try it yourself

For k=1,2,3k = 1,2,3, box BkB_k contains k red and (k+1)(k+1) white balls, with P(B1)=12, P(B2)=13, P(B3)=16P(B_1) = \tfrac12,\ P(B_2) = \tfrac13,\ P(B_3) = \tfrac16. A box is chosen and a red ball drawn. Find the probability it came from B2B_2.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Equal priors on 3 diseases; test-positive likelihoods 0.7, 0.5, 0.8. P(disease 2 | positive)?
  2. 2.
    In Bayes' theorem, what sits in the denominator?
  3. 3.
    Bag I: 3R 2G, Bag II: 5R 3G, chosen equally; a green is drawn. P(from Bag I)?
  4. 4.
    When do priors drop out of the Bayes ratio?

From the bank · past-year question

Example 7Probability DistributionHARD
For k=1,2,3k = 1,2,3 the box BkB_{k} contains k red balls and (k+1)(k + 1) white balls. Let P(B1)=12,P(B2)=13P\left( B_{1} \right)=\frac{1}{2},P\left( B_{2} \right)=\frac{1}{3} and P(B3)=16.AP\left( B_{3} \right)=\frac{1}{6}.A box is selected at random and a ball is drawn from it. If a red ball is drawn from it, then the probability that it comes from box B2B_{2} is

[Q144 · 20 April Shift II · 2025]

Numerator is ONE route; denominator is ALL routes

P(HkE)P(H_k\mid E) puts only the chosen route P(Hk)P(EHk)P(H_k)P(E\mid H_k) on top, but the FULL total probability iP(Hi)P(EHi)\sum_i P(H_i)P(E\mid H_i) on the bottom. Using the same single term top and bottom gives 1 — a classic Bayes setup error.

Do not swap priors and likelihoods

The prior P(Hi)P(H_i) (which box/bag was chosen) multiplies the likelihood P(EHi)P(E\mid H_i) (chance of the observed ball given that box). Interchanging them — using P(HiE)P(H_i\mid E) where a likelihood belongs — corrupts every term.

Equal priors cancel — reduce to a likelihood ratio

With equal priors P(Hi)=1nP(H_i) = \tfrac1n, the 1n\tfrac1n factors out of every term and the posterior becomes P(EHk)iP(EHi)\dfrac{P(E\mid H_k)}{\sum_i P(E\mid H_i)}. For disease tests with equal prior probability, this shortcut avoids carrying 13\tfrac13 through the arithmetic.

Summary — formulas & gotchas at a glance

A revision cheat-sheet for the formulas and gotchas above. Click any concept name to jump back to its full explanation.

Formulas (7)

  • Conditional Probability — Restricting the Sample Space

    Definition of conditional probability

    P(AB)=P(AB)P(B),P(B)>0P(A\mid B) = \dfrac{P(A\cap B)}{P(B)},\qquad P(B) > 0
  • Multiplication Rule and Sequential Draws Without Replacement

    Chain rule for a sequence of dependent draws

    P(E1E2E3)=P(E1)P(E2E1)P(E3E1E2)P(E_1\cap E_2\cap E_3) = P(E_1)\,P(E_2\mid E_1)\,P(E_3\mid E_1\cap E_2)
  • Computing P(A|B) by Restriction — Distributions, Counting and Composite Events

    Restriction form for composite conditioning

    P(AB)=P(AB)P(B)=n(AB)n(B) (equally likely)P(A\mid B) = \dfrac{P(A\cap B)}{P(B)} = \dfrac{n(A\cap B)}{n(B)}\ \text{(equally likely)}
  • Independence and Event Algebra with Unions

    Independence and the union it produces

    P(AB)=P(A)P(B),P(AB)=P(A)+P(B)P(A)P(B)P(A\cap B) = P(A)\,P(B),\qquad P(A\cup B) = P(A)+P(B)-P(A)\,P(B)
  • At Least One and Exactly One for Independent Trials

    At-least-one and exactly-one

    P(at least one)=1i(1pi),P(exactly one of A,B)=P(A)P(B)+P(A)P(B)P(\text{at least one}) = 1 - \prod_{i}(1-p_i),\qquad P(\text{exactly one of }A,B) = P(A)P(B') + P(A')P(B)
  • Total Probability Theorem

    Total probability theorem

    P(E)=i=1nP(Hi)P(EHi)P(E) = \sum_{i=1}^{n} P(H_i)\,P(E\mid H_i)
  • Bayes' Theorem — Reversing the Conditioning

    Bayes' theorem (posterior from priors and likelihoods)

    P(HkE)=P(Hk)P(EHk)iP(Hi)P(EHi)P(H_k\mid E) = \dfrac{P(H_k)\,P(E\mid H_k)}{\sum_{i} P(H_i)\,P(E\mid H_i)}

Watch out for (20)

Mastery check — 5 interleaved questions

Try each one before clicking. Questions are interleaved across the concepts above, not grouped — interleaving sharpens transfer.

Example 1Probability DistributionMODERATE
A box contains 9 tickets numbered 1 to 9 both inclusive. If 3 tickets are drawn from the box one at a time, then the probability that they are alternatively either {odd, even, odd} or {even, odd, even} is

[Q107 · 19 April Shift I · 2025]

Example 2Probability DistributionMODERATE
Out of 50 tickets numbered 00, 01, 02, \ldots, 49, one ticket is drawn randomly. The probability of the ticket having the product of its digits 7, given that the sum of the digits is 8, is

[Q112 · May Shift 1 · 2021]

Example 3Probability DistributionMODERATE
A and B are independent events with P(A)=310P(A) = \frac{3}{10}, P(B)=25P(B) = \frac{2}{5}, then P(AB)P(A' \cup B) has the value

[Q108 · 10th May Shift 2 · 2023]

Example 4Probability DistributionEASY
A man and his wife appear for an interview for two posts. The probability of the husband's selection is 17\frac{1}{7} and that of the wife's selection is 15\frac{1}{5}. If they appear for the interview independently, then the probability that only one of them is selected, is

[Q140 · 4th May Shift 2 · 2023]

Example 5Probability DistributionHARD
The probability, that a year selected at random will have 53 Mondays, is

[Q127 · 15th May Shift 1 · 2023]

Drill every past-year question on this subtopic

26 questions from the bank — paginated, with cart and Word-export support.

Related notes