MHT-CET Maths · Probability Distribution

Discrete Random Variables, PMF and CDF

A random variable assigns a number to each outcome; its probability mass function lists P(X=x) for every value, obeys 0 ≤ P ≤ 1 and ΣP = 1, and its cumulative distribution function F(x) = P(X ≤ x) accumulates those probabilities.

Why this matters

This is the technique-richest subtopic of the chapter: 29 PYQs (3 EASY, 22 MODERATE, 4 HARD). The bank tests four separate skills that all begin from ΣP = 1 — solving a linear-k table, a quadratic-in-k table (the 6k²+5k−1 and 10k²+9k−1 factorings recur almost every year), an exponential pmf, and an infinite arithmetico-geometric pmf — plus building a distribution from a coin/card/draw experiment, reading a CDF, and normalising a continuous pdf. Expectation and variance are taught separately; here the whole game is finding the constant, reading a range probability, and constructing the table correctly.

Concept 1 of 9

Discrete Random Variable and Its Probability Mass Function

Intuition

A random variable X is a rule that turns each outcome of an experiment into a number (heads count, number drawn, tosses needed). It is DISCRETE when it takes isolated values — 0, 1, 2, … The probability mass function (pmf) is simply the table that lists P(X = x) against each value x.

Definition

A probability mass function P(X=x)P(X=x) of a discrete random variable must satisfy TWO axioms:

  • Each probability is valid: 0P(X=xi)10 \le P(X=x_i) \le 1 for every value xix_i.
  • The total mass is one: iP(X=xi)=1\displaystyle\sum_i P(X=x_i) = 1 — summed over ALL values the variable can take.

These two rules are the engine of the whole subtopic: every 'find the constant' question is just ΣP = 1 solved for the unknown, and every 'is this a valid distribution?' check is these two axioms.

The two pmf axioms

0P(X=xi)1,iP(X=xi)=10 \le P(X=x_i) \le 1, \qquad \sum_{i} P(X=x_i) = 1
  • Xthe discrete random variable
  • x_ieach value X can take
  • P(X=x_i)the probability mass at that value

Worked example

Is P(X=x)=x6P(X=x) = \dfrac{x}{6} for x=1,2,3x = 1,2,3 a valid pmf?
  1. Each value: P(1)=16, P(2)=26, P(3)=36P(1)=\tfrac16,\ P(2)=\tfrac26,\ P(3)=\tfrac36 — all lie in [0,1][0,1]. ✓
  2. Sum: 16+26+36=66=1\tfrac16+\tfrac26+\tfrac36 = \tfrac66 = 1. ✓
  3. Both axioms hold, so it is a valid pmf.
Answer:Yes — all probabilities are in [0,1][0,1] and they sum to 1.

A pmf must sum to exactly 1, over ALL values

The single most common error is summing over only some of the listed values (forgetting the last row, or a P=0P=0 row). Every value the variable can take contributes to P(X=x)=1\sum P(X=x)=1. If a row shows P=0P=0 it still belongs in the table — it just contributes nothing to the sum.

Probabilities can never exceed 1 or go negative

When you solve P=1\sum P = 1 for a constant and get two roots (say from a quadratic), reject any root that makes some P(X=x)P(X=x) negative or bigger than 1. Only the root keeping every entry in [0,1][0,1] is admissible.

Concept 2 of 9

Finding the Constant k from a Linear Probability Table

Intuition

The most direct 'find k' shape: every entry is a whole-number multiple of k (or a simple piecewise rule in k). Add them, set the total to 1, and one division gives k. Once k is known, any range probability is just adding the right cells.

Definition

When the pmf entries are linear in kk (e.g. 2k,k,2k,4k,k2k, k, 2k, 4k, k, or a piecewise rule P=kxP=kx/P=k(5x)P=k(5-x)):

  • Sum all entries and set the total equal to 1: P(X=x)=1\sum P(X=x)=1.
  • Solve the resulting LINEAR equation for kk — a single step.
  • Substitute back to read off any required probability or range.

If a fixed number appears (e.g. P(0)=0.1P(0)=0.1 with the rest in kk), include it in the sum: 0.1+(terms in k)=10.1 + (\text{terms in }k) = 1.

Linear normalisation

xP(X=x)=1    (multiple of k)=1    k=1that multiple\sum_x P(X=x) = 1 \;\Longrightarrow\; (\text{multiple of } k) = 1 \;\Longrightarrow\; k = \frac{1}{\text{that multiple}}

Worked example

For the distribution P(X=x)P(X=x): x=1,2,3,4x=1,2,3,4 with P=k,3k,3k,kP = k, 3k, 3k, k, find kk and P(X2)P(X \le 2).
  1. Sum: k+3k+3k+k=8kk + 3k + 3k + k = 8k.
  2. Set 8k=1k=188k = 1 \Rightarrow k = \tfrac18.
  3. P(X2)=P(1)+P(2)=k+3k=4k=48=12P(X\le 2) = P(1)+P(2) = k + 3k = 4k = \tfrac48 = \tfrac12.
Answer:k=18,P(X2)=12k = \tfrac18,\quad P(X\le 2) = \tfrac12
Practice this conceptself-check · 4 quick reps

Try it yourself

A pmf has P(0)=0.1P(0)=0.1, P(1)=kP(1)=k, P(2)=2kP(2)=2k, P(3)=2kP(3)=2k, P(4)=kP(4)=k. Find the probability of at least 2.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Entries 2k,k,2k,4k,k2k, k, 2k, 4k, k sum to 1. Find kk.
  2. 2.
    Entries k,2k,3k,4k,4k,3k,2k,k,kk, 2k, 3k, 4k, 4k, 3k, 2k, k, k sum to 1. Find kk.
  3. 3.
    For P=k,3k,3k,kP = k, 3k, 3k, k on x=1,2,3,4x=1,2,3,4, find P(X3)P(X\ge 3).
  4. 4.
    If P(0)=0.1P(0)=0.1 and remaining terms total 6k6k, find kk.

From the bank · past-year question

Example 2Probability DistributionMODERATE
A random variable X has the following probability distribution. Then value of k is___ and P(3<x6)P(3<x\leq6) has the value. \begin{array}{|c|c|c|c|c|c|c|c|c|c|}\hline X=x & 0 & 1 & 2 & 3 & 4 & 5 & 6 & 7 & 8\\ \hline P(x) & k & 2k & 3k & 4k & 4k & 3k & 2k & k & k\\ \hline\end{array}

[Q109 · 3rd May 2nd Shift · 2023]

Include every row — even a fixed number — in ΣP = 1

In P(0)=0.1, P(1)=k,P(0)=0.1,\ P(1)=k,\dots the fixed 0.10.1 is part of the total: 0.1+6k=10.1 + 6k = 1, not 6k=16k = 1. Omitting the constant term gives k=16k = \tfrac16 instead of the correct 0.150.15.

Match the range operator exactly: strict vs inclusive

P(3<X6)P(3 < X \le 6) means P(4)+P(5)+P(6)P(4)+P(5)+P(6) — it EXCLUDES x=3x=3 and INCLUDES x=6x=6. Reading it as P(3)+P(4)+P(5)+P(6)P(3)+P(4)+P(5)+P(6) or dropping x=6x=6 is the classic off-by-one range slip.

Concept 3 of 9

Reading a Range Probability from the pmf Table

Intuition

Once the pmf is known, a range probability is a sum of cells. The only skill is decoding the inequality correctly — which endpoints are included — and using the complement P(Xa)=1P(X<a)P(X\ge a)=1-P(X<a) when it is shorter.

Definition

Translate the inequality into exactly which values to add:

  • P(X<a)P(X < a): all values strictly below aa.
  • P(aX<b)P(a \le X < b): from aa up to but NOT including bb.
  • P(Xa)=1P(X<a)P(X \ge a) = 1 - P(X < a) — use the complement to avoid adding a long tail.
  • P(X>a)=1P(Xa)P(X > a) = 1 - P(X \le a).

The complement rule is the workhorse whenever the 'up to' side has fewer cells than the 'from' side.

Complement for a tail probability

P(Xa)=1P(X<a)=1x<aP(X=x)P(X \ge a) = 1 - P(X < a) = 1 - \sum_{x < a} P(X=x)

Worked example

A pmf on x=0,1,2,3,4,5x=0,1,2,3,4,5 is P=0.05,0.15,0.30,0.25,0.15,0.10P = 0.05, 0.15, 0.30, 0.25, 0.15, 0.10. Find P(2X<5)P(2 \le X < 5).
  1. 2X<52 \le X < 5 means the values 2,3,42, 3, 4 — include 22, exclude 55.
  2. P(2X<5)=P(2)+P(3)+P(4)=0.30+0.25+0.15=0.70P(2\le X<5) = P(2)+P(3)+P(4) = 0.30 + 0.25 + 0.15 = 0.70.
  3. Complement check: 1P(0)P(1)P(5)=10.050.150.10=0.701 - P(0) - P(1) - P(5) = 1 - 0.05 - 0.15 - 0.10 = 0.70. ✓
Answer:P(2X<5)=0.70P(2 \le X < 5) = 0.70
Practice this conceptself-check · 4 quick reps

Try it yourself

For a pmf with entries K,2K,K2,2K,5K2K, 2K, K^2, 2K, 5K^2 on x=1,2,3,4,5x=1,2,3,4,5, find P(X>2)P(X > 2).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Given k=110k=\tfrac1{10} and entries 2k,k,2k,4k,k2k,k,2k,4k,k, find P(X<3)P(X<3).
  2. 2.
    Write P(X6)P(X\ge 6) as a complement.
  3. 3.
    Does P(2X<4)P(2\le X<4) include x=4x=4?
  4. 4.
    k=121k=\tfrac1{21}, entries as 4k,3k,2k4k,3k,2k on x=4,5,6x=4,5,6. Find P(3<X6)P(3<X\le 6).

From the bank · past-year question

Example 3Probability DistributionMODERATE
The probability distribution of a discrete random variable X is ---------------------------------- X 0 1 2 3 4 -------------- --- --- --- --- ---
P(X=x)P(X = x)
2 k k 2 k 4 k k ---------------------------------- If a=P(x<3)a = P(x< 3) and b=P(2X<4)b = P(2 \leq X < 4), then

[Q115 · 19 April Shift I · 2025]

P(X>a)P(X > a) and P(Xa)P(X \ge a) are not the same

P(X>2)P(X>2) excludes x=2x=2; P(X2)P(X\ge 2) includes it. For entries k2,2k,k,2k,5k2k^2,2k,k,2k,5k^2, P(X>2)=k+2k+5k2P(X>2)=k+2k+5k^2 but P(X2)=1k2P(X\ge 2)=1-k^2. Mixing the two is the top range error in this subtopic.

Use the complement only when it has fewer terms

P(Xa)=1P(X<a)P(X\ge a)=1-P(X<a) is a shortcut, not a rule to apply blindly. On a table running 070\ldots 7, the tail P(X6)={6,7}P(X\ge 6)=\{6,7\} is SHORT — add it directly. Switch to the complement only when the 'below' side has fewer cells than the tail; count both sides before choosing.

Concept 4 of 9

Finding k from a Quadratic Probability Table

Intuition

When some pmf entries carry k2k^2 (like k2,2k2,7k2+kk^2, 2k^2, 7k^2+k), summing to 1 gives a QUADRATIC in k. Two recurring MHT-CET quadratics factor cleanly: 6k2+5k1=(6k1)(k+1)6k^2+5k-1=(6k-1)(k+1) and 10k2+9k1=(10k1)(k+1)10k^2+9k-1=(10k-1)(k+1). Take the positive root (the negative one makes probabilities invalid).

Definition

The two standard quadratics and their admissible roots:

  • 6k2+5k1=0(6k1)(k+1)=0k=166k^2 + 5k - 1 = 0 \Rightarrow (6k-1)(k+1)=0 \Rightarrow k = \tfrac16 (reject k=1k=-1).
  • 10k2+9k1=0(10k1)(k+1)=0k=11010k^2 + 9k - 1 = 0 \Rightarrow (10k-1)(k+1)=0 \Rightarrow k = \tfrac1{10} (reject k=1k=-1).

Always reject the negative root — a negative kk would make some P(X=x)P(X=x) negative. After finding k, evaluate the required range, remembering the k2k^2 terms: e.g. P(X6)=2k2+(7k2+k)=9k2+kP(X\ge 6)=2k^2+(7k^2+k)=9k^2+k.

The two recurring MHT-CET quadratics

6k2+5k1=(6k1)(k+1),10k2+9k1=(10k1)(k+1)6k^2+5k-1=(6k-1)(k+1),\qquad 10k^2+9k-1=(10k-1)(k+1)

Worked example

A pmf on x=1,2,3x=1,2,3 has P=k,3k,6k2P = k, 3k, 6k^2. Find kk.
  1. Sum: k+3k+6k2=6k2+4k=1k + 3k + 6k^2 = 6k^2 + 4k = 1.
  2. So 6k2+4k1=06k^2 + 4k - 1 = 0; by the quadratic formula k=4±16+2412=4±4012k = \dfrac{-4 \pm \sqrt{16+24}}{12} = \dfrac{-4 \pm \sqrt{40}}{12}.
  3. 40=2106.32\sqrt{40}=2\sqrt{10}\approx 6.32, so k=4+6.32120.194k = \dfrac{-4+6.32}{12}\approx 0.194 (reject the negative root).
Answer:k=2+1060.19k = \dfrac{-2+\sqrt{10}}{6} \approx 0.19
Practice this conceptself-check · 4 quick reps

Try it yourself

A pmf on x=0,,7x=0,\ldots,7 is 0,k,2k,2k,3k,k2,2k2,7k2+k0, k, 2k, 2k, 3k, k^2, 2k^2, 7k^2+k. Find kk and P(X6)P(X\ge 6).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Solve 6k2+5k1=06k^2 + 5k - 1 = 0 for the admissible probability constant.
  2. 2.
    Solve 10k2+9k1=010k^2 + 9k - 1 = 0 for the admissible constant.
  3. 3.
    With k=16k=\tfrac16 and entries k2,2k,k,2k,5k2k^2,2k,k,2k,5k^2, find P(X2)P(X\ge 2).
  4. 4.
    Why is k=1k=-1 rejected in 6k2+5k1=06k^2+5k-1=0?

From the bank · past-year question

Example 4Probability DistributionMODERATE
If a random variable XX has the following probability distribution of X ----------------------------
X=xX = x
P(X=x)P(X = x)
----------- ---------------- 0 0 1 k 2 2 k 3 2 k 4 3 k 5
k2k^{2}
6
2k22k^{2}
7
7k2+k7k^{2} + k
---------------------------- Then P(X6)=P(X\geq 6) =

[Q146 · 25 April Shift I · 2025]

Reject the negative root of the k-quadratic

Both 6k2+5k1=06k^2+5k-1=0 and 10k2+9k1=010k^2+9k-1=0 have k=1k=-1 as a root. A negative kk forces negative probabilities, so it is inadmissible — always keep the positive root (16\tfrac16 or 110\tfrac1{10}).

Don't drop the k2k^2 terms when evaluating a range

For P(X6)=2k2+(7k2+k)=9k2+kP(X\ge 6)=2k^2+(7k^2+k)=9k^2+k with k=110k=\tfrac1{10}, you must square: 9k2=91009k^2=\tfrac{9}{100}, giving 19100\tfrac{19}{100}. Treating k2k^2 as kk gives the wrong tail probability.

P(X2)=1P(X=1)P(X\ge 2)=1-P(X=1), not 1P(X2)1-P(X\le 2)

For entries k2,2k,k,2k,5k2k^2,2k,k,2k,5k^2 with k=16k=\tfrac16, P(X2)=1P(1)=1k2=3536P(X\ge 2)=1-P(1)=1-k^2=\tfrac{35}{36}. Subtracting P(X2)P(X\le 2) instead would wrongly remove x=2x=2, which the '2\ge 2' event includes.

Concept 5 of 9

Finding k for an Exponential pmf on a Finite Range

Intuition

When P(X=x)=k2xP(X=x) = k\cdot 2^x over a FINITE set x=0,1,,nx=0,1,\ldots,n, summing to 1 needs a finite geometric series, not an infinite one. The trap is stopping the sum early or using an infinite-sum formula.

Definition

For P(X=x)=krxP(X=x)=k\,r^x on x=0,1,,nx=0,1,\ldots,n:

  • Finite geometric sum: x=0nrx=rn+11r1\displaystyle\sum_{x=0}^{n} r^x = \dfrac{r^{n+1}-1}{r-1} (for r1r\ne 1).
  • Set k(that sum)=1k \cdot (\text{that sum}) = 1 and solve for kk.

For r=2, n=4r=2,\ n=4: 20+21+22+23+24=1+2+4+8+16=312^0+2^1+2^2+2^3+2^4 = 1+2+4+8+16 = 31, so 31k=131k=1 and k=131k=\tfrac1{31}.

Finite geometric normalisation

x=0nkrx=krn+11r1=1\sum_{x=0}^{n} k\,r^{x} = k\cdot\frac{r^{\,n+1}-1}{r-1} = 1

Worked example

If P(X=x)=k3xP(X=x) = k\cdot 3^x for x=0,1,2x=0,1,2, find kk.
  1. Sum: k(30+31+32)=k(1+3+9)=13kk(3^0 + 3^1 + 3^2) = k(1 + 3 + 9) = 13k.
  2. Set 13k=113k = 1.
  3. So k=113k = \tfrac1{13}.
Answer:k=113k = \tfrac1{13}
Practice this conceptself-check · 4 quick reps

Try it yourself

P(X=x)=k3xP(X=x) = k\cdot 3^x is a distribution on x=0,1,2,3x=0,1,2,3. Find kk and P(X=0)P(X=0).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    P=k2x, x=0..4P=k\cdot 2^x,\ x=0..4. Value of kk?
  2. 2.
    Sum 20+21+22+23+242^0+2^1+2^2+2^3+2^4.
  3. 3.
    P=k3x, x=0,1,2P=k\cdot 3^x,\ x=0,1,2. Value of kk?
  4. 4.
    Is the finite sum x=042x\sum_{x=0}^{4}2^x equal to 2512^5-1?

From the bank · past-year question

Example 5Probability DistributionMODERATE
If f(x)=k2xf(x) = k\cdot 2^x is a probability distribution of a random variable XX that can take on the values x=0,1,2,3,4x = 0, 1, 2, 3, 4, then kk is equal to

[Q129 · May Shift 1 · 2021]

Sum a FINITE range fully — don't stop early

For P=k2xP=k\cdot 2^x on x=0,1,2,3,4x=0,1,2,3,4, the sum is 1+2+4+8+16=311+2+4+8+16=31, so k=131k=\tfrac1{31}. Stopping at x=3x=3 (=15=15) gives k=115k=\tfrac1{15} — a designed distractor. Count every listed value.

A finite exponential pmf is NOT the infinite geometric sum

x=042x=31\sum_{x=0}^{4}2^x = 31, not 112\dfrac{1}{1-2} (which diverges anyway for r>1r>1). Use the finite formula rn+11r1\dfrac{r^{n+1}-1}{r-1}; the infinite 11r\dfrac{1}{1-r} only applies when r<1|r|<1 over an infinite range.

Concept 6 of 9

Finding k for an Infinite pmf k(x+1)rˣ

Intuition

When P(X=x)=k(x+1)rxP(X=x) = k(x+1)r^x over ALL x=0,1,2,x=0,1,2,\ldots, the normalisation sum is an arithmetico-geometric progression (AGP). Its closed form x0(x+1)rx=1(1r)2\sum_{x\ge 0}(x+1)r^x = \dfrac{1}{(1-r)^2} turns the whole problem into one substitution.

Definition

For the infinite pmf P(X=x)=k(x+1)rx, x=0,1,2,P(X=x)=k(x+1)r^x,\ x=0,1,2,\ldots:

  • Key AGP sum (memorise): x=0(x+1)rx=1(1r)2\displaystyle\sum_{x=0}^{\infty}(x+1)r^{x} = \frac{1}{(1-r)^{2}} for r<1|r|<1.
  • Set k1(1r)2=1k\cdot\dfrac{1}{(1-r)^2} = 1, so k=(1r)2k = (1-r)^2.
  • For r=12r=\tfrac12: k=(112)2=14k=(1-\tfrac12)^2=\tfrac14. For r=15r=\tfrac15: k=(115)2=(45)2=1625k=(1-\tfrac15)^2=(\tfrac45)^2=\tfrac{16}{25}.

Then any P(X=x)P(X=x) follows by direct substitution — e.g. P(X=0)=k(1)(r0)=kP(X=0)=k(1)(r^0)=k.

AGP normalisation for k(x+1)rˣ

x=0(x+1)rx=1(1r)2    k=(1r)2\sum_{x=0}^{\infty}(x+1)r^{x} = \frac{1}{(1-r)^{2}} \;\Longrightarrow\; k = (1-r)^{2}
  • rthe common ratio, r<1|r|<1
  • (x+1)the arithmetic factor
  • kthe normalising constant (1r)2(1-r)^2

Worked example

P(X=x)=k(x+1)(13)xP(X=x) = k(x+1)\left(\tfrac13\right)^x for x=0,1,2,x=0,1,2,\ldots. Find kk.
  1. Normalise: kx0(x+1)(13)x=1k\displaystyle\sum_{x\ge 0}(x+1)\left(\tfrac13\right)^x = 1.
  2. Use the AGP sum with r=13r=\tfrac13: =1(113)2=1(2/3)2=94\sum = \dfrac{1}{(1-\tfrac13)^2} = \dfrac{1}{(2/3)^2} = \dfrac94.
  3. So k94=1k=49k\cdot\tfrac94 = 1 \Rightarrow k = \tfrac49.
Answer:k=49k = \tfrac49
Practice this conceptself-check · 4 quick reps

Try it yourself

P(X=x)=k(x+1)(12)xP(X=x) = k(x+1)\left(\tfrac12\right)^x for x=0,1,2,x=0,1,2,\ldots. Find P(X=1)P(X=1).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    State x=0(x+1)rx\sum_{x=0}^{\infty}(x+1)r^x.
  2. 2.
    P=k(x+1)(15)x, x0P=k(x+1)(\tfrac15)^x,\ x\ge 0. Value of kk?
  3. 3.
    P=k(x+1)(12)x, x0P=k(x+1)(\tfrac12)^x,\ x\ge 0. Value of kk?
  4. 4.
    For k(x+1)rxk(x+1)r^x, what is P(X=0)P(X=0)?

From the bank · past-year question

Example 6Probability DistributionHARD
A random variable X takes the values 0, 1, 2, 3, ____\_\_\_\_ with probability\ P(X=x)=k(x+1)(15)xP(X =x) = k(x+ 1)\left( \frac{1}{5} \right)^{x}, where k is a constant.Then P(X=0)P(X = 0) is

[Q145 · 20 April Shift II · 2025]

(x+1)rx=1(1r)2\sum(x+1)r^x = \dfrac{1}{(1-r)^2}, NOT 11r\dfrac{1}{1-r}

The arithmetic factor (x+1)(x+1) squares the denominator. For r=15r=\tfrac15, (x+1)rx=1(4/5)2=2516\sum(x+1)r^x = \dfrac{1}{(4/5)^2}=\dfrac{25}{16}, giving k=1625k=\tfrac{16}{25}. Using the plain geometric 11r=54\dfrac{1}{1-r}=\tfrac54 gives the wrong k=45k=\tfrac45.

The range is INFINITE here — use r<1|r|<1

P(X=x)=k(x+1)rxP(X=x)=k(x+1)r^x runs over all x=0,1,2,x=0,1,2,\ldots, so the infinite AGP sum applies (it converges because r=12r=\tfrac12 or 15\tfrac15 satisfies r<1|r|<1). Don't confuse it with the finite k2xk\cdot 2^x case, whose ratio exceeds 1.

Concept 7 of 9

Constructing a Probability Distribution from an Experiment

Intuition

Instead of a table with an unknown k, you are handed an experiment (toss coins, draw cards, throw a die) and must BUILD the pmf. List the possible values of X, compute each probability from the experiment, and check the total is 1.

Definition

Identify the values XX takes, then find each P(X=x)P(X=x) by the right counting rule:

  • With-replacement draws (independent trials): binomial — P(X=r)=(nr)pr(1p)nrP(X=r)=\binom{n}{r}p^r(1-p)^{n-r}. Two cards with replacement, jack has p=452=113p=\tfrac4{52}=\tfrac1{13}: P(0)=(1213)2=144169, P(1)=21131213=24169, P(2)=1169P(0)=(\tfrac{12}{13})^2=\tfrac{144}{169},\ P(1)=2\cdot\tfrac1{13}\cdot\tfrac{12}{13}=\tfrac{24}{169},\ P(2)=\tfrac1{169}.
  • Without-replacement draws: hypergeometric — P(X=r)=(Dr)(NDnr)(Nn)P(X=r)=\dfrac{\binom{D}{r}\binom{N-D}{n-r}}{\binom{N}{n}}. 4 defective + 16 good, draw 3: P(0)=(163)(203)=2857P(0)=\tfrac{\binom{16}{3}}{\binom{20}{3}}=\tfrac{28}{57}, and so on.
  • Counting outcomes (equally likely): three fair coins, X=X= heads: P(X=k)=(3k)(12)3P(X=k)=\binom{3}{k}(\tfrac12)^3, giving 18,38,38,18\tfrac18,\tfrac38,\tfrac38,\tfrac18.
  • Sequential 'until' experiments: multiply along each branch — a coin tossed until a head or 4 tails gives 12,14,18,18\tfrac12,\tfrac14,\tfrac18,\tfrac18 for X=1,2,3,4X=1,2,3,4 (the last cell pools TTTH and TTTT).

Binomial and hypergeometric building blocks

P(X=r)=(nr)pr(1p)nr,P(X=r)=(Dr)(NDnr)(Nn)P(X=r)=\binom{n}{r}p^{r}(1-p)^{n-r}, \qquad P(X=r)=\frac{\binom{D}{r}\binom{N-D}{n-r}}{\binom{N}{n}}

Worked example

Two fair coins are tossed. Let XX be the number of heads. Build the probability distribution.
  1. Sample space: HH, HT, TH, TT — four equally likely outcomes.
  2. X=0X=0 (TT): P=14P=\tfrac14. X=1X=1 (HT, TH): P=24=12P=\tfrac24=\tfrac12. X=2X=2 (HH): P=14P=\tfrac14.
  3. Check: 14+12+14=1\tfrac14+\tfrac12+\tfrac14 = 1. ✓
Answer:P(0)=14, P(1)=12, P(2)=14P(0)=\tfrac14,\ P(1)=\tfrac12,\ P(2)=\tfrac14
Practice this conceptself-check · 4 quick reps

Try it yourself

A fair die is thrown. Let XX be the number of factors of the number on the top face. Build the distribution of XX.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Two cards drawn WITH replacement; X=X= number of jacks. Find P(X=0)P(X=0).
  2. 2.
    Three fair coins; X=X= heads. Find P(X=2)P(X=2).
  3. 3.
    4 defective + 16 good oranges, draw 3 (no replacement). Find P(X=0)P(X=0).
  4. 4.
    Coin tossed until a head or 4 tails; find P(X=1)P(X=1).

From the bank · past-year question

Example 7Probability DistributionMODERATE
Four defective oranges are accidentally mixed with sixteen good ones. Three oranges are drawn from the mixed lot. The probability distribution of defective oranges is

[Q148 · 22 April Shift II · 2025]

With replacement is binomial; without replacement is hypergeometric

'Successively WITH replacement' means each draw is independent with fixed pp — use (nr)pr(1p)nr\binom{n}{r}p^r(1-p)^{n-r}. 'Drawn from the lot' WITHOUT replacement changes the pool each draw — use the (Dr)(NDnr)(Nn)\dfrac{\binom{D}{r}\binom{N-D}{n-r}}{\binom{N}{n}} ratio. Choosing the wrong model is the top construction error.

P(X=1)P(X=1) counts BOTH orders — include the factor of 2

For two independent draws, P(exactly one success)=2p(1p)P(\text{exactly one success}) = 2\,p(1-p) (success-then-fail OR fail-then-success), e.g. 24524852=241692\cdot\tfrac4{52}\cdot\tfrac{48}{52}=\tfrac{24}{169}. Forgetting the 2 halves the middle probability.

In a bounded 'until' experiment, the last cell POOLS two branches

A coin tossed until a head OR 4 tails: P(X=4)=P(TTTH)+P(TTTT)=116+116=18P(X=4)=P(\text{TTTH})+P(\text{TTTT})=\tfrac1{16}+\tfrac1{16}=\tfrac18. The forced stop at 4 means the experiment ends whether the 4th toss is H or T, so both outcomes count toward X=4X=4.

Concept 8 of 9

Cumulative Distribution Function and pmf ↔ CDF Differencing

Intuition

The CDF F(x)=P(Xx)F(x)=P(X\le x) accumulates the pmf from the left. Because it is a running total, you recover any single mass by DIFFERENCING consecutive CDF values: P(X=xi)=F(xi)F(xi1)P(X=x_i)=F(x_i)-F(x_{i-1}). This is how you answer 'find P(X=a)' or a probability ratio when only the CDF is given.

Definition

For a discrete random variable with values x1<x2<x_1<x_2<\cdots:

  • Definition: F(x)=P(Xx)=xixP(X=xi)F(x)=P(X\le x)=\displaystyle\sum_{x_i \le x}P(X=x_i); it is non-decreasing and reaches 1 at the top value.
  • Recover the pmf (differencing): P(X=xi)=F(xi)F(xi1)P(X=x_i)=F(x_i)-F(x_{i-1}), with P(X=x1)=F(x1)P(X=x_1)=F(x_1).
  • Tail from the CDF: P(X>a)=1F(a)P(X>a)=1-F(a); P(Xa)=F(a)P(X\le a)=F(a).

CDF definition and differencing

F(x)=P(Xx)=xixP(X=xi),P(X=xi)=F(xi)F(xi1)F(x)=P(X\le x)=\sum_{x_i\le x}P(X=x_i), \qquad P(X=x_i)=F(x_i)-F(x_{i-1})

Worked example

A discrete CDF has F(1)=0.2, F(2)=0.5, F(3)=1F(1)=0.2,\ F(2)=0.5,\ F(3)=1 at values x=1,2,3x=1,2,3. Find P(X=2)P(X=2).
  1. P(X=2)=F(2)F(1)P(X=2)=F(2)-F(1).
  2. =0.50.2=0.5-0.2.
  3. =0.3=0.3.
Answer:P(X=2)=0.3P(X=2)=0.3
Practice this conceptself-check · 4 quick reps

Try it yourself

For a discrete CDF with F(3)=0.1, F(1)=0.3, F(0)=0.5F(-3)=0.1,\ F(-1)=0.3,\ F(0)=0.5 at values 3,1,0,-3,-1,0,\ldots, find P(X=3)P(X<0)\dfrac{P(X=-3)}{P(X<0)}.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    State the CDF differencing rule for a discrete pmf.
  2. 2.
    F(0)=0.5F(0)=0.5 in a CDF. Find P(X>0)P(X>0).
  3. 3.
    F(3)=0.1, F(1)=0.3F(-3)=0.1,\ F(-1)=0.3. Find P(X=1)P(X=-1).
  4. 4.
    If P(X0)=0.5P(X\le 0)=0.5 and P(X>0)=0.5P(X>0)=0.5, find their ratio.

From the bank · past-year question

Example 8Probability DistributionEASY
The c.d.f. of a discrete random variable XX is ------------------- X
F(X=x)F(X = x)
---- -------------- -3 0.1 -1 0.3 0 0.5 1 0.65 3 0.75 5 0.85 7 0.90 9 1 ------------------- Then P[X=3]P[X<0]=\frac{P\lbrack X = - 3\rbrack}{P\lbrack X < 0\rbrack}=

[Q120 · 21 April Shift II · 2025]

Read P(X=xi)P(X=x_i) as a CDF DIFFERENCE, not the CDF value

Except for the smallest value, P(X=xi)=F(xi)F(xi1)P(X=x_i)=F(x_i)-F(x_{i-1}), not F(xi)F(x_i). Only P(X=x1)=F(x1)P(X=x_1)=F(x_1) (nothing accumulated before it). Reading P(X=0)=F(0)=0.5P(X=0)=F(0)=0.5 directly (when earlier values exist) double-counts.

P(X0)=F(0)P(X\le 0)=F(0); P(X>0)=1F(0)P(X>0)=1-F(0)

From a CDF, P(X0)P(X\le 0) is exactly F(0)F(0), and P(X>0)=1F(0)P(X>0)=1-F(0). If F(0)=0.5F(0)=0.5 the ratio P(X0)P(X>0)=0.50.5=1\dfrac{P(X\le0)}{P(X>0)}=\dfrac{0.5}{0.5}=1. Watch \le vs <<: P(X<0)P(X<0) excludes the mass exactly at 0.

Concept 9 of 9

Continuous Random Variables — pdf, Normalisation, CDF and P(a < X < b)

Intuition

For a continuous variable, probability lives in a density f(x)f(x), not at single points. Every discrete 'sum' becomes an INTEGRAL: normalise with fdx=1\int f\,dx=1 to find a constant, get the CDF by F(x)=xfdtF(x)=\int_{-\infty}^{x} f\,dt, and read an interval probability as abfdx\int_a^b f\,dx.

Definition

The continuous analogues of the pmf rules:

  • Normalisation (find the constant): f(x)dx=1\displaystyle\int_{-\infty}^{\infty} f(x)\,dx = 1, integrated over the support only.
  • CDF: F(x)=xf(t)dtF(x)=\displaystyle\int_{-\infty}^{x} f(t)\,dt; FF rises from 0 to 1, and F(x)=f(x)F'(x)=f(x).
  • Interval probability: P(a<X<b)=abf(x)dx=F(b)F(a)P(a<X<b)=\displaystyle\int_a^b f(x)\,dx = F(b)-F(a). For continuous XX, << and \le give the same value.
  • Two-condition pdf: if the pdf has TWO unknowns, use f=1\int f=1 AND a given point value (like f(2)=2f(2)=2) to solve the pair.

Continuous normalisation, CDF, interval

f(x)dx=1,F(x)=xf(t)dt,P(a<X<b)=abf(x)dx\int_{-\infty}^{\infty} f(x)\,dx = 1,\quad F(x)=\int_{-\infty}^{x} f(t)\,dt,\quad P(a<X<b)=\int_a^b f(x)\,dx
  • f(x)probability density function
  • F(x)cumulative distribution function, F=fF'=f

Worked example

A pdf is f(x)=kxf(x)=kx for 0<x<20<x<2 (and 0 otherwise). Find kk and P(X<1)P(X<1).
  1. Normalise: 02kxdx=k[x22]02=2k=1k=12\displaystyle\int_0^2 kx\,dx = k\left[\tfrac{x^2}{2}\right]_0^2 = 2k = 1 \Rightarrow k=\tfrac12.
  2. So f(x)=x2f(x)=\tfrac{x}{2}.
  3. P(X<1)=01x2dx=12[x22]01=1212=14P(X<1)=\displaystyle\int_0^1 \tfrac{x}{2}\,dx = \tfrac12\left[\tfrac{x^2}{2}\right]_0^1 = \tfrac12\cdot\tfrac12 = \tfrac14.
Answer:k=12,P(X<1)=14k=\tfrac12,\quad P(X<1)=\tfrac14
Practice this conceptself-check · 4 quick reps

Try it yourself

The pdf is f(x)=3x28f(x)=\tfrac{3x^2}{8} for 0<x<20<x<2 (0 otherwise). Find F(1)F(1) and P(X>1)P(X>1).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    State the normalisation condition for a continuous pdf.
  2. 2.
    For f(x)=kx2f(x)=kx^2 on 0x60\le x\le 6, find kk.
  3. 3.
    For a continuous XX, is P(a<X<b)=P(aXb)P(a<X<b)=P(a\le X\le b)?
  4. 4.
    If f(x)=x8f(x)=\tfrac{x}{8} on (0,4)(0,4), give F(x)F(x) on the support.

From the bank · past-year question

Example 9Probability DistributionMODERATE
The following is p.d.f. of continuous random variable X f(x)={x8, if 0<x<40, otherwise f(x) =\left\{ \begin{matrix} \frac{x}{8} & ,\text{~if~}0 <x< 4 \\ 0 & ,\text{~otherwise~} \end{matrix} \right..Then F(0.5),F(1.7)F(0.5),F(1.7) and F(5)F(5) is respectively

[Q134 · 26 April Shift I · 2025]

Integrate over the SUPPORT only

For a continuous pdf f(x)=kx2f(x)=kx^2 on 0x60\le x\le 6, normalise 06kx2dx=[kx33]06=72k=1\int_0^6 kx^2\,dx = \left[\tfrac{kx^3}{3}\right]_0^6 = 72k = 1, giving k=172k=\tfrac1{72} — integrate only where f0f\ne 0 (f=0f=0 outside [0,6][0,6]). Applying \int_{-\infty}^{\infty} blindly, or a discrete SUM x=06kx2\sum_{x=0}^{6} kx^2 (which would give 91k=191k=1), is the standard continuous-vs-discrete mix-up.

A two-unknown pdf needs TWO equations

If f(x)=ax22+bxf(x)=\tfrac{ax^2}{2}+bx on [1,3][1,3] has unknowns a,ba,b, one equation is 13fdx=1\int_1^3 f\,dx=1; the second is a given value like f(2)=22a+2b=2f(2)=2\Rightarrow 2a+2b=2. Solve the pair — normalisation alone is not enough.

The CDF is the running integral, and F(x)=f(x)F'(x)=f(x)

F(x)=xfF(x)=\int_{-\infty}^{x} f. For f(x)=12x2(1x)f(x)=12x^2(1-x) on (0,1)(0,1), F(x)=0x12t2(1t)dt=4x33x4F(x)=\int_0^x 12t^2(1-t)\,dt = 4x^3-3x^4. Differentiating back must return ff — a quick check that catches sign or coefficient errors.

P(X<2)P(|X|<2) is a symmetric integral 22\int_{-2}^{2}

For f(x)=x218f(x)=\tfrac{x^2}{18} on (3,3)(-3,3), P(X<2)=22x218dx=118163=827P(|X|<2)=\int_{-2}^{2}\tfrac{x^2}{18}\,dx=\tfrac1{18}\cdot\tfrac{16}{3}=\tfrac{8}{27}. Integrate from 2-2 to 22, not 00 to 22X<2|X|<2 means 2<X<2-2<X<2.

Summary — formulas & gotchas at a glance

A revision cheat-sheet for the formulas and gotchas above. Click any concept name to jump back to its full explanation.

Formulas (9)

Watch out for (22)

Mastery check — 5 interleaved questions

Try each one before clicking. Questions are interleaved across the concepts above, not grouped — interleaving sharpens transfer.

Example 1Probability DistributionMODERATE
Let X denote the number of hours you study on a Sunday. It is known that P(X=x)={0.1, if x=0kx, if x=1 or 2k(5x), if x=3 or 40, otherwise P(X =x) =\left\{ \begin{matrix} 0.1 & ,\text{~if~}x= 0 \\ kx & ,\text{~if~}x= 1\text{~or~}2 \\ k(5 -x) & ,\text{~if~}x= 3\text{~or~}4 \\ 0 & ,\text{~otherwise~} \end{matrix} \right.where k is constant. Then the probability that you study at least two hours on a Sunday is

[Q116 · 25 April Shift I · 2025]

Example 2Probability DistributionMODERATE
If a random variable X has the following probability distribution values: X: 0,1,2,3,4,5,6,7 and P(X): 0,k,2k,2k,3k,k2,2k2,7k2+k0,k,2k,2k,3k,k^2,2k^2,7k^2+k. Then P(X6)P(X \geq 6) has the value

[Q106 · 16th May Shift 1 · 2023]

Example 3Probability DistributionHARD
If a discrete random variable X is defined as follows: P[X=x]=k(x+1)5xP[X=x] = k\frac{(x+1)}{5^x}, if x=0,1,2,x = 0,1,2,\ldots, 0 otherwise, then k=k =

[Q121 · 16th May Shift 2 · 2023]

Example 4Probability DistributionEASY
Two cards are drawn successively with replacement from a well shuffled pack of 52 cards. Then the probability distribution of number of jacks is

[Q128 · 12th May Shift 1 · 2024]

Example 5Probability DistributionMODERATE
The cumulative distribution function of a discrete random variable X is then P(X0)P(X>0)=\frac{P(X \leqslant 0)}{P(X > 0)}= --------------------------
X=xX = x
F(X=x)F(X = x)
----------- -------------- -4 0.1 -2 0.3 0 0.5 2 0.65 4 0.75 6 0.85 8 0.90 10 1 --------------------------

[Q131 · 26 April Shift I · 2025]

Drill every past-year question on this subtopic

31 questions from the bank — paginated, with cart and Word-export support.

Related notes