MHT-CET Maths · Probability Distribution

Expectation, Variance and Standard Deviation

Once you can read a probability distribution, three number-summaries follow: the mean E(X) = Σx·P(x) (the long-run average), the variance Var(X) = E(X²) − [E(X)]² (the spread), and the standard deviation SD = √Var — the single most-tested cluster of formulas in this chapter.

Why this matters

This subtopic carries 37 PYQs (7 HARD, 24 MODERATE, 6 EASY) and every year returns three near-identical shapes: compute mean/variance/SD from a pmf, find the expected winnings of a coin or die game, and use the uniform-distribution shortcuts E(X) = (n+1)/2 and Var(X) = (n²−1)/12. The traps are mechanical and repeat: squaring the mean instead of averaging the squares, forgetting to convert a CDF to a pmf first, taking SD as the variance (or vice versa), and mishandling the sign of a loss in a game. Nail the four core formulas and this section is free marks.

Concept 1 of 7

Expectation as the Long-Run Average

Intuition

The expected value (mean) of a discrete random variable is a probability-weighted average of the values it can take — each value pulled toward the mean in proportion to how likely it is. If you played the same random experiment a huge number of times and averaged the outcomes, that average would settle at E(X). It need not be an attainable value of X (a die's mean is 3.5).

Definition

For a discrete random variable X with probability mass function P(X=xi)=piP(X=x_i)=p_i:

  • Expected value / mean: E(X)=μ=ixipiE(X) = \mu = \sum_i x_i\,p_i — multiply each value by its probability and add.
  • The probabilities must satisfy ipi=1\sum_i p_i = 1 and pi0p_i \ge 0; this is always the first thing to check (and how you find an unknown kk).
  • Expectation is linear: E(aX+b)=aE(X)+bE(aX+b) = aE(X)+b, and for independent parts E(X+Y)=E(X)+E(Y)E(X+Y)=E(X)+E(Y) (so the mean of the sum on two dice is 3.5+3.5=73.5+3.5=7).

Expected value of a discrete random variable

E(X)=μ=ixiP(X=xi)E(X) = \mu = \sum_{i} x_i\,P(X=x_i)
  • x_ithe values X can take
  • P(X=x_i)the probability of each value (the pmf)
  • \muthe mean / expected value — a weighted average, not always an attainable value

Worked example

A random variable X takes values 0, 1, 2 with probabilities 0.5, 0.3, 0.2. Find E(X).
  1. Check the probabilities sum to 1: 0.5+0.3+0.2=10.5+0.3+0.2 = 1. Good.
  2. Weight each value by its probability: E(X)=0(0.5)+1(0.3)+2(0.2)E(X) = 0(0.5) + 1(0.3) + 2(0.2).
  3. Add: E(X)=0+0.3+0.4=0.7E(X) = 0 + 0.3 + 0.4 = 0.7.
Answer:E(X)=0.7E(X) = 0.7
Practice this conceptself-check · 4 quick reps

Try it yourself

A random variable X takes values 1, 2, 3 with probabilities 12,13,16\tfrac12, \tfrac13, \tfrac16. Find E(X).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Write the definition of the mean of a discrete random variable X.
  2. 2.
    What must the probabilities of any pmf add up to?
  3. 3.
    Compute E(X) for X = 2, 4 with probabilities 14,34\tfrac14, \tfrac34.
  4. 4.
    Is E(X) always one of the values X can take?

The mean is a weighted average, not a plain average of the values

E(X)=xipiE(X) = \sum x_i p_i — each value is weighted by its OWN probability. Averaging the values while ignoring the probabilities (e.g. reporting 1+2+3+44\tfrac{1+2+3+4}{4} for a non-uniform pmf) is the most common beginner slip.

Always verify pi=1\sum p_i = 1 before computing anything

If a table contains an unknown like kk or 2k2k, the normalization pi=1\sum p_i = 1 is the equation that determines it. Compute the mean only after fixing every probability, or the whole answer is off.

Concept 2 of 7

Computing the Mean E(X) from a Probability Distribution

Intuition

The workhorse EASY/MODERATE task: a table of values and probabilities is given (sometimes with an unknown k fixed by Σp = 1 first), and you read off E(X) = Σx·P(x) directly. For a sum on two dice, linearity E(X+Y) = E(X)+E(Y) is faster than building the full 2..12 distribution.

Definition

To find the mean from a pmf:

  • If a probability is unknown, first use P(x)=1\sum P(x) = 1 to solve for it.
  • Then apply E(X)=xP(x)E(X) = \sum x\,P(x) row by row.
  • Linearity shortcut: for the sum of two independent variables, E(X+Y)=E(X)+E(Y)E(X+Y) = E(X)+E(Y). Each fair die has mean 1+2++66=3.5\tfrac{1+2+\cdots+6}{6} = 3.5, so the expected sum of two dice is 3.5+3.5=73.5+3.5 = 7 — no need to list all 36 outcomes.

Mean of a listed distribution

E(X)=ixiP(X=xi),E(X+Y)=E(X)+E(Y)E(X) = \sum_{i} x_i\,P(X=x_i),\qquad E(X+Y) = E(X)+E(Y)

Worked example

A random variable X has distribution X = 1, 2, 3 with P(x) = 0.2, 0.5, 0.3. Find the mean E(X).
  1. Check: 0.2+0.5+0.3=10.2+0.5+0.3 = 1. Good.
  2. E(X)=1(0.2)+2(0.5)+3(0.3)E(X) = 1(0.2) + 2(0.5) + 3(0.3).
  3. =0.2+1.0+0.9=2.1= 0.2 + 1.0 + 0.9 = 2.1.
Answer:E(X)=2.1E(X) = 2.1
Practice this conceptself-check · 4 quick reps

Try it yourself

For the distribution X = 0, 1, 2, 3, 4, 5 with P(x) = k, 0.3, 0.15, 0.15, 0.1, 2k, find E(X).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Find the expected sum of the numbers on two fair dice.
  2. 2.
    X = 1, 2, 3 with P = 0.1, 0.6, 0.3. Find E(X).
  3. 3.
    The mean of one fair die is?
  4. 4.
    If P = k, 2k, 3k for X = 1, 2, 3, find k.

From the bank · past-year question

Example 2Probability DistributionEASY
The expected value of the sum of the two numbers obtained on the uppermost faces, when two fair dice are rolled, is

[Q141 · 4th May Shift 2 · 2023]

Use linearity for the sum on dice, don't build all 36 outcomes

The expected sum on two dice is E(X1)+E(X2)=3.5+3.5=7E(X_1)+E(X_2) = 3.5+3.5 = 7. You never need the full 2..12 distribution — but if you build it, xP(x)=25236=7\sum x P(x) = \tfrac{252}{36} = 7 confirms the same answer.

Solve for the unknown probability before taking the mean

A row like P(X=5)=2kP(X=5) = 2k is meaningless until P=1\sum P = 1 fixes kk. Find kk first, substitute back, THEN compute E(X)=xP(x)E(X) = \sum xP(x).

Concept 3 of 7

Variance and Standard Deviation: Var(X) = E(X²) − [E(X)]²

Intuition

Variance measures spread — the average squared distance of X from its mean. The exam-friendly form never uses the deviations directly: compute E(X²) = Σx²·P(x) with the SAME probabilities, subtract the square of the mean, and you have the variance. Standard deviation is just its square root (and carries the original units). Some questions hand you a CDF F(x) instead of the pmf — difference it into P(x) first.

Definition

The three core formulas of this subtopic:

  • E(X²): E(X2)=x2P(x)E(X^2) = \sum x^2\,P(x) — square each value, weight by its probability.
  • Variance: Var(X)=E(X2)[E(X)]2\mathrm{Var}(X) = E(X^2) - [E(X)]^2 (equivalently (xμ)2P(x)\sum(x-\mu)^2 P(x), but the E(X²) form is faster).
  • Standard deviation: σ=Var(X)\sigma = \sqrt{\mathrm{Var}(X)}.
  • From a CDF: if F(x) is given, recover the pmf by P(xi)=F(xi)F(xi1)P(x_i) = F(x_i) - F(x_{i-1}) (with P(x1)=F(x1)P(x_1)=F(x_1)) before computing E(X) or E(X²).

Variance and standard deviation

Var(X)=E(X2)[E(X)]2,E(X2)=x2P(x),σ=Var(X)\mathrm{Var}(X) = E(X^2) - [E(X)]^2,\qquad E(X^2) = \sum x^2 P(x),\qquad \sigma = \sqrt{\mathrm{Var}(X)}
  • E(X^2)average of the SQUARES: x2P(x)\sum x^2 P(x)
  • [E(X)]^2the SQUARE of the mean — a different, smaller number in general
  • \sigmastandard deviation = Var\sqrt{\text{Var}}, in the same units as X

Worked example

A random variable X takes values 1, 2, 3 with probabilities 0.2, 0.3, 0.5. Find its variance and standard deviation.
  1. Mean: E(X)=1(0.2)+2(0.3)+3(0.5)=0.2+0.6+1.5=2.3E(X) = 1(0.2)+2(0.3)+3(0.5) = 0.2+0.6+1.5 = 2.3.
  2. E(X2)=1(0.2)+4(0.3)+9(0.5)=0.2+1.2+4.5=5.9E(X^2) = 1(0.2)+4(0.3)+9(0.5) = 0.2+1.2+4.5 = 5.9.
  3. Variance: Var(X)=5.9(2.3)2=5.95.29=0.61\mathrm{Var}(X) = 5.9 - (2.3)^2 = 5.9 - 5.29 = 0.61.
  4. Standard deviation: σ=0.610.781\sigma = \sqrt{0.61} \approx 0.781.
Answer:Var(X)=0.61,σ=0.610.78\mathrm{Var}(X) = 0.61,\quad \sigma = \sqrt{0.61}\approx 0.78
Practice this conceptself-check · 4 quick reps

Try it yourself

The cumulative distribution function of X is F(-1) = 0.3, F(0) = 0.7, F(1) = 0.8, F(2) = 1. Find E(X²).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    State the variance formula in terms of E(X²) and E(X).
  2. 2.
    If E(X) = 3 and E(X²) = 9.6, find the variance.
  3. 3.
    How do you get the pmf from a CDF F(x)?
  4. 4.
    If Var(X) = 4, what is the standard deviation?

From the bank · past-year question

Example 3Probability DistributionEASY
A random variable X has the following probability distribution : --------------------------------------
X=xX = x
1 2 3 4 -------------- ----- ----- ----- -----
P(X=x)P(X = x)
0.1 0.2 0.3 0.4 -------------------------------------- The mean and standard deviation of XX are respectively

[Q133 · 26 April Shift II · 2025]

E(X²) is NOT [E(X)]²

E(X2)=x2P(x)E(X^2) = \sum x^2 P(x) averages the squares; [E(X)]2[E(X)]^2 squares the average. They are equal only when the variance is zero. The whole variance formula is the gap between them: Var(X)=E(X2)[E(X)]2\mathrm{Var}(X) = E(X^2) - [E(X)]^2.

Convert a CDF to a pmf before computing an expectation

If the question gives F(x)F(x) (cumulative), you must first difference it: P(xi)=F(xi)F(xi1)P(x_i) = F(x_i) - F(x_{i-1}). Plugging the CDF values straight into x2F(x)\sum x^2 F(x) is a classic wrong answer.

Standard deviation vs variance — don't hand back the wrong one

SD =Var= \sqrt{\mathrm{Var}}. If the variance works out to 1, the SD is also 1; if the variance is 2, the SD is 2\sqrt2, not 2. Options are deliberately built to punish reporting the variance when the SD is asked (and vice versa).

Variance is never negative

Var(X)=E(X2)[E(X)]20\mathrm{Var}(X) = E(X^2) - [E(X)]^2 \ge 0 always. If you get a negative variance, you have squared the mean wrong or mixed up E(X²) and [E(X)]² — recheck before choosing an option.

Concept 4 of 7

Expected Winnings of a Game: E(g(X)) = Σ g(x)·P(x)

Intuition

The recurring 'game' genre: a payoff (win some money on one outcome, lose money on another) is a function g(X) of the random outcome, and its expected value is the probability-weighted sum of the payoffs. A LOSS enters as a negative amount. If the expected value is 0 the game is fair; positive means a long-run gain, negative a long-run loss.

Definition

For a payoff g(X):

  • Expected payoff: E(g(X))=g(x)P(x)E(g(X)) = \sum g(x)\,P(x) — weight each cash outcome (a gain positive, a loss negative) by its probability.
  • Typical 3-coin game: P(all heads or all tails)=28=14P(\text{all heads or all tails}) = \tfrac{2}{8} = \tfrac14 and P(one or two heads)=68=34P(\text{one or two heads}) = \tfrac{6}{8} = \tfrac34.
  • Variance of a winning amount uses the same E(X²) − [E(X)]² machinery on the payoff values: list the winnings with their probabilities, then apply the variance formula.
  • E=0E = 0 ⟹ fair game; E>0E > 0 ⟹ expected gain; E<0E < 0 ⟹ expected loss.

Expected value of a payoff (function of X)

E(g(X))=g(x)P(x)E\big(g(X)\big) = \sum g(x)\,P(x)
  • g(x)the cash payoff for outcome x — positive for a gain, NEGATIVE for a loss
  • P(x)the probability of that outcome

Worked example

Two coins are tossed. A player wins ₹6 if two heads appear, and loses ₹3 otherwise. Find the expected gain per game.
  1. Outcomes: two heads with P=14P = \tfrac14 (win +6); otherwise with P=34P = \tfrac34 (loss −3).
  2. E=(+6)14+(3)34E = (+6)\cdot\tfrac14 + (-3)\cdot\tfrac34.
  3. =6494=34=0.75= \tfrac{6}{4} - \tfrac{9}{4} = -\tfrac{3}{4} = -0.75.
Answer:Expected loss of ₹0.75 per game (E=0.75E = -0.75).
Practice this conceptself-check · 4 quick reps

Try it yourself

A player tosses two coins and wins ₹10 for 2 heads, ₹5 for one head, ₹2 for no head. Find the variance of the winning amount.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    In a 3-coin toss, what is P(all heads or all tails)?
  2. 2.
    A game pays +40 with probability 14\tfrac14 and −40 with probability 34\tfrac34. Expected gain?
  3. 3.
    What does E = 0 mean for a game?
  4. 4.
    How does a LOSS enter the expected-value sum?

From the bank · past-year question

Example 4Probability DistributionMODERATE
In a game, 3 coins are tossed. A person is paid ₹150 if he gets all heads or all tails and he is supposed to pay ₹50 if he gets one head or two heads. The amount he can expect to win / lose on an average per game in ₹ is

[Q123 · 19 April Shift II · 2025]

A loss is a negative payoff — carry the minus sign

In a game that pays ₹150 on all-heads/all-tails and requires paying ₹50 otherwise, the second term is 50-50, not +50+50: E=14(150)+34(50)=37.537.5=0E = \tfrac14(150) + \tfrac34(-50) = 37.5 - 37.5 = 0. Dropping the minus turns a fair game into a phantom win.

Get the all-heads/all-tails probability right

With 3 coins there are 8 equally-likely outcomes. All heads OR all tails is 2 of them, so P=28=14P = \tfrac{2}{8} = \tfrac14; exactly one or two heads is the remaining 6, so P=68=34P = \tfrac{6}{8} = \tfrac34. Using 18\tfrac18 (only all heads) mis-weights the whole expectation.

Variance of a winning amount is still E(X²) − [E(X)]²

Treat the cash winnings as the values of X, list them with their probabilities, and apply the ordinary variance formula. Don't confuse the variance of the payoff with the expected payoff itself.

Concept 5 of 7

Uniform Distribution on 1 to n: E(X) = (n+1)/2, Var(X) = (n²−1)/12

Intuition

When X takes the equally-likely values 1, 2, …, n each with probability 1/n, two clean formulas drop out and are worth memorising: the mean is the midpoint (n+1)/2 and the variance is (n²−1)/12. These turn 'Var:E = 4:1, find n' and 'Var = E, find n' into one-line algebra. The same pmf sometimes arrives as P(x) = 2x / [n(n+1)], which is NOT uniform — derive its mean from Σx²·(weight).

Definition

For the discrete uniform distribution on {1,2,,n}\{1,2,\ldots,n\}, P(x)=1nP(x)=\tfrac1n:

  • Mean: E(X)=n+12E(X) = \dfrac{n+1}{2} (the middle value).
  • Variance: Var(X)=n2112\mathrm{Var}(X) = \dfrac{n^2-1}{12}.
  • Handy ratio: Var(X)E(X)=(n21)/12(n+1)/2=n16\dfrac{\mathrm{Var}(X)}{E(X)} = \dfrac{(n^2-1)/12}{(n+1)/2} = \dfrac{n-1}{6} — the fastest route to 'find n' questions.
  • A weighted pmf P(x)=2xn(n+1)P(x) = \dfrac{2x}{n(n+1)} on 1,,n1,\ldots,n is a different distribution: E(X)=x2xn(n+1)=2n(n+1)n(n+1)(2n+1)6=2n+13E(X) = \sum x\cdot\dfrac{2x}{n(n+1)} = \dfrac{2}{n(n+1)}\cdot\dfrac{n(n+1)(2n+1)}{6} = \dfrac{2n+1}{3}.

Discrete uniform on 1..n

E(X)=n+12,Var(X)=n2112,Var(X)E(X)=n16E(X) = \dfrac{n+1}{2},\qquad \mathrm{Var}(X) = \dfrac{n^2-1}{12},\qquad \dfrac{\mathrm{Var}(X)}{E(X)} = \dfrac{n-1}{6}
  • nthe number of equally-likely integer values 1, 2, …, n

Worked example

A random variable X takes the values 1, 2, …, n each with probability 1/n. If Var(X) = E(X), find n.
  1. Uniform formulas: E(X)=n+12E(X) = \dfrac{n+1}{2}, Var(X)=n2112\mathrm{Var}(X) = \dfrac{n^2-1}{12}.
  2. Set them equal: n2112=n+12\dfrac{n^2-1}{12} = \dfrac{n+1}{2}.
  3. Factor n21=(n1)(n+1)n^2-1 = (n-1)(n+1) and cancel the common (n+1)(n+1): n112=12\dfrac{n-1}{12} = \dfrac{1}{2}.
  4. So n1=6n - 1 = 6, giving n=7n = 7.
Answer:n=7n = 7
Practice this conceptself-check · 4 quick reps

Try it yourself

A random variable X takes the values 1, 2, …, n with equal probability. If Var(X) : E(X) = 4 : 1, find n.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Mean of the uniform distribution on 1, 2, …, n?
  2. 2.
    Variance of the uniform distribution on 1, 2, …, n?
  3. 3.
    For the uniform on 1..n, evaluate Var(X)/E(X).
  4. 4.
    E(X) if P(x)=2xn(n+1)P(x) = \dfrac{2x}{n(n+1)} on 1,,n1,\ldots,n?

From the bank · past-year question

Example 5Probability DistributionMODERATE
A discrete random variable X can take all possible integer values from 1 to k, each with a probability 1k\frac{1}{k}, then its variance is

[Q122 · 13th May Shift 1 · 2024]

Memorise both uniform formulas — mean (n+1)/2 AND variance (n²−1)/12

The variance is n2112\dfrac{n^2-1}{12}, not n216\dfrac{n^2-1}{6} or n2+112\dfrac{n^2+1}{12} — those are the standard distractors. And E(X) is n+12\dfrac{n+1}{2}, the midpoint, not n2\dfrac{n}{2}.

Cancel the (n+1) factor for 'find n' questions

Equations like n2112=n+12\dfrac{n^2-1}{12} = \dfrac{n+1}{2} collapse instantly once you write n21=(n1)(n+1)n^2-1 = (n-1)(n+1) and cancel the shared (n+1)(n+1). This avoids solving a full quadratic.

P(x) = 2x/[n(n+1)] is NOT the uniform distribution

When probability grows with x, you cannot use E=(n+1)/2E = (n+1)/2. Compute E(X)=x2xn(n+1)E(X) = \sum x\cdot\dfrac{2x}{n(n+1)} using x2=n(n+1)(2n+1)6\sum x^2 = \tfrac{n(n+1)(2n+1)}{6}, which gives 2n+13\dfrac{2n+1}{3}.

Concept 6 of 7

Finding Unknown Probabilities from the Mean and ΣP = 1

Intuition

Two unknowns in a pmf need two equations, and the exam always provides them: the normalization ΣP = 1 and a given value of E(X) (or an extra relation like P(X=3) = 2P(X=1)). Set up the two linear equations and solve — the classic 'find A and B' or 'find P(X=0)' shape. When p varies over a range, non-negativity of every probability bounds p, and the mean's extreme values sit at those bounds.

Definition

The two-equation setup:

  • Equation 1 (normalization): P(x)=1\sum P(x) = 1.
  • Equation 2 (mean or an extra relation): E(X)=xP(x)=givenE(X) = \sum x\,P(x) = \text{given}, or a stated link such as P(X=3)=2P(X=1)P(X=3)=2P(X=1).
  • Solve the linear system for the unknown probabilities.
  • Range problems: when the probabilities depend on a parameter p, impose 0P(x)10 \le P(x) \le 1 on EVERY row to get an interval for p; the mean (a linear function of p) attains its extreme values at the endpoints of that interval.

The determining system

P(x)=1andE(X)=xP(x)=μgiven\sum P(x) = 1 \quad\text{and}\quad E(X) = \sum x\,P(x) = \mu_{\text{given}}

Worked example

A random variable X takes values 0, 10, 20 with probabilities 0.2, A, B. If E(X) = 11, find A and B.
  1. Normalization: 0.2+A+B=1A+B=0.80.2 + A + B = 1 \Rightarrow A + B = 0.8.
  2. Mean: 0(0.2)+10A+20B=1110A+20B=110(0.2) + 10A + 20B = 11 \Rightarrow 10A + 20B = 11.
  3. From the first equation A=0.8BA = 0.8 - B; substitute: 10(0.8B)+20B=118+10B=11B=0.310(0.8 - B) + 20B = 11 \Rightarrow 8 + 10B = 11 \Rightarrow B = 0.3.
  4. Then A=0.80.3=0.5A = 0.8 - 0.3 = 0.5.
Answer:A=0.5, B=0.3A = 0.5,\ B = 0.3
Practice this conceptself-check · 4 quick reps

Try it yourself

A random variable X takes the values 0, 1, 2, 3 and has mean 1.3. Given P(X=3) = 2P(X=1) and P(X=2) = 0.3, find P(X=0).

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    How many equations are needed to fix two unknown probabilities?
  2. 2.
    If 15+A+B=1\tfrac15 + A + B = 1, what is A + B?
  3. 3.
    What bounds a parameter p appearing inside probabilities?
  4. 4.
    A mean that is linear in p attains its extremes where?

From the bank · past-year question

Example 6Probability DistributionMODERATE
Let XX be a discrete random variable. The probability distribution of X is given below --------------------------------------------- X 30 10 -10 ---------- ----------------------- ---- -----
P(X)P(X)
15\cdot \frac{1}{5}
A B --------------------------------------------- and E(X)=4E(X) = 4, then the value of ABAB is equal to

[Q113 · 19 April Shift II · 2025]

Watch the sign in the E(X) equation

For X = 30, 10, −10 with P = 1/5, A, B and E(X) = 4: 6+10A10B=46 + 10A - 10B = 4, so AB=15A - B = -\tfrac15 (a MINUS). A sign slip here flips A and B and gives the wrong product AB.

Use the extra stated relation as your second equation

A clue like P(X=3)=2P(X=1)P(X=3) = 2P(X=1) is not decoration — it is one of the two equations you need. Combined with the mean and P=1\sum P = 1, it pins every probability down.

For range problems, apply non-negativity to EVERY row

When probabilities like 1+p5,22p5,2p5,2p5\tfrac{1+p}{5}, \tfrac{2-2p}{5}, \tfrac{2-p}{5}, \tfrac{2p}{5} contain p, each must lie in [0,1][0,1]. The tightest of those inequalities gives p's actual range; the mean's min/max occur at the ends of that range, not by guessing.

Concept 7 of 7

Expectation of Standard Distributions: Geometric and Hypergeometric

Intuition

Some questions dress a named distribution in a story. 'Roll until success' is GEOMETRIC — its mean number of trials is 1/p. 'Draw without replacement and count the successes' is HYPERGEOMETRIC — its mean is nK/N. And a 'card-drawing, then 2E(X)+3E(X²)' problem just needs the small pmf built from combinations, then the two moments. Recognise the shape and the mean often takes one line.

Definition

Standard-distribution means that appear here:

  • Geometric (trials until first success), success probability p: mean =1p= \dfrac{1}{p}. Rolling an n-faced die until a number <n< n shows has p=n1np = \dfrac{n-1}{n}, so mean =nn1= \dfrac{n}{n-1}.
  • Hypergeometric (n drawn without replacement from N containing K successes): E(X)=nKNE(X) = \dfrac{nK}{N}.
  • Small-pmf via combinations: for 'X = number of queens in 2 cards' build P(X=k)=(4k)(482k)(522)P(X=k) = \dfrac{\binom{4}{k}\binom{48}{2-k}}{\binom{52}{2}}, then E(X)=kPE(X)=\sum kP, E(X2)=k2PE(X^2)=\sum k^2 P.
  • Larger-of-two: X = larger of two numbers drawn from {1,,m}\{1,\ldots,m\} has P(X=k)=k1(m2)P(X=k) = \dfrac{k-1}{\binom{m}{2}}; then E(X)=kP(X=k)E(X)=\sum k P(X=k).

Means of named distributions

Geometric: E(X)=1p,Hypergeometric: E(X)=nKN\text{Geometric: } E(X) = \dfrac{1}{p},\qquad \text{Hypergeometric: } E(X) = \dfrac{nK}{N}
  • psuccess probability of one trial (geometric)
  • Ntotal items in the lot (hypergeometric)
  • Knumber of successes in the lot
  • nnumber of items drawn without replacement

Worked example

From a lot of 20 items containing 5 defectives, 2 items are drawn at random without replacement. Find the expected number of defectives.
  1. This is hypergeometric with N=20N = 20, K=5K = 5, n=2n = 2.
  2. Mean: E(X)=nKN=2×520E(X) = \dfrac{nK}{N} = \dfrac{2 \times 5}{20}.
  3. =1020=0.5= \dfrac{10}{20} = 0.5.
Answer:E(X)=0.5E(X) = 0.5 defectives
Practice this conceptself-check · 4 quick reps

Try it yourself

A fair n-faced die is rolled repeatedly until a number less than n appears. If the mean number of tosses required is n/9, find n.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Mean number of trials until first success (probability p)?
  2. 2.
    Expected successes when drawing n from N with K successes (no replacement)?
  3. 3.
    Roll an n-faced die until a face < n appears: mean tosses?
  4. 4.
    From 20 baskets with 6 defective, draw 2 without replacement: E(defectives)?

From the bank · past-year question

Example 7Probability DistributionMODERATE
From a lot of 20 baskets, which includes 6 defective baskets, a sample of 2 baskets is drawn at random one by one without replacement. The expected value of number of defective baskets is

[Q104 · 11th May Shift 1 · 2024]

'Until success' means geometric, mean = 1/p

The expected number of trials to the first success is 1p\dfrac1p. Rolling an n-faced die until a number <n< n appears has p=n1np = \dfrac{n-1}{n}, so the mean is nn1\dfrac{n}{n-1} — don't confuse this with the value of a single roll.

Hypergeometric mean is nK/N — no replacement needed for the mean

Even though sampling is without replacement, the expected count of successes is simply nKN\dfrac{nK}{N} (the same as the with-replacement binomial mean). You do NOT need to build the full pmf just to get the mean.

For E(X²) build the small combination pmf first

A question asking for 2E(X)+3E(X2)2E(X)+3E(X^2) (e.g. queens in 2 cards) needs both moments: build P(X=0),P(X=1),P(X=2)P(X=0),P(X=1),P(X=2) from ()\binom{}{}-ratios, then compute E(X)=kPE(X)=\sum kP and E(X2)=k2PE(X^2)=\sum k^2 P with the SAME probabilities.

Summary — formulas & gotchas at a glance

A revision cheat-sheet for the formulas and gotchas above. Click any concept name to jump back to its full explanation.

Formulas (7)

Watch out for (20)

Mastery check — 5 interleaved questions

Try each one before clicking. Questions are interleaved across the concepts above, not grouped — interleaving sharpens transfer.

Example 1Probability DistributionMODERATE
For the probability distribution: x:0,1,2,3,4,5x: 0,1,2,3,4,5; p(x):k,0.3,0.15,0.15,0.1,2kp(x): k, 0.3, 0.15, 0.15, 0.1, 2k. The Expected value of X is

[Q102 · 15th May Shift 1 · 2023]

Example 2Probability DistributionMODERATE
The probability distribution of a random variable X is given by: X = 0,1,2,3,4 with P(X) = 0.4, 0.3, 0.1, 0.1, 0.1. Then the variance of X is

[Q105 · 2nd May Shift 2 · 2023]

Example 3Probability DistributionMODERATE
Player tosses 2 fair coins: wins ₹5 (2H), ₹2 (1H), ₹1 (0H). Variance of winning amount is

[Q131 · 10th May Shift 1 · 2024]

Example 4Probability DistributionMODERATE
The p.m.f. of a random variable X is P(x)=2xn(n+1),  x=1,2,,nP(x) = \frac{2x}{n(n+1)},\; x=1,2,\ldots,n, then E(x)E(x) is

[Q128 · 14th May Shift 2 · 2024]

Example 5Probability DistributionHARD
A random variable X takes values 1,0,1,2-1,0,1,2 with probabilities 1+3p4,1p4,1+2p4,14p4\frac{1+3p}{4},\frac{1-p}{4},\frac{1+2p}{4},\frac{1-4p}{4} respectively, where pp varies over R\mathbb{R}. Then the minimum and maximum values of the mean of X are respectively.

[Q108 · 9th May Shift 1 · 2023]

Drill every past-year question on this subtopic

37 questions from the bank — paginated, with cart and Word-export support.

Related notes