NDA Maths · Statistics

Dispersion — Standard Deviation, Variance, Mean Deviation

How spread out the data is around its centre — mean deviation, variance and standard deviation each measure spread on a different scale.

Why this matters

44 PYQs across 2017–2026. 22 EASY + 18 MODERATE + 4 HARD — almost every paper has one. The favourite shapes are linear-transformation effects on SD, the computational identity that links mean-of-squares to mean-squared and variance, and coefficient-of-variation comparisons. Master the six concepts below and dispersion becomes formulaic, not intimidating.

Concept 1 of 6

Mean Deviation

Intuition

Average distance between each observation and a chosen centre — almost always the mean or the median. It is the simplest measure of spread, but its absolute-value formula makes algebra clunky, which is why SD is preferred in higher statistics.

Definition

For nn observations x1,,xnx_1,\ldots,x_n and a reference value AA, the mean deviation about AA is the average of the absolute deviations from AA. When AA is the median, the mean deviation is minimum among all possible AA.

Mean Deviation about A

MD(A)=1ni=1nxiA\text{MD}(A) = \dfrac{1}{n}\sum_{i=1}^{n}|x_i - A|
  • AAreference point (typically mean or median)
  • xiA|x_i - A|absolute deviation of xix_i from AA

Diagram · mean deviation = average distance from the centre

mean = 523415094
MD = (3 + 1 + 0 + 4) / 4 = 2

Take each value's distance from the centre (the red segments, signs dropped) and average them. Mean deviation can be taken about the mean or the median; about the median it is smallest.

Worked example

Find the mean deviation of 2,4,6,8,102, 4, 6, 8, 10 about the mean.
  1. Compute the mean: xˉ=2+4+6+8+105=6\bar{x} = \dfrac{2+4+6+8+10}{5} = 6.
  2. Compute absolute deviations from 6: 26,46,66,86,106=4,2,0,2,4|2-6|, |4-6|, |6-6|, |8-6|, |10-6| = 4, 2, 0, 2, 4.
  3. Sum the absolute deviations: 4+2+0+2+4=124+2+0+2+4 = 12.
  4. Divide by nn: MD=125=2.4\text{MD} = \dfrac{12}{5} = 2.4.
Answer:MD=2.4\text{MD} = 2.4
Practice this conceptself-check · 4 quick reps

Try it yourself

Find the mean deviation of 3,6,9,123, 6, 9, 12 about the mean.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Mean deviation of 2,4,62, 4, 6 about the mean (xˉ=4)(\bar{x}=4)?
  2. 2.
    Mean deviation of 1,2,31, 2, 3 about the mean?
  3. 3.
    Mean deviation of 10,2010, 20 about the mean?
  4. 4.
    Mean deviation of 5,5,55, 5, 5?

From the bank · past-year question

Example 1StatisticsEASY
What is the mean deviation of the first 10 natural numbers?

[Q110 · Sep · 2024]

Mean deviation about median is always \leq about the mean

Among all reference points AA, the median minimises xiA\sum|x_i - A|. If a PYQ asks for the minimum mean deviation, the answer uses the median, not the mean.

Concept 2 of 6

Variance

Intuition

Average of the squared distances from the mean. Squaring kills the absolute-value bracket and gives variance much better algebraic properties — but the unit becomes the data's unit squared (cm becomes cm², rupees becomes rupees²), which is why we usually report its square root, the standard deviation.

Definition

For nn observations with mean xˉ\bar{x}, the variance is the average of the squared deviations from xˉ\bar{x}. It can also be computed using the identity σ2=x2xˉ2\sigma^2 = \overline{x^2} - \bar{x}^2, where x2\overline{x^2} is the mean of the squares.

Variance — two equivalent forms

σ2=1ni=1n(xixˉ)2=xi2nxˉ2\sigma^2 = \dfrac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2 = \dfrac{\sum x_i^2}{n} - \bar{x}^2
  • σ2\sigma^2variance
  • xˉ\bar{x}arithmetic mean
  • xi2\sum x_i^2sum of squares of observations

Visualization · move the points, watch the squared deviations

05101520mean = 6.00
Variance σ² = 8.00SD σ = 2.83Σ(xᵢ − x̄)² = 40.00

Each red square has side |xᵢ − x̄|, so its AREA is the squared deviation. Variance is the AVERAGE area. Pull all points toward the mean — every square shrinks. Pull them apart — they grow.

Worked example

Find the variance of 2,4,6,8,102, 4, 6, 8, 10.
  1. Compute the mean: xˉ=6\bar{x} = 6.
  2. Compute squared deviations: (26)2,(46)2,(66)2,(86)2,(106)2=16,4,0,4,16(2-6)^2, (4-6)^2, (6-6)^2, (8-6)^2, (10-6)^2 = 16, 4, 0, 4, 16.
  3. Sum the squared deviations: 16+4+0+4+16=4016 + 4 + 0 + 4 + 16 = 40.
  4. Divide by nn: σ2=405=8\sigma^2 = \dfrac{40}{5} = 8.
Answer:σ2=8\sigma^2 = 8
Practice this conceptself-check · 4 quick reps

Try it yourself

For 5 observations, xi=30\sum x_i = 30 and xi2=220\sum x_i^2 = 220. Find the variance.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Variance of 1,2,31, 2, 3 (xˉ=2)(\bar{x}=2)?
  2. 2.
    Variance of 2,4,62, 4, 6 (xˉ=4)(\bar{x}=4)?
  3. 3.
    Variance of 5,5,55, 5, 5?
  4. 4.
    Variance of 0,100, 10 (xˉ=5)(\bar{x}=5)?

From the bank · past-year question

Example 2StatisticsMODERATE
If i=110xi=110\sum_{i=1}^{10}x_{i}=110 and i=110xi2=1540\sum_{i=1}^{10}x_{i}^{2}=1540, then what is the variance?

[Q116 · Sep · 2021]

Computational form saves time on Σxi2\Sigma x_i^2-style PYQs

When you are given xi\sum x_i and xi2\sum x_i^2 directly, use σ2=x2xˉ2\sigma^2 = \overline{x^2} - \bar{x}^2 — not the original definition. NDA papers favour this shape because it tests whether you remember the identity.
Drill 6 more on variance

Concept 3 of 6

Standard Deviation

Intuition

Square root of the variance — brings the unit back to the data's original unit. SD is what statisticians actually report when they talk about "spread". If the data is in centimetres, SD is in centimetres; if in rupees, SD is in rupees. Variance lives in squared units.

Definition

The standard deviation is the non-negative square root of the variance. It has the same unit as the original observations.

Standard Deviation

σ=σ2=1ni=1n(xixˉ)2\sigma = \sqrt{\sigma^2} = \sqrt{\dfrac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2}
  • σ\sigmastandard deviation (always 0\geq 0)

Worked example

Find the standard deviation of 2,4,6,8,102, 4, 6, 8, 10.
  1. From the variance example above, σ2=8\sigma^2 = 8.
  2. Take the square root: σ=8=22\sigma = \sqrt{8} = 2\sqrt{2}.
  3. Numerically, σ2.83\sigma \approx 2.83.
Answer:σ=222.83\sigma = 2\sqrt{2} \approx 2.83
Practice this conceptself-check · 4 quick reps

Try it yourself

Find the standard deviation of 1,2,3,4,51, 2, 3, 4, 5.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Variance 1616. Standard deviation?
  2. 2.
    Variance 4949. Standard deviation?
  3. 3.
    Variance 22. Standard deviation?
  4. 4.
    SD of 3,3,33, 3, 3?

From the bank · past-year question

Example 3StatisticsMODERATE
If the mean and sum of squares of 10 observations are 40 and 16160 respectively, then what is the standard deviation?

[Q103 · Sep · 2023]

SD and mean deviation share units; variance does not

If the data is measured in cm, SD and MD are also in cm but variance is in cm². A PYQ asks "which has the same unit as the mean?" — answer is SD or MD, not variance.

Concept 4 of 6

Linear Transformation of SD and Variance

Intuition

Adding a constant shifts every value but the spread between them is unchanged — so shift has zero effect on SD or variance. Multiplying by aa stretches all distances by a|a|, so SD scales by a|a| and variance by a2a^2.

Definition

If Y=aX+bY = aX + b is a linear transformation of XX, then the variance of YY is a2a^2 times the variance of XX, and the standard deviation of YY is a|a| times the SD of XX. The shift bb has no effect on either.

Variance and SD under Y = aX + b

Var(Y)=a2Var(X)σY=aσX\text{Var}(Y) = a^2\,\text{Var}(X) \qquad \sigma_Y = |a|\,\sigma_X
  • aascale factor
  • bbshift — irrelevant for dispersion

Worked example

The SD of a dataset XX is 4. Find the SD of Y=3X+7Y = 3X + 7.
  1. Identify the transformation: a=3, b=7a = 3,\ b = 7.
  2. The shift b=7b = 7 has no effect on SD.
  3. The multiplier a=3a = 3 scales SD by 3=3|3| = 3.
  4. Therefore σY=3×4=12\sigma_Y = 3 \times 4 = 12.
Answer:σY=12\sigma_Y = 12
Practice this conceptself-check · 4 quick reps

Try it yourself

If the SD of XX is 66, find the SD of Y=2X+10Y = -2X + 10.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    SD is 55. SD of 2X2X?
  2. 2.
    SD is 44. SD of X+100X + 100?
  3. 3.
    Variance is 99. Variance of 3X3X?
  4. 4.
    SD is 66. SD of X-X?

From the bank · past-year question

Example 4StatisticsEASY
The standard deviation of 100 observations is 10. If 5 is multiplied to each of the observations, then what is the new standard deviation?

[Q115 · Apr · 2026]

Squaring aa for variance, taking absolute value for SD

Students often write σY2=aσX2\sigma_Y^2 = a\,\sigma_X^2 (forgetting the square) or σY=aσX\sigma_Y = a\,\sigma_X (forgetting the modulus). If aa is negative, a|a| is the correct scale for SD — SD is non-negative by definition.

Concept 5 of 6

Coefficient of Variation (CV)

Intuition

SD measures absolute spread, but a SD of 10 means different things on a salary scale (small) versus a marks scale (large). CV normalises SD by the mean and reports a unitless percentage, so you can compare variability across totally different datasets.

Definition

The coefficient of variation is the ratio of the standard deviation to the arithmetic mean, expressed as a percentage. The dataset with the higher CV is considered more variable.

Coefficient of Variation

CV=σxˉ×100%\text{CV} = \dfrac{\sigma}{\bar{x}} \times 100 \%
  • σ\sigmastandard deviation
  • xˉ\bar{x}arithmetic mean

Worked example

A dataset has mean 5050 and standard deviation 1010. Find its coefficient of variation.
  1. Apply the formula: CV=σxˉ×100\text{CV} = \dfrac{\sigma}{\bar{x}} \times 100.
  2. Substitute: CV=1050×100\text{CV} = \dfrac{10}{50} \times 100.
  3. Compute: CV=0.2×100=20%\text{CV} = 0.2 \times 100 = 20\%.
Answer:CV=20%\text{CV} = 20\%
Practice this conceptself-check · 4 quick reps

Try it yourself

Dataset A has mean 100100, SD 1515. Dataset B has mean 4040, SD 88. Which is more variable?

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Mean 5050, SD 55. CV?
  2. 2.
    Mean 2020, SD 44. CV?
  3. 3.
    Mean 100100, SD 2525. CV?
  4. 4.
    Mean 4040, SD 00. CV?

From the bank · past-year question

Example 5StatisticsEASY
If the mean of a frequency distribution is 100 and the coefficient of variation is 45%, then what is the value of the variance?

[Q109 · Apr · 2021]

CV is unitless — that's the entire point

Some answers give CV with a unit attached. Wrong. CV is a percentage. If the question compares two datasets with different units (e.g. height in cm vs weight in kg), only CV makes a fair comparison — not SD.

Concept 6 of 6

Computational Identity & Minimum-SSE Property

Intuition

Two identities that NDA papers exploit relentlessly. First: the mean of the squares equals the variance plus the square of the mean. Second: among all reference points, the sum of squared deviations is minimised when the reference is the mean (the median minimises absolute deviations; the mean minimises squared deviations).

Definition

From σ2=x2xˉ2\sigma^2 = \overline{x^2} - \bar{x}^2 follows the identity xi2=n(σ2+xˉ2)\sum x_i^2 = n(\sigma^2 + \bar{x}^2). Also, the function f(a)=(xia)2f(a) = \sum (x_i - a)^2 is minimised when a=xˉa = \bar{x}.

Two load-bearing identities

xi2n=xˉ2+σ2andargminai(xia)2=xˉ\dfrac{\sum x_i^2}{n} = \bar{x}^2 + \sigma^2 \qquad \text{and} \qquad \arg\min_{a}\sum_{i}(x_i - a)^2 = \bar{x}

Worked example

Given xi=50\sum x_i = 50 and xi2=530\sum x_i^2 = 530 for n=10n = 10 observations, find the variance.
  1. Compute the mean: xˉ=5010=5\bar{x} = \dfrac{50}{10} = 5.
  2. Use the identity σ2=xi2nxˉ2\sigma^2 = \dfrac{\sum x_i^2}{n} - \bar{x}^2.
  3. Substitute: σ2=5301052=5325\sigma^2 = \dfrac{530}{10} - 5^2 = 53 - 25.
  4. Compute: σ2=28\sigma^2 = 28.
Answer:σ2=28\sigma^2 = 28
Practice this conceptself-check · 4 quick reps

Try it yourself

If the mean of 4 observations is 5 and the variance is 4, find xi2\sum x_i^2.

Practice — Level 1 (4 reps)

Quick reps to lock in the method. Try each, then check.

  1. 1.
    Mean of squares 3030, mean 55. Variance?
  2. 2.
    n=5n=5, x2=100\sum x^2 = 100, xˉ=4\bar{x}=4. Variance?
  3. 3.
    Variance 99, mean 44. Mean of squares?
  4. 4.
    Mean of squares 5050, mean 77. Variance?

From the bank · past-year question

Example 6StatisticsMODERATE
Let i=19xi2=855\displaystyle\sum_{i=1}^{9}x_i^2=855. If MM is the mean and σ\sigma is the standard deviation of x1,x2,,x9x_1,x_2,\ldots,x_9, then what is the value of M2+σ2M^2+\sigma^2?

[Q111 · Sep · 2024]

x2(xˉ)2\overline{x^2} \neq (\bar{x})^2 — they differ by exactly σ2\sigma^2

Mean of the squares is NOT the square of the mean. Their difference is the variance: x2xˉ2=σ20\overline{x^2} - \bar{x}^2 = \sigma^2 \geq 0. PYQs plant this trap by asking for "mean of squares" or "M2+σ2M^2 + \sigma^2" and expecting you to recognise it as xi2/n\sum x_i^2/n.

Scaling inside the deviation moves the minimiser too

For S(a)=(cxia)2S(a) = \sum (c\,x_i - a)^2, expand and minimise: the minimum is at a=cxˉa = c\,\bar{x}, NOT a=xˉa = \bar{x}. PYQs commonly use c=2c = 2 (e.g. S=(2xia)2S = \sum (2x_i - a)^2) and expect you to identify the minimiser as 2xˉ2\bar{x} — twice the mean, not the mean.

Summary — formulas & gotchas at a glance

A revision cheat-sheet for the formulas and gotchas above. Click any concept name to jump back to its full explanation.

Formulas (6)

  • Mean Deviation

    Mean Deviation about A

    MD(A)=1ni=1nxiA\text{MD}(A) = \dfrac{1}{n}\sum_{i=1}^{n}|x_i - A|
  • Variance

    Variance — two equivalent forms

    σ2=1ni=1n(xixˉ)2=xi2nxˉ2\sigma^2 = \dfrac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2 = \dfrac{\sum x_i^2}{n} - \bar{x}^2
  • Standard Deviation

    Standard Deviation

    σ=σ2=1ni=1n(xixˉ)2\sigma = \sqrt{\sigma^2} = \sqrt{\dfrac{1}{n}\sum_{i=1}^{n}(x_i - \bar{x})^2}
  • Linear Transformation of SD and Variance

    Variance and SD under Y = aX + b

    Var(Y)=a2Var(X)σY=aσX\text{Var}(Y) = a^2\,\text{Var}(X) \qquad \sigma_Y = |a|\,\sigma_X
  • Coefficient of Variation (CV)

    Coefficient of Variation

    CV=σxˉ×100%\text{CV} = \dfrac{\sigma}{\bar{x}} \times 100 \%
  • Computational Identity & Minimum-SSE Property

    Two load-bearing identities

    xi2n=xˉ2+σ2andargminai(xia)2=xˉ\dfrac{\sum x_i^2}{n} = \bar{x}^2 + \sigma^2 \qquad \text{and} \qquad \arg\min_{a}\sum_{i}(x_i - a)^2 = \bar{x}

Watch out for (7)

Mastery check — 5 interleaved questions

Try each one before clicking. Questions are interleaved across the concepts above, not grouped — interleaving sharpens transfer.

Example 1StatisticsEASY
Consider the following statements: Statement 1: Range is not a good measure of dispersion. Statement 2: Range is highly affected by the existence of extreme values. Which one of the following is correct in respect of the above statements?

[Q110 · Apr · 2017]

Example 2StatisticsMODERATE
The mean and variance of five observations are 14 and 13.2 respectively. Three of the five observations are 11, 16 and 20. What are the other two observations?

[Q101 · Apr · 2023]

Example 3StatisticsEASY
Consider the following statements: I. Mean and variance have the same unit of measurement. II. Mean deviation and standard deviation have the same unit of measurement. Which of the statements given above is/are correct?

[Q120 · Sep · 2025]

Example 4StatisticsEASY
The variance of 25 observations is 4. If 2 is added to each observation, then the new variance of the resulting observations is

[Q110 · Sep · 2018]

Example 5StatisticsEASY
Mean=5, SD=2. If 5 added to each value, what is CV of new set?

[Q112 · Apr · 2018]

Drill every past-year question on this subtopic

44 questions from the bank — paginated, with cart and Word-export support.

Related notes