NDA Maths · Statistics
Foundations + Measures of Central Tendency
A single value that summarises where a dataset is centred — mean, median, or mode.
Why this matters
75 PYQs across 2017–2026 — the biggest subtopic in NDA Statistics. Most questions test linear-transformation effects on the mean, grouped-data calculations, replacement / wrong-value corrections, special-case mean shortcuts, the combined-mean of two groups, or the sum-of-deviations identity. Master the eleven concepts below and you cover the entire EASY + MODERATE bandwidth reliably.
Concept 1 of 17
What is data, and why summarise it?
Intuition
Definition
Data is the set of observed values of a variable measured on a collection of items. The full collection is the POPULATION; a subset actually observed is a SAMPLE. Statistics builds summary measures from samples to draw conclusions about the population.
Worked example
- Both datasets sum to 180, so the arithmetic mean is identical: 60.
- But Class A is constant; Class B varies between 40 and 80.
- The mean alone hides the SPREAD. We need a second summary (a dispersion measure) to capture it — that's why this chapter has two parts: tendency and dispersion.
Concept 2 of 17
Types of data — qualitative vs quantitative, discrete vs continuous
Intuition
Definition
- Qualitative (categorical): takes labels, not numbers — operations like mean don't apply.
- Quantitative — discrete: numerical, jumping in integer-sized steps (kids per family, integer marks).
- Quantitative — continuous: numerical, filling an interval smoothly (height, weight, time).
Worked example
- (i) Eye colour — labels (blue, brown, green) — QUALITATIVE.
- (ii) Number of siblings — whole-number counts — QUANTITATIVE, DISCRETE.
- (iii) Running time — any real number such as 12.47 s — QUANTITATIVE, CONTINUOUS.
- (iv) Shirt size — ordered labels (S < M < L < XL) — still QUALITATIVE (ordinal).
Concept 3 of 17
Frequency and tabulation
Intuition
Definition
A frequency distribution lists each distinct value (or class interval) alongside its frequency. The TOTAL FREQUENCY equals the number of observations: . Every chapter formula that involves grouped or repeated data uses this , not the count of distinct values.
Total frequency
- number of distinct values or class intervals
- frequency of the -th value/class
- total number of observations
Worked example
- Distinct values in ascending order: 4, 5, 6, 7.
- Tally: .
- Check — matches the original count.
Practice this concept4 quick reps
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.In , what is the frequency of ?
- 2.A frequency table has . Find .
- 3.In , what is ?
- 4.Frequencies — total observations ?
Concept 4 of 17
Class marks and class width (grouped data)
Intuition
Definition
For a class interval with lower bound and upper bound : the CLASS MARK is and the CLASS WIDTH is . All grouped-data formulas (mean, median, mode, variance) use the class mark as the representative value of the interval.
Class mark and class width
Worked example
- Lower bound , upper bound .
- Class mark: .
- Class width: .
Practice this concept4 quick reps
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Class mark of the interval –?
- 2.Class width of –?
- 3.Class mark of –?
- 4.Class width of –?
Concept 5 of 17
Summation notation Σ
Intuition
Definition
For a sequence , . Two identities are load-bearing throughout this chapter: (a constant summed times) and (linearity).
Definition + two identities
Worked example
- Apply linearity: .
- Substitute: .
Practice this concept4 quick reps
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.
- 2.If , find .
- 3.If , find .
- 4.
Concept 6 of 17
Weighted vs unweighted counting
Intuition
Definition
For raw data, the total is and the count is . For frequency-tabulated data with distinct values of frequency , the total is and the count is . Every measure has a "raw" form (unweighted) and a "grouped" form (weighted) — they're the same idea with frequencies multiplied in.
Worked example
- Weighted contributions: .
- Weighted total: .
- Total count: .
Practice this concept4 quick reps
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Value with frequency — its contribution to ?
- 2.with : ?
- 3.Value with frequency — contribution?
- 4.with : ?
Concept 7 of 17
Arithmetic Mean (raw data)
Intuition
Definition
For observations , the arithmetic mean is the total sum divided by the number of observations.
Arithmetic Mean
- the arithmetic mean
- the -th observation
- the total number of observations
Diagram · mean = the balance point
Treat each value as equal weight on a beam; the mean is the point where it balances. The pulls on the left (deviations −3, −1) exactly cancel those on the right (0, +4), which is the identity Σ(xᵢ − x̄) = 0. One extreme value drags the balance point toward it — why the mean is sensitive to outliers.
Worked example
- Add up all the values: .
- Count the observations: .
- Apply the formula: .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Mean of ?
- 2.Mean of ?
- 3.Mean of ?
- 4.Mean of ?
From the bank · past-year question
[Q108 · Apr · 2023]
Outliers move the mean — sometimes a lot
Concept 8 of 17
Arithmetic Mean (frequency / grouped data)
Intuition
Definition
If the value occurs with frequency , the mean is the frequency-weighted sum divided by the total frequency.
Frequency-weighted Mean
- value (or class mark for grouped data)
- frequency of
- total frequency = total observations
Worked example
- Compute .
- Compute .
- Apply the formula: .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.with : mean?
- 2.with : mean?
- 3.with : mean?
- 4.with : mean?
From the bank · past-year question
[Q117 · Sep · 2025]
Divide by , not by the number of classes
Concept 9 of 17
Linear Transformation of the Mean
Intuition
Definition
If a new variable is formed from each observation, the new mean is .
Linear transformation rule
- scale factor (multiplied)
- shift (added)
Worked example
- Identify the transformation: , so .
- Apply the rule: .
- Compute: .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Mean is . New mean if each value is multiplied by ?
- 2.Mean is . New mean if is added to each?
- 3.Mean is . New mean for ?
- 4.Mean is . New mean for ?
From the bank · past-year question
[Q113 · Apr · 2025]
Shift moves the mean, but not the SD
Concept 10 of 17
Replacement and Wrong-Value Correction of the Mean
Intuition
Definition
If the mean of observations is and a single value is replaced by , the new mean is . For a wrong-value correction, is what was recorded and is the correct value. For discards or additions, itself changes — reason about the new total directly.
Replacement rule (single observation, n unchanged)
- original mean
- number of observations (unchanged in pure replacement)
- the value being removed (or wrongly recorded)
- the value taking its place (or the correct one)
Worked example
- Identify the swap: wrong value , correct value , .
- Apply the rule: .
- Compute the correction: .
- Therefore .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Mean of obs is . A value is corrected to . New mean?
- 2.Mean of obs is . A value is corrected to . New mean?
- 3.Mean of obs is . A value is corrected to . New mean?
- 4.Mean of obs is . A value is corrected to . New mean?
From the bank · past-year question
[Q116 · Apr · 2021]
Divide by , not by 1
Discards: work with totals , not the rule directly
Concept 11 of 17
Special-Case Means — Consecutive Integers, Squares, AP, Binomial
Intuition
Definition
Three shortcuts are load-bearing: (a) Mean of consecutive integers from to is . (b) Mean of squares is . (c) For an AP, the mean equals the average of the first and last terms (or equivalently the middle term). For binomial-weighted means, the denominator is , not the number of terms.
Closed-form means for common sequences
- first and last integer of an arithmetic run
- number of terms (for the squares formula, the upper index)
- first and last term of an AP
Worked example
- These are the first perfect squares, so use the closed form for the mean of squares.
- Mean of to is .
- Substitute : .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Mean of ?
- 2.Mean of the first five even numbers ?
- 3.Mean of ?
- 4.Mean of an AP with first term and last term ?
From the bank · past-year question
[Q120 · Apr · 2025]
AP shortcut fails for GPs and other non-uniform spacings
Binomial-weighted means use
Concept 12 of 17
Combined Mean of Two Groups
Intuition
Definition
For group 1 of size with mean and group 2 of size with mean , the combined mean of the pooled dataset is the frequency-weighted average. Generalises to groups as a weighted average of the group means, with each weight equal to the group's size.
Combined mean of two groups
- sizes of the two groups
- means of the two groups
- combined mean of the pooled dataset
Worked example
- Identify the groups: .
- Compute group totals: ; .
- Apply the formula: .
- Compute: years.
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.boys mean , girls mean . Combined mean?
- 2.Two equal-size groups, means and . Combined mean?
- 3.obs mean , obs mean . Combined?
- 4.Group of mean , group of mean . Combined?
From the bank · past-year question
[Q105 · Apr · 2020]
Plain average of the two means is wrong unless
Reverse-solve: combined + group means give the size ratio
Concept 13 of 17
Median — Middle Value
Intuition
Definition
For raw data with sorted observations, the median is the middle value if is odd, and the average of the two middle values if is even. For grouped data, use the class-interval formula below.
Median (raw and grouped)
- lower bound of the median class
- cumulative frequency before the median class
- frequency of the median class
- class width
Diagram · median = the middle of sorted data
Sort first, then locate the middle position. With an odd count there is one middle value; with an even count the median is the average of the two middle values. It ignores how far the extremes lie — which is why it resists outliers better than the mean.
Worked example
- Sort ascending: .
- Count the observations: (odd).
- Median is the -th value, which is .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Median of ?
- 2.Median of ?
- 3.Median of ?
- 4.Median of ?
From the bank · past-year question
[Q105 · Apr · 2021]
Always sort before reading off the middle
Concept 14 of 17
Mode — Most Frequent Value
Intuition
Definition
For raw data, the mode is the value with the highest frequency. If multiple values tie for highest, the dataset is multimodal. For grouped data, use the class-interval formula below.
Mode (grouped data)
- lower bound of the modal class
- frequency of the modal class
- frequency of the class before
- frequency of the class after
- class width
Diagram · mode = the tallest bar
The mode is the value with the highest frequency — category D here. Data can have two modes (bimodal) or none (all equal); the mode is the only average that also works for non-numeric categories.
Worked example
- Tally each value's frequency: appears 3 times, every other value appears once.
- The highest frequency is 3, achieved only by the value .
- Therefore the mode is .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Mode of ?
- 2.Mode of ?
- 3.Mode of ?
- 4.Mode of ?
From the bank · past-year question
[Q119 · Apr · 2021]
Mode can be undefined or multimodal — don't force one answer
Concept 15 of 17
Geometric Mean (GM)
Intuition
Definition
For positive observations , the geometric mean is the -th root of their product.
Geometric Mean
- number of observations (all positive)
Worked example
- Multiply the values: .
- Take the -th root with : .
- Compute: .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.GM of and ?
- 2.GM of and ?
- 3.GM of ?
- 4.GM of and ?
From the bank · past-year question
[Q114 · Apr · 2021]
GM is only defined for positive numbers
Concept 16 of 17
Harmonic Mean (HM)
Intuition
Definition
For positive observations , the harmonic mean is divided by the sum of the reciprocals.
Harmonic Mean
- number of observations (all positive)
Worked example
- Sum of reciprocals: .
- Number of observations: .
- Apply the formula: .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.HM of and ?
- 2.HM of and ?
- 3.HM of and ?
- 4.Which is largest for distinct positives: AM, GM, or HM?
From the bank · past-year question
[Q114 · Sep · 2021]
Order is always
for two numbers
Concept 17 of 17
Sum of Deviations & the Empirical Relation
Intuition
Definition
for any dataset — this is a defining property of the mean. The empirical relation holds approximately for moderately skewed unimodal distributions and is used to recover the third measure when two are known.
Two identities to memorise
Diagram · mean, median & mode under skew
With a long right tail (positive skew) the mean is dragged toward it, giving Mode < Median < Mean (the order reverses for a left tail). This is the basis of the empirical relation Mode ≈ 3·Median − 2·Mean. For any data, Σ(xᵢ − x̄) = 0 — deviations above and below the mean always cancel.
Worked example
- Use the identity , which holds for any dataset.
- Verify by expansion: .
- Plugging in gives .
Practice this conceptself-check · 4 quick reps
Try it yourself
Practice — Level 1 (4 reps)
Quick reps to lock in the method. Try each, then check.
- 1.Sum of deviations of any dataset about its own mean?
- 2.Mean , median . Mode by the empirical relation?
- 3.Mean of numbers is . Find .
- 4.Mode , mean . Median by the empirical relation?
From the bank · past-year question
[Q107 · Apr · 2021]
Sum of deviations is zero only about the mean
Empirical relation is approximate, not exact
Summary — formulas & gotchas at a glance
A revision cheat-sheet for the formulas and gotchas above. Click any concept name to jump back to its full explanation.
Formulas (14)
- Frequency and tabulation
Total frequency
- Class marks and class width (grouped data)
Class mark and class width
- Summation notation Σ
Definition + two identities
- Arithmetic Mean (raw data)
Arithmetic Mean
- Arithmetic Mean (frequency / grouped data)
Frequency-weighted Mean
- Linear Transformation of the Mean
Linear transformation rule
- Replacement and Wrong-Value Correction of the Mean
Replacement rule (single observation, n unchanged)
- Special-Case Means — Consecutive Integers, Squares, AP, Binomial
Closed-form means for common sequences
- Combined Mean of Two Groups
Combined mean of two groups
- Median — Middle Value
Median (raw and grouped)
- Mode — Most Frequent Value
Mode (grouped data)
- Geometric Mean (GM)
Geometric Mean
- Harmonic Mean (HM)
Harmonic Mean
- Sum of Deviations & the Empirical Relation
Two identities to memorise
Watch out for (16)
- Outliers move the mean — sometimes a lot→ Arithmetic Mean (raw data)
- Divide by , not by the number of classes→ Arithmetic Mean (frequency / grouped data)
- Shift moves the mean, but not the SD→ Linear Transformation of the Mean
- Divide by , not by 1→ Replacement and Wrong-Value Correction of the Mean
- Discards: work with totals , not the rule directly→ Replacement and Wrong-Value Correction of the Mean
- AP shortcut fails for GPs and other non-uniform spacings→ Special-Case Means — Consecutive Integers, Squares, AP, Binomial
- Binomial-weighted means use→ Special-Case Means — Consecutive Integers, Squares, AP, Binomial
- Plain average of the two means is wrong unless→ Combined Mean of Two Groups
- Reverse-solve: combined + group means give the size ratio→ Combined Mean of Two Groups
- Always sort before reading off the middle→ Median — Middle Value
- Mode can be undefined or multimodal — don't force one answer→ Mode — Most Frequent Value
- GM is only defined for positive numbers→ Geometric Mean (GM)
- Order is always→ Harmonic Mean (HM)
- for two numbers→ Harmonic Mean (HM)
- Sum of deviations is zero only about the mean→ Sum of Deviations & the Empirical Relation
- Empirical relation is approximate, not exact→ Sum of Deviations & the Empirical Relation
Mastery check — 5 interleaved questions
Try each one before clicking. Questions are interleaved across the concepts above, not grouped — interleaving sharpens transfer.
[Q106 · Sep · 2017]
[Q120 · Apr · 2022]
[Q109 · Apr · 2019]
[Q112 · Sep · 2024]
[Q101 · Sep · 2022]
Drill every past-year question on this subtopic
75 questions from the bank — paginated, with cart and Word-export support.