United StatesStatisticsSyllabus dot point

How do we measure the center and spread of a quantitative variable with numbers?

Topic 1.7 Summary Statistics for a Quantitative Variable: calculate and interpret measures of center (mean, median) and spread (range, IQR, standard deviation, variance), and judge their resistance to outliers.

A focused answer to AP Statistics Topic 1.7, defining and computing the mean, median, range, IQR, variance, and standard deviation, explaining resistance to outliers, with full worked calculations.

Generated by Claude Opus 4.810 min answerUpdated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this topic is asking
Measures of center
Measures of spread
Quartiles and the five-number summary
Choosing resistant or sensitive measures
Try this

What this topic is asking

The College Board (Topic 1.7) wants you to compute and interpret numerical summaries of a quantitative variable: measures of center (mean, median) and spread (range, interquartile range, variance, standard deviation), and to know which are resistant to outliers and skew.

Measures of center

This is why the mean and median split apart in a skewed distribution: in a right-skewed set the long high tail drags the mean above the median, and in a left-skewed set it pulls the mean below. Comparing the two is itself a quick read on skew: mean greater than median suggests right skew, mean less than median suggests left skew, and roughly equal suggests symmetry.

Measures of spread

The standard deviation deserves a careful reading. You find each value's deviation from the mean, square it (so that positive and negative deviations do not cancel and large deviations count more), average the squared deviations using $n - 1$ , and take the square root to return to the original units. The division by $n - 1$ rather than $n$ is the sample standard deviation, which is what the AP course and your calculator use for sample data.

Quartiles and the five-number summary

The quartiles divide ordered data into four equal-count parts: $Q_1$ is the median of the lower half, $Q_3$ is the median of the upper half, and the median itself is $Q_2$ . Together with the minimum and maximum they form the five-number summary ( $\text{min}, Q_1, \text{median}, Q_3, \text{max}$ ), which is exactly what a boxplot draws in the next topic. A small but exam-relevant subtlety is how you split the data to find the quartiles when $n$ is odd: the AP convention excludes the overall median from both halves, so for $\{4, 6, 6, 8, 9, 11, 40\}$ the lower half is $\{4, 6, 6\}$ and the upper half is $\{9, 11, 40\}$ , giving $Q_1 = 6$ and $Q_3 = 11$ .

Choosing resistant or sensitive measures

The deciding idea of this topic is resistance: a statistic is resistant if a few extreme values barely change it. The median and IQR are resistant because they depend on position (the middle, the quartiles), not on the actual size of extreme values; the mean, range, and standard deviation are sensitive because they use every value's magnitude, so a single large outlier can move them substantially. This drives a simple, heavily examined rule. When a distribution is skewed or has outliers, report the median and IQR, because they describe the typical value and typical spread without being distorted by the tail. When a distribution is roughly symmetric with no outliers, report the mean and standard deviation, because they use all the information and are the foundation for the normal model and later inference. Markers frequently award a point purely for choosing the resistant pair (and saying why) when the data are skewed, so naming resistance explicitly is worth easy credit. The complementary insight is diagnostic: if you are told the mean is much larger than the median, you can infer right skew or a high outlier even without seeing the data, which is a favorite multiple-choice move.

Computing center and spread by hand

Find the mean, median, range, IQR, and standard deviation of $\{2, 4, 4, 5, 6, 9\}$ (already ordered, $n = 6$ ).

step 1 Mean

$\bar{x} = \dfrac{2 + 4 + 4 + 5 + 6 + 9}{6} = \dfrac{30}{6} = 5$ .

step 2 Median and range

With $n = 6$ , the median is the average of the $3$ rd and $4$ th values: $\frac{4 + 5}{2} = 4.5$ . The range is $9 - 2 = 7$ .

step 3 Quartiles and IQR

Lower half $\{2, 4, 4\}$ has median $Q_1 = 4$ ; upper half $\{5, 6, 9\}$ has median $Q_3 = 6$ . So $\text{IQR} = 6 - 4 = 2$ .

step 4 Standard deviation

Deviations from the mean $5$ : $-3, -1, -1, 0, 1, 4$ . Squared: $9, 1, 1, 0, 1, 16$ , summing to $28$ . Then

s = \sqrt{\frac{28}{6 - 1}} = \sqrt{\frac{28}{5}} = \sqrt{5.6} \approx 2.37.

step 5 Interpret

The typical value is around $4.5$ to $5$ , the middle half of the data spans only $2$ units (IQR), and values lie a typical distance of about $2.37$ from the mean. The value $9$ stretches the range to $7$ , illustrating why the range is the least resistant spread.

Try this

Q1. A data set has mean $20$ and median $14$ . What does this suggest about its shape? [1 point]

Cue. The mean exceeds the median, suggesting the distribution is skewed right (a high tail pulls the mean up).

Q2. Find the IQR of $\{3, 5, 7, 8, 12, 15, 19\}$ . [2 points]

Cue. Median is $8$ ; lower half $\{3, 5, 7\}$ gives $Q_1 = 5$ , upper half $\{12, 15, 19\}$ gives $Q_3 = 15$ , so $\text{IQR} = 15 - 5 = 10$ .

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). A data set is strongly skewed right with a few very large values. Which pair of statistics best describes its center and spread? (A) Mean and standard deviation (B) Median and IQR (C) Mean and range (D) Mode and variance

Show worked answer →

The correct answer is (B).

For a skewed distribution or one with outliers, the resistant measures are the median (center) and the IQR (spread); neither is pulled by the extreme values.

(A) the mean and standard deviation are both sensitive to the long tail, so they overstate the center and spread here. (C) and (D) mix a non-resistant measure with another. The rule is: skewed or outliers means median and IQR.

AP 2021 (style)4 marksSection II (free response). A small business records the number of orders on

7

days:

4, 6, 6, 8, 9, 11, 40

. (a) Calculate the mean and the median. (b) Explain why they differ so much. (c) State, with reason, which is the better measure of a typical day, and calculate the IQR to describe spread resistantly.

Show worked answer →

A 4-point computation-and-reasoning question.

(a) (1 point) Mean $= \frac{4+6+6+8+9+11+40}{7} = \frac{84}{7} = 12$ . Median $=$ the $4$ th of $7$ ordered values $= 8$ .
(b) (1 point) They differ because the value $40$ is a high outlier that inflates the mean (which uses every value) but barely moves the median (the middle value).
(c) (2 points) The median ( $8$ ) is the better measure of a typical day because it is resistant to the outlier, while the mean ( $12$ ) is dragged up by the single busy day (1 point). IQR: $Q_1$ is the median of the lower half $\{4,6,6\}$ which is $6$ ; $Q_3$ is the median of the upper half $\{9,11,40\}$ which is $11$ ; so IQR $= 11 - 6 = 5$ orders (1 point).

Markers reward correct mean and median, attributing the gap to the outlier, choosing the median with a resistance reason, and a correct IQR.

Related dot points

Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)