Skip to main content
United StatesStatisticsSyllabus dot point

How do we measure the center and spread of a quantitative variable with numbers?

Topic 1.7 Summary Statistics for a Quantitative Variable: calculate and interpret measures of center (mean, median) and spread (range, IQR, standard deviation, variance), and judge their resistance to outliers.

A focused answer to AP Statistics Topic 1.7, defining and computing the mean, median, range, IQR, variance, and standard deviation, explaining resistance to outliers, with full worked calculations.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Measures of center
  3. Measures of spread
  4. Quartiles and the five-number summary
  5. Choosing resistant or sensitive measures
  6. Try this

What this topic is asking

The College Board (Topic 1.7) wants you to compute and interpret numerical summaries of a quantitative variable: measures of center (mean, median) and spread (range, interquartile range, variance, standard deviation), and to know which are resistant to outliers and skew.

Measures of center

This is why the mean and median split apart in a skewed distribution: in a right-skewed set the long high tail drags the mean above the median, and in a left-skewed set it pulls the mean below. Comparing the two is itself a quick read on skew: mean greater than median suggests right skew, mean less than median suggests left skew, and roughly equal suggests symmetry.

Measures of spread

The standard deviation deserves a careful reading. You find each value's deviation from the mean, square it (so that positive and negative deviations do not cancel and large deviations count more), average the squared deviations using nβˆ’1n - 1, and take the square root to return to the original units. The division by nβˆ’1n - 1 rather than nn is the sample standard deviation, which is what the AP course and your calculator use for sample data.

Quartiles and the five-number summary

The quartiles divide ordered data into four equal-count parts: Q1Q_1 is the median of the lower half, Q3Q_3 is the median of the upper half, and the median itself is Q2Q_2. Together with the minimum and maximum they form the five-number summary (min,Q1,median,Q3,max\text{min}, Q_1, \text{median}, Q_3, \text{max}), which is exactly what a boxplot draws in the next topic. A small but exam-relevant subtlety is how you split the data to find the quartiles when nn is odd: the AP convention excludes the overall median from both halves, so for {4,6,6,8,9,11,40}\{4, 6, 6, 8, 9, 11, 40\} the lower half is {4,6,6}\{4, 6, 6\} and the upper half is {9,11,40}\{9, 11, 40\}, giving Q1=6Q_1 = 6 and Q3=11Q_3 = 11.

Choosing resistant or sensitive measures

The deciding idea of this topic is resistance: a statistic is resistant if a few extreme values barely change it. The median and IQR are resistant because they depend on position (the middle, the quartiles), not on the actual size of extreme values; the mean, range, and standard deviation are sensitive because they use every value's magnitude, so a single large outlier can move them substantially. This drives a simple, heavily examined rule. When a distribution is skewed or has outliers, report the median and IQR, because they describe the typical value and typical spread without being distorted by the tail. When a distribution is roughly symmetric with no outliers, report the mean and standard deviation, because they use all the information and are the foundation for the normal model and later inference. Markers frequently award a point purely for choosing the resistant pair (and saying why) when the data are skewed, so naming resistance explicitly is worth easy credit. The complementary insight is diagnostic: if you are told the mean is much larger than the median, you can infer right skew or a high outlier even without seeing the data, which is a favorite multiple-choice move.

Try this

Q1. A data set has mean 2020 and median 1414. What does this suggest about its shape? [1 point]

  • Cue. The mean exceeds the median, suggesting the distribution is skewed right (a high tail pulls the mean up).

Q2. Find the IQR of {3,5,7,8,12,15,19}\{3, 5, 7, 8, 12, 15, 19\}. [2 points]

  • Cue. Median is 88; lower half {3,5,7}\{3, 5, 7\} gives Q1=5Q_1 = 5, upper half {12,15,19}\{12, 15, 19\} gives Q3=15Q_3 = 15, so IQR=15βˆ’5=10\text{IQR} = 15 - 5 = 10.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). A data set is strongly skewed right with a few very large values. Which pair of statistics best describes its center and spread? (A) Mean and standard deviation (B) Median and IQR (C) Mean and range (D) Mode and variance
Show worked answer β†’

The correct answer is (B).

For a skewed distribution or one with outliers, the resistant measures are the median (center) and the IQR (spread); neither is pulled by the extreme values.

(A) the mean and standard deviation are both sensitive to the long tail, so they overstate the center and spread here. (C) and (D) mix a non-resistant measure with another. The rule is: skewed or outliers means median and IQR.

AP 2021 (style)4 marksSection II (free response). A small business records the number of orders on 77 days: 4,6,6,8,9,11,404, 6, 6, 8, 9, 11, 40. (a) Calculate the mean and the median. (b) Explain why they differ so much. (c) State, with reason, which is the better measure of a typical day, and calculate the IQR to describe spread resistantly.
Show worked answer β†’

A 4-point computation-and-reasoning question.

(a) (1 point) Mean =4+6+6+8+9+11+407=847=12= \frac{4+6+6+8+9+11+40}{7} = \frac{84}{7} = 12. Median == the 44th of 77 ordered values =8= 8.
(b) (1 point) They differ because the value 4040 is a high outlier that inflates the mean (which uses every value) but barely moves the median (the middle value).
(c) (2 points) The median (88) is the better measure of a typical day because it is resistant to the outlier, while the mean (1212) is dragged up by the single busy day (1 point). IQR: Q1Q_1 is the median of the lower half {4,6,6}\{4,6,6\} which is 66; Q3Q_3 is the median of the upper half {9,11,40}\{9,11,40\} which is 1111; so IQR =11βˆ’6=5= 11 - 6 = 5 orders (1 point).

Markers reward correct mean and median, attributing the gap to the outlier, choosing the median with a resistance reason, and a correct IQR.

Related dot points

Sources & how we know this