How do we measure the center and spread of a quantitative variable with numbers?
Topic 1.7 Summary Statistics for a Quantitative Variable: calculate and interpret measures of center (mean, median) and spread (range, IQR, standard deviation, variance), and judge their resistance to outliers.
A focused answer to AP Statistics Topic 1.7, defining and computing the mean, median, range, IQR, variance, and standard deviation, explaining resistance to outliers, with full worked calculations.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 1.7) wants you to compute and interpret numerical summaries of a quantitative variable: measures of center (mean, median) and spread (range, interquartile range, variance, standard deviation), and to know which are resistant to outliers and skew.
Measures of center
This is why the mean and median split apart in a skewed distribution: in a right-skewed set the long high tail drags the mean above the median, and in a left-skewed set it pulls the mean below. Comparing the two is itself a quick read on skew: mean greater than median suggests right skew, mean less than median suggests left skew, and roughly equal suggests symmetry.
Measures of spread
The standard deviation deserves a careful reading. You find each value's deviation from the mean, square it (so that positive and negative deviations do not cancel and large deviations count more), average the squared deviations using , and take the square root to return to the original units. The division by rather than is the sample standard deviation, which is what the AP course and your calculator use for sample data.
Quartiles and the five-number summary
The quartiles divide ordered data into four equal-count parts: is the median of the lower half, is the median of the upper half, and the median itself is . Together with the minimum and maximum they form the five-number summary (), which is exactly what a boxplot draws in the next topic. A small but exam-relevant subtlety is how you split the data to find the quartiles when is odd: the AP convention excludes the overall median from both halves, so for the lower half is and the upper half is , giving and .
Choosing resistant or sensitive measures
The deciding idea of this topic is resistance: a statistic is resistant if a few extreme values barely change it. The median and IQR are resistant because they depend on position (the middle, the quartiles), not on the actual size of extreme values; the mean, range, and standard deviation are sensitive because they use every value's magnitude, so a single large outlier can move them substantially. This drives a simple, heavily examined rule. When a distribution is skewed or has outliers, report the median and IQR, because they describe the typical value and typical spread without being distorted by the tail. When a distribution is roughly symmetric with no outliers, report the mean and standard deviation, because they use all the information and are the foundation for the normal model and later inference. Markers frequently award a point purely for choosing the resistant pair (and saying why) when the data are skewed, so naming resistance explicitly is worth easy credit. The complementary insight is diagnostic: if you are told the mean is much larger than the median, you can infer right skew or a high outlier even without seeing the data, which is a favorite multiple-choice move.
Try this
Q1. A data set has mean and median . What does this suggest about its shape? [1 point]
- Cue. The mean exceeds the median, suggesting the distribution is skewed right (a high tail pulls the mean up).
Q2. Find the IQR of . [2 points]
- Cue. Median is ; lower half gives , upper half gives , so .
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2019 (style)1 marksSection I (multiple choice). A data set is strongly skewed right with a few very large values. Which pair of statistics best describes its center and spread? (A) Mean and standard deviation (B) Median and IQR (C) Mean and range (D) Mode and varianceShow worked answer β
The correct answer is (B).
For a skewed distribution or one with outliers, the resistant measures are the median (center) and the IQR (spread); neither is pulled by the extreme values.
(A) the mean and standard deviation are both sensitive to the long tail, so they overstate the center and spread here. (C) and (D) mix a non-resistant measure with another. The rule is: skewed or outliers means median and IQR.
AP 2021 (style)4 marksSection II (free response). A small business records the number of orders on days: . (a) Calculate the mean and the median. (b) Explain why they differ so much. (c) State, with reason, which is the better measure of a typical day, and calculate the IQR to describe spread resistantly.Show worked answer β
A 4-point computation-and-reasoning question.
(a) (1 point) Mean . Median the th of ordered values .
(b) (1 point) They differ because the value is a high outlier that inflates the mean (which uses every value) but barely moves the median (the middle value).
(c) (2 points) The median () is the better measure of a typical day because it is resistant to the outlier, while the mean () is dragged up by the single busy day (1 point). IQR: is the median of the lower half which is ; is the median of the upper half which is ; so IQR orders (1 point).
Markers reward correct mean and median, attributing the gap to the outlier, choosing the median with a resistance reason, and a correct IQR.
Related dot points
- Topic 1.6 Describing the Distribution of a Quantitative Variable: describe a quantitative distribution by its shape, center, spread, and unusual features (outliers, gaps, clusters) in context.
A focused answer to AP Statistics Topic 1.6, the SOCS framework for describing a quantitative distribution by shape, outliers, center, and spread, with the vocabulary of skew, modality, and clusters, and worked descriptions.
- Topic 1.8 Graphical Representations of Summary Statistics: construct and interpret boxplots from the five-number summary, and identify outliers using the 1.5 times IQR rule.
A focused answer to AP Statistics Topic 1.8, on building and reading boxplots from the five-number summary, the 1.5 times IQR rule for outliers, and what a boxplot does and does not reveal, with a worked construction.
- Topic 1.9 Comparing Distributions of a Quantitative Variable: compare two or more distributions of a quantitative variable by shape, center, spread, and unusual features, in context, using comparative language.
A focused answer to AP Statistics Topic 1.9, on comparing two or more distributions by shape, center, spread, and unusual features using explicit comparative language, with a worked side-by-side comparison.
- Topic 1.5 Representing a Quantitative Variable with Graphs: construct and interpret dotplots, stem-and-leaf plots, and histograms for a quantitative variable, and choose an appropriate display.
A focused answer to AP Statistics Topic 1.5, on displaying a quantitative variable with dotplots, stem-and-leaf plots, and histograms, choosing bin widths, and reading the displays, with a worked histogram construction.
- Topic 1.10 The Normal Distribution: use z-scores, the empirical (68-95-99.7) rule, and the standard normal model to find proportions and percentiles for approximately normal data.
A focused answer to AP Statistics Topic 1.10, on the normal model, standardizing with z-scores, the 68-95-99.7 empirical rule, and finding proportions and percentiles, with full worked z-score and normal-area calculations.
Sources & how we know this
- AP Statistics Course and Exam Description β College Board (2020)