How do boxplots display the five-number summary, and how do we flag outliers formally?
Topic 1.8 Graphical Representations of Summary Statistics: construct and interpret boxplots from the five-number summary, and identify outliers using the 1.5 times IQR rule.
A focused answer to AP Statistics Topic 1.8, on building and reading boxplots from the five-number summary, the 1.5 times IQR rule for outliers, and what a boxplot does and does not reveal, with a worked construction.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 1.8) wants you to turn the five-number summary into a boxplot, read a boxplot, and apply the rule to flag outliers formally. It also wants you to know what a boxplot does and does not show.
The boxplot
The box itself spans the middle of the data (its width is the IQR), so the box shows where the bulk of the data sits and the line shows the center. A modified boxplot, which is what the AP course uses, applies the outlier rule so that the whiskers stop at the last non-outlier and outliers appear as dots; this is more informative than a plain boxplot whose whiskers always run to the min and max.
The 1.5 times IQR rule
This rule gives a reproducible, objective definition of "outlier," replacing the informal "looks far away" of Topic 1.6. Because the IQR is resistant, the fences are not themselves distorted by the very outliers they are detecting, which is the elegance of the rule.
Reading shape from a boxplot
A boxplot encodes skew through the relative lengths of its parts. If the median sits closer to (left side of the box) and the right whisker is longer, the distribution is skewed right; the mirror image is skewed left; a centered median with balanced whiskers suggests symmetry. Comparing the whisker lengths and the position of the median inside the box is the standard way to read shape from a boxplot, and it is faster than computing anything. This makes boxplots superb for comparing several groups at once: drawing parallel boxplots on the same axis lets you compare centers (median lines), spreads (box widths), and skew (whisker lengths) across groups in a single glance, which is exactly the task of the next topic.
What a boxplot cannot show
The crucial limitation, and a favorite exam point, is that a boxplot is built only from five numbers, so it throws away the internal shape of the data. It cannot show how many modes a distribution has: a smoothly unimodal distribution and a sharply bimodal one can produce identical boxplots, because both can share the same five-number summary. It also hides gaps and clusters within the box or whiskers. For that reason, when a question asks whether a boxplot fully describes a distribution, the honest answer is no: to see modality, gaps, and clusters you need a dotplot, stemplot, or histogram. A strong exam answer states this limitation explicitly, often phrased as "a boxplot cannot reveal the number of peaks (modality) because it shows only the five-number summary." Knowing the strengths (comparison, skew, resistant outlier detection) and the weaknesses (no modality, no clusters) of each display is the meta-skill that Unit 1's display topics build toward.
Try this
Q1. A data set has , . Find the upper fence for outliers. [2 points]
- Cue. , so upper fence .
Q2. Give one feature of a distribution that two different data sets could hide while sharing the same boxplot. [1 point]
- Cue. Modality (number of peaks), or gaps and clusters; a boxplot uses only the five-number summary.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2018 (style)1 marksSection I (multiple choice). A data set has and . Using the rule, a value is an outlier if it is above which threshold? (A) (B) (C) (D) Show worked answer →
The correct answer is (C).
, so . The upper fence is . Any value above is a high outlier.
(A) is just . (B) adds only (one IQR, not ). (D) adds (two IQR). The rule adds exactly to for the upper fence.
AP 2023 (style)4 marksSection II (free response). A data set of daily download counts has five-number summary , , median , , . (a) Apply the rule to determine whether any values are outliers. (b) Describe what the boxplot of these data would look like and what it suggests about shape. (c) State one feature of the distribution a boxplot cannot reveal.Show worked answer →
A 4-point question on boxplots and outliers.
(a) (2 points) ; . Lower fence ; upper fence (1 point). The minimum is above , so no low outliers; the maximum exceeds , so is a high outlier (1 point).
(b) (1 point) The box spans to with the median at ; the right whisker is long (or a point at is plotted as an outlier), suggesting the distribution is skewed right.
(c) (1 point) A boxplot cannot reveal the number of modes (peaks) or gaps and clusters within the data; for example a bimodal distribution and a uniform one can produce identical boxplots.
Markers reward correct fences and outlier identification, a description tying the long right whisker to right skew, and a correct limitation (modality or clusters).
Related dot points
- Topic 1.7 Summary Statistics for a Quantitative Variable: calculate and interpret measures of center (mean, median) and spread (range, IQR, standard deviation, variance), and judge their resistance to outliers.
A focused answer to AP Statistics Topic 1.7, defining and computing the mean, median, range, IQR, variance, and standard deviation, explaining resistance to outliers, with full worked calculations.
- Topic 1.6 Describing the Distribution of a Quantitative Variable: describe a quantitative distribution by its shape, center, spread, and unusual features (outliers, gaps, clusters) in context.
A focused answer to AP Statistics Topic 1.6, the SOCS framework for describing a quantitative distribution by shape, outliers, center, and spread, with the vocabulary of skew, modality, and clusters, and worked descriptions.
- Topic 1.9 Comparing Distributions of a Quantitative Variable: compare two or more distributions of a quantitative variable by shape, center, spread, and unusual features, in context, using comparative language.
A focused answer to AP Statistics Topic 1.9, on comparing two or more distributions by shape, center, spread, and unusual features using explicit comparative language, with a worked side-by-side comparison.
- Topic 1.5 Representing a Quantitative Variable with Graphs: construct and interpret dotplots, stem-and-leaf plots, and histograms for a quantitative variable, and choose an appropriate display.
A focused answer to AP Statistics Topic 1.5, on displaying a quantitative variable with dotplots, stem-and-leaf plots, and histograms, choosing bin widths, and reading the displays, with a worked histogram construction.
- Topic 1.10 The Normal Distribution: use z-scores, the empirical (68-95-99.7) rule, and the standard normal model to find proportions and percentiles for approximately normal data.
A focused answer to AP Statistics Topic 1.10, on the normal model, standardizing with z-scores, the 68-95-99.7 empirical rule, and finding proportions and percentiles, with full worked z-score and normal-area calculations.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)