United StatesStatisticsSyllabus dot point

Which graphs display a categorical variable, and how do we describe and compare them honestly?

Topic 1.4 Representing a Categorical Variable with Graphs: choose, construct, and interpret bar graphs and other displays of a single categorical variable, and describe the distribution of categories.

A focused answer to AP Statistics Topic 1.4, on displaying one categorical variable with bar graphs (frequency and relative frequency) and pie charts, reading and describing them, and the pitfalls of misleading scales, with worked examples.

Generated by Claude Opus 4.88 min answerUpdated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this topic is asking
The bar graph
Frequency versus relative frequency bars
Pie charts and their limits
Describing a categorical distribution
Misleading displays
Try this

What this topic is asking

The College Board (Topic 1.4) wants you to display a single categorical variable with an appropriate graph, principally a bar graph (in frequency or relative frequency form), and sometimes a pie chart, and to describe the distribution and spot misleading displays.

The bar graph

The gap is the visual signal that distinguishes a bar graph from a histogram. In a histogram (for quantitative data) the bars touch, because they cover adjacent intervals of a number line. In a bar graph the bars are separated, because "apple" and "banana" are not adjacent points on any scale; their left-to-right order is arbitrary and you may sort them however is clearest, often from tallest to shortest.

Frequency versus relative frequency bars

A frequency bar graph uses counts on the vertical axis; a relative frequency bar graph uses proportions or percentages. The shape is identical; only the axis labels change. Relative frequency is preferred when comparing two groups of different sizes, for exactly the reason it matters in tables: it puts both groups on a per-total scale so that a difference in sample size does not distort the picture.

Pie charts and their limits

A pie chart shows each category as a slice proportional to its relative frequency, so the whole circle represents $100\%$ . Pie charts read well when there are few categories and you want to emphasize parts of a whole, but they make it hard to compare slices of similar size, and they cannot show counts directly. For most exam purposes a bar graph is the safer, more readable choice, and you should be ready to say so if asked which display is better.

Describing a categorical distribution

Describing a categorical display is not the same as describing a quantitative one. There is no center, spread, or shape in the quantitative sense, because the categories have no numerical order. Instead, a good description names the modal (most common) category, the least common category, and gives a sense of how the relative frequencies compare, all in context. For instance: "Type O is the most common blood type in this sample at $45\%$ , with A close behind at $40\%$ ; B and AB are much rarer, at $11\%$ and $4\%$ ." Resist the temptation to talk about "skew" or "outliers," which are quantitative ideas; the bars could be reordered without changing the data, so those words do not apply. The exam specifically rewards descriptions tied to the context and the actual proportions, rather than generic phrases.

Misleading displays

A favorite exam target is the truncated axis. A bar graph whose vertical axis starts at, say, $0.30$ instead of $0$ makes the bars' visible heights non-proportional to their values, so a small real difference looks dramatic. Other manipulations include using pictures of different widths (so area, not height, conveys size) or omitting some categories. When you are asked to critique a display, the cleanest answer names the specific feature (for example "the axis does not start at zero") and explains the effect ("this exaggerates the difference between categories"). Being able to both build an honest graph and diagnose a dishonest one is exactly the dual skill Topic 1.4 trains, and it recurs whenever the exam shows you a graphic and asks whether it is appropriate or fair.

Reading a relative frequency bar graph

A relative frequency bar graph of $300$ commuters shows: car $0.50$ , train $0.25$ , bus $0.15$ , bicycle $0.10$ . Describe the distribution and find the number who cycle.

step 1 Identify the most and least common categories

Car is the most common mode at a relative frequency of $0.50$ ; bicycle is the least common at $0.10$ .

step 2 Describe the spread of categories in context

Half of the $300$ commuters travel by car, a quarter by train, and the remaining quarter is split between bus ( $0.15$ ) and bicycle ( $0.10$ ). Public and active transport together account for the other half.

step 3 Back-calculate a count

Number who cycle: $0.10 \times 300 = 30$ commuters.

step 4 Interpret

Car dominates this sample, with bicycle the rarest mode; $30$ of the $300$ commuters cycle. A bar graph (not a histogram) is appropriate because mode of transport is categorical.

Try this

Q1. State one visual feature that distinguishes a bar graph from a histogram. [1 point]

Cue. Bar-graph bars are separated by gaps (categorical); histogram bars touch (adjacent quantitative intervals).

Q2. A bar graph's vertical axis starts at $40$ rather than $0$ . Explain the effect. [2 points]

Cue. The visible bar heights are no longer proportional to their values, so differences between categories are exaggerated, misleading the reader.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). Which display is most appropriate for showing the distribution of a single categorical variable such as preferred phone brand? (A) A histogram (B) A bar graph (C) A boxplot (D) A scatterplot

Show worked answer →

The correct answer is (B).

A bar graph displays the frequency or relative frequency of each category of a categorical variable, with bars separated by gaps because the categories are distinct.

(A) histograms and (C) boxplots are for quantitative variables. (D) a scatterplot displays the relationship between two quantitative variables. Matching the display to the variable type (here categorical) is the skill being tested.

AP 2022 (style)3 marksSection II (free response). A bar graph shows the relative frequency of four blood types in a sample of

500

donors: O

0.45

, A

0.40

, B

0.11

, AB

0.04

. (a) Describe the distribution of blood type in this sample. (b) A second graph uses a vertical axis starting at

0.30

rather than

0

. Explain how this could mislead a reader. (c) State the number of donors with type B.

Show worked answer →

A 3-point question on describing and critiquing categorical displays.

(a) (1 point) Description: type O is the most common (relative frequency $0.45$ ), closely followed by A ( $0.40$ ); B ( $0.11$ ) and especially AB ( $0.04$ ) are uncommon. Naming the most and least common categories and noting the rough ordering earns the point.
(b) (1 point) Starting the axis at $0.30$ exaggerates the visual difference between bars: the gap between O and AB looks far larger than it is, because a truncated axis is not proportional from zero. This can mislead a reader into overstating differences.
(c) (1 point) Number of type B donors: $0.11 \times 500 = 55$ donors.

Markers reward a description that identifies the most and least common categories, a correct critique of the truncated axis, and the correct back-calculation from relative frequency to count.

Related dot points

Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)