Skip to main content
United StatesStatisticsSyllabus dot point

Which graphs display a categorical variable, and how do we describe and compare them honestly?

Topic 1.4 Representing a Categorical Variable with Graphs: choose, construct, and interpret bar graphs and other displays of a single categorical variable, and describe the distribution of categories.

A focused answer to AP Statistics Topic 1.4, on displaying one categorical variable with bar graphs (frequency and relative frequency) and pie charts, reading and describing them, and the pitfalls of misleading scales, with worked examples.

Generated by Claude Opus 4.88 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. The bar graph
  3. Frequency versus relative frequency bars
  4. Pie charts and their limits
  5. Describing a categorical distribution
  6. Misleading displays
  7. Try this

What this topic is asking

The College Board (Topic 1.4) wants you to display a single categorical variable with an appropriate graph, principally a bar graph (in frequency or relative frequency form), and sometimes a pie chart, and to describe the distribution and spot misleading displays.

The bar graph

The gap is the visual signal that distinguishes a bar graph from a histogram. In a histogram (for quantitative data) the bars touch, because they cover adjacent intervals of a number line. In a bar graph the bars are separated, because "apple" and "banana" are not adjacent points on any scale; their left-to-right order is arbitrary and you may sort them however is clearest, often from tallest to shortest.

Frequency versus relative frequency bars

A frequency bar graph uses counts on the vertical axis; a relative frequency bar graph uses proportions or percentages. The shape is identical; only the axis labels change. Relative frequency is preferred when comparing two groups of different sizes, for exactly the reason it matters in tables: it puts both groups on a per-total scale so that a difference in sample size does not distort the picture.

Pie charts and their limits

A pie chart shows each category as a slice proportional to its relative frequency, so the whole circle represents 100%100\%. Pie charts read well when there are few categories and you want to emphasize parts of a whole, but they make it hard to compare slices of similar size, and they cannot show counts directly. For most exam purposes a bar graph is the safer, more readable choice, and you should be ready to say so if asked which display is better.

Describing a categorical distribution

Describing a categorical display is not the same as describing a quantitative one. There is no center, spread, or shape in the quantitative sense, because the categories have no numerical order. Instead, a good description names the modal (most common) category, the least common category, and gives a sense of how the relative frequencies compare, all in context. For instance: "Type O is the most common blood type in this sample at 45%45\%, with A close behind at 40%40\%; B and AB are much rarer, at 11%11\% and 4%4\%." Resist the temptation to talk about "skew" or "outliers," which are quantitative ideas; the bars could be reordered without changing the data, so those words do not apply. The exam specifically rewards descriptions tied to the context and the actual proportions, rather than generic phrases.

Misleading displays

A favorite exam target is the truncated axis. A bar graph whose vertical axis starts at, say, 0.300.30 instead of 00 makes the bars' visible heights non-proportional to their values, so a small real difference looks dramatic. Other manipulations include using pictures of different widths (so area, not height, conveys size) or omitting some categories. When you are asked to critique a display, the cleanest answer names the specific feature (for example "the axis does not start at zero") and explains the effect ("this exaggerates the difference between categories"). Being able to both build an honest graph and diagnose a dishonest one is exactly the dual skill Topic 1.4 trains, and it recurs whenever the exam shows you a graphic and asks whether it is appropriate or fair.

Try this

Q1. State one visual feature that distinguishes a bar graph from a histogram. [1 point]

  • Cue. Bar-graph bars are separated by gaps (categorical); histogram bars touch (adjacent quantitative intervals).

Q2. A bar graph's vertical axis starts at 4040 rather than 00. Explain the effect. [2 points]

  • Cue. The visible bar heights are no longer proportional to their values, so differences between categories are exaggerated, misleading the reader.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). Which display is most appropriate for showing the distribution of a single categorical variable such as preferred phone brand? (A) A histogram (B) A bar graph (C) A boxplot (D) A scatterplot
Show worked answer →

The correct answer is (B).

A bar graph displays the frequency or relative frequency of each category of a categorical variable, with bars separated by gaps because the categories are distinct.

(A) histograms and (C) boxplots are for quantitative variables. (D) a scatterplot displays the relationship between two quantitative variables. Matching the display to the variable type (here categorical) is the skill being tested.

AP 2022 (style)3 marksSection II (free response). A bar graph shows the relative frequency of four blood types in a sample of 500500 donors: O 0.450.45, A 0.400.40, B 0.110.11, AB 0.040.04. (a) Describe the distribution of blood type in this sample. (b) A second graph uses a vertical axis starting at 0.300.30 rather than 00. Explain how this could mislead a reader. (c) State the number of donors with type B.
Show worked answer →

A 3-point question on describing and critiquing categorical displays.

(a) (1 point) Description: type O is the most common (relative frequency 0.450.45), closely followed by A (0.400.40); B (0.110.11) and especially AB (0.040.04) are uncommon. Naming the most and least common categories and noting the rough ordering earns the point.
(b) (1 point) Starting the axis at 0.300.30 exaggerates the visual difference between bars: the gap between O and AB looks far larger than it is, because a truncated axis is not proportional from zero. This can mislead a reader into overstating differences.
(c) (1 point) Number of type B donors: 0.11×500=550.11 \times 500 = 55 donors.

Markers reward a description that identifies the most and least common categories, a correct critique of the truncated axis, and the correct back-calculation from relative frequency to count.

Related dot points

Sources & how we know this