How do we display two categorical variables together, and what do two-way tables and segmented bar graphs reveal?

Topic 2.2 Representing Two Categorical Variables: construct and interpret two-way (contingency) tables and segmented or side-by-side bar graphs for two categorical variables.

A focused answer to AP Statistics Topic 2.2, on building and reading two-way tables and segmented or side-by-side bar graphs for two categorical variables, with marginal totals and a worked table.

Generated by Claude Opus 4.89 min answer. Updated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

Jump to a section

What this topic is asking
The two-way table
Joint and marginal distributions
Segmented and side-by-side bar graphs
Reading association from the displays
Try this

What this topic is asking

The College Board (Topic 2.2) wants you to display two categorical variables together with a two-way (contingency) table and with segmented or side-by-side bar graphs, and to read the joint and marginal information these displays carry.

The two-way table

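A two-way (contingency) table classifies each individual by two categorical variables at once: the categories of one variable label the rows, the categories of the other label the columns, and each cell holds the count of individuals in that row-and-column combination. A complete table has every cell filled and every margin computed, with the row totals and column totals each summing to the grand total $n$. Filling in margins is not decoration: the marginal totals are needed to compute the proportions used in the next topic, and a missing margin is a common reason a table-construction question loses marks.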

Joint and marginal distributions

The table carries two kinds of information at once. The joint distribution is the set of cell counts (or cell proportions out of the grand total): it answers "how many are in both this row and this column?" The marginal distribution is given by the margins: a row total describes one variable on its own, ignoring the other. For example, the column totals tell you the overall split of the column variable across everyone, regardless of the row variable. Distinguishing joint from marginal is foundational, because conditional proportions (the next topic) build on both.
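The distinction between joint and marginal distributions can be sketched in a few lines of Python. This is a minimal illustration, not a prescribed method: it uses the counts from the tea/coffee practice question later in this guide, and the helper name `margins` and the nested-dictionary layout are assumptions made for the example.

```python
# Counts from the tea/coffee practice question in this guide
# (rows: sex, columns: beverage preference).
counts = {
    "male":   {"coffee": 54, "tea": 36},
    "female": {"coffee": 44, "tea": 66},
}

def margins(table):
    """Return (row_totals, column_totals, grand_total) for a nested dict of counts.

    Hypothetical helper written for this illustration.
    """
    row_totals = {row: sum(cells.values()) for row, cells in table.items()}
    column_totals = {}
    for cells in table.values():
        for col, count in cells.items():
            column_totals[col] = column_totals.get(col, 0) + count
    grand_total = sum(row_totals.values())
    return row_totals, column_totals, grand_total

row_totals, column_totals, n = margins(counts)

# Joint distribution: each cell as a proportion of the grand total.
joint = {
    (row, col): count / n
    for row, cells in counts.items()
    for col, count in cells.items()
}

# Marginal distributions: each margin total as a proportion of the grand total.
marginal_rows = {row: total / n for row, total in row_totals.items()}
marginal_columns = {col: total / n for col, total in column_totals.items()}

print("joint (male, coffee):", joint[("male", "coffee")])
print("marginal of sex:", marginal_rows)
print("marginal of beverage:", marginal_columns)
```

The joint proportions are all divided by the same grand total, so they sum to 1 across the whole table, whereas each marginal distribution sums to 1 on its own.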

Segmented and side-by-side bar graphs

To display two categorical variables graphically, two bar-graph variants are standard. A segmented (stacked) bar graph draws one bar per category of the first variable, divided into segments whose heights show the proportions of the second variable within that category; comparing the segment patterns across bars reveals whether the variables are associated. A side-by-side (clustered) bar graph instead places the second variable's bars next to each other within each group, which can make exact comparisons easier when there are only a few categories. Both displays are usually drawn with relative frequencies (proportions) rather than raw counts when the groups have different sizes, so that the comparison is fair, exactly as in Unit 1. The choice between segmented and side-by-side is largely about readability: segmented bars emphasize parts of a whole, while side-by-side bars emphasize direct category-to-category comparison.
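The segment heights of a segmented bar graph are just within-group proportions. The sketch below computes them for the walk/ride counts from the worked example in this guide and prints a rough text version of each bar. The function names `segment_heights` and `text_bar` are hypothetical, and the text rendering is only a stand-in for a real plot.

```python
# Counts from the worked example in this guide
# (rows: year group, columns: travel mode).
counts = {
    "Year 11": {"walk": 21, "ride": 14},
    "Year 12": {"walk": 10, "ride": 15},
}

def segment_heights(table):
    """Within each row group, convert counts to proportions of that group's total."""
    heights = {}
    for group, cells in table.items():
        group_total = sum(cells.values())
        heights[group] = {cat: count / group_total for cat, count in cells.items()}
    return heights

def text_bar(segments, width=20):
    """Render one segmented bar as text, one letter per category.

    Each segment gets round(share * width) characters, so with awkward
    proportions the bar can be a character longer or shorter than `width`.
    """
    return "".join(cat[0].upper() * round(share * width)
                   for cat, share in segments.items())

heights = segment_heights(counts)
for group, segments in heights.items():
    print(f"{group}: {text_bar(segments)}")
```

Because each bar is scaled to its own group total, the two bars have the same length even though the groups contain 35 and 25 students, which is the point of using relative frequencies.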

Reading association from the displays

The reason two-way tables and these bar graphs matter is that they reveal association between the two categorical variables. If the proportion preferring coffee is much higher among males than females, the segmented bars for the two sexes look different, and that visible difference is the association. If the variables were unrelated, the breakdown within each group would look the same. So when you read one of these displays, you are really asking "does the distribution of one variable change as I move across the categories of the other?" A strong exam answer names the variables, points to the differing proportions, and states the association in context, while remembering Topic 2.1's caution that an association in observational data is not proof of cause. The numerical version of this comparison, using conditional proportions, is exactly what Topic 2.3 formalises, so a clean two-way table here sets up the next topic directly.
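As a quick numerical sketch of that comparison, take the counts from the tea/coffee practice question later in this guide ($54$ of $90$ males and $44$ of $110$ females prefer coffee):

$$
\frac{54}{90} = 0.60 \quad \text{(males preferring coffee)}, \qquad \frac{44}{110} \approx 0.40 \quad \text{(females preferring coffee)}.
$$

The two proportions differ by about $0.20$, so in this sample beverage preference is associated with sex. Had both groups matched the overall proportion $98/200 = 0.49$, the two segmented bars would look the same and the display would show no association.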

Building a two-way table and reading its margins

A class of $60$ students is surveyed on whether they walk or ride to school, broken down by year group (Year 11, Year 12). Of $35$ Year 11s, $21$ walk; of $25$ Year 12s, $10$ walk. Build the table and find the marginal distribution of travel mode.

Step 1. Fill the cells

Year 11: walk $21$ , ride $35 - 21 = 14$ . Year 12: walk $10$ , ride $25 - 10 = 15$ .

Step 2. Compute the margins

Row totals: Year 11 $= 35$ , Year 12 $= 25$ . Column totals: walk $= 21 + 10 = 31$ , ride $= 14 + 15 = 29$ . Grand total $= 31 + 29 = 60$ , matching $n$ .

Step 3. State the marginal distribution of travel mode

Walk $= 31/60 \approx 0.52$ ; ride $= 29/60 \approx 0.48$ . So across the whole class, about $52\%$ walk and $48\%$ ride, ignoring year group.

Step 4. Interpret

The completed table classifies all $60$ students jointly by year and travel mode. Its margins show that travel mode is split roughly evenly overall, while the cells show a higher share of walkers among Year 11s ($21/35 = 60\%$) than among Year 12s ($10/25 = 40\%$), which hints at an association we would quantify with conditional proportions next.
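The four steps above can be checked with a short script. This is a sketch under the assumption that each group splits into exactly two categories, so the second cell is found by subtraction; the function name `build_two_way` and its parameters are hypothetical.

```python
def build_two_way(group_sizes, first_category_counts, first="walk", second="ride"):
    """Fill a two-category table from each group's size and its count in `first`.

    Hypothetical helper: the remaining cell is the group size minus that count.
    """
    table = {}
    for group, size in group_sizes.items():
        in_first = first_category_counts[group]
        table[group] = {first: in_first, second: size - in_first}
    return table

# Step 1: fill the cells from the information given in the question.
table = build_two_way({"Year 11": 35, "Year 12": 25}, {"Year 11": 21, "Year 12": 10})

# Step 2: compute the margins.
row_totals = {g: sum(cells.values()) for g, cells in table.items()}
column_totals = {c: sum(cells[c] for cells in table.values()) for c in ("walk", "ride")}
grand_total = sum(row_totals.values())

# Row totals and column totals must both add up to the same grand total.
assert sum(column_totals.values()) == grand_total == 60

# Print the completed table with margins.
print(f"{'':<8}{'walk':>6}{'ride':>6}{'total':>7}")
for g, cells in table.items():
    print(f"{g:<8}{cells['walk']:>6}{cells['ride']:>6}{row_totals[g]:>7}")
print(f"{'total':<8}{column_totals['walk']:>6}{column_totals['ride']:>6}{grand_total:>7}")

# Step 3: marginal distribution of travel mode, rounded to two decimals.
marginal_travel = {c: round(t / grand_total, 2) for c, t in column_totals.items()}
print(marginal_travel)
```

The internal check that both sets of margins reach the grand total mirrors the hand check in Step 2 and catches most arithmetic slips in the cells.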

Try this

Q1. In a two-way table, what do the column totals describe? [1 point]

Cue. The marginal distribution of the column variable: that variable on its own, ignoring the row variable.

Q2. Why are relative frequencies preferred over counts in a segmented bar graph comparing two groups of different sizes? [2 points]

Cue. Proportions put both groups on a per-total scale, so a difference in group size does not distort the visual comparison of the second variable's breakdown.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2017 (style), 1 mark, Section I (multiple choice). In a two-way table, the totals in the right-hand margin (the row totals) give which distribution?
(A) The joint distribution
(B) The conditional distribution of the column variable
(C) The marginal distribution of the row variable
(D) The relative frequencies of each cell

The correct answer is (C).

The row totals in the right margin summarize one variable on its own, ignoring the other; this is the marginal distribution of the row variable. The margins of a two-way table give the marginal distributions.

(A) the joint distribution is the cell counts (or cell proportions). (B) a conditional distribution fixes one variable's category and looks within it. (D) cell relative frequencies are joint, not marginal. The margin totals define marginal distributions.

AP 2020 (style), 4 marks, Section II (free response). A survey of $200$ people records sex (male, female) and whether they prefer tea or coffee. Of $90$ males, $54$ prefer coffee; of $110$ females, $44$ prefer coffee. (a) Construct the complete two-way table with marginal totals. (b) State the marginal distribution of beverage preference. (c) Describe one advantage of a segmented bar graph for displaying these data.

A 4-point question on building and reading a two-way table.

(a) (2 points) Males: coffee $54$, tea $90 - 54 = 36$. Females: coffee $44$, tea $110 - 44 = 66$. Table cells: male/coffee $54$, male/tea $36$, female/coffee $44$, female/tea $66$. Row totals: male $90$, female $110$. Column totals: coffee $54 + 44 = 98$, tea $36 + 66 = 102$. Grand total $90 + 110 = 98 + 102 = 200$ (1 point for cells, 1 point for correct margins).
(b) (1 point) Marginal distribution of beverage: coffee $98/200 = 0.49$ , tea $102/200 = 0.51$ (proportions of the whole sample, ignoring sex).
(c) (1 point) A segmented bar graph shows, within each sex, the proportion choosing each beverage, making it easy to compare the conditional distributions of preference between males and females visually.

Markers reward a correct table with margins, the marginal beverage distribution as proportions, and a sensible advantage of the segmented display.

Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)