How do we classify variables, and why does the type of variable decide everything we do next?
Topic 1.2 The Language of Variation - Variables: classify variables as categorical or quantitative, and quantitative variables as discrete or continuous, and explain why the type determines the appropriate graphs and statistics.
A focused answer to AP Statistics Topic 1.2, classifying variables as categorical or quantitative (and discrete or continuous), with the consequences for which displays and summaries are valid, plus worked classification examples.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 1.2) wants you to classify a variable as categorical or quantitative, to split quantitative variables into discrete and continuous, and to understand that this classification dictates which graphs and summary statistics are valid. The vocabulary here is the foundation for the rest of the unit.
Two kinds of variable
The acid test is whether arithmetic makes sense. You can average heights, so height is quantitative. Averaging blood types ("the mean blood type is AB-ish") is nonsense, so blood type is categorical. Be alert to numbers that are really labels: a postcode or a player's jersey number is written with digits but is categorical, because adding or averaging those numbers is meaningless.
Discrete versus continuous
A count of text messages sent in a day is discrete; the time spent on the phone is continuous. In practice continuous variables are recorded to a finite precision (say, ), but the underlying quantity could in principle take any value in a range, which is what makes it continuous.
The classification decides the tools
The reason this topic comes second in the whole course is that every later choice depends on it:
- Categorical variables are summarized with frequency and relative frequency (counts and proportions), and displayed with bar graphs and (for two categories together) two-way tables. There is no "mean category."
- Quantitative variables are summarized with center (mean, median) and spread (range, IQR, standard deviation), and displayed with dotplots, stem-and-leaf plots, histograms, and boxplots.
Choosing a tool that does not match the variable type, for example trying to draw a histogram of brand of phone, is a classic error the exam punishes.
Getting the borderline cases right
Most exam classification questions hinge on a small number of slippery cases, and it pays to rehearse them. Numbers used as labels (postcodes, ID numbers, jersey numbers, area codes) are categorical despite being written with digits, because the numerals only identify a group and arithmetic on them is meaningless. Ordinal scales (a satisfaction rating from 1 to 5, a pain score from 1 to 10, a class rank) carry order but unequal or merely conventional spacing; the AP course treats these as categorical when the focus is on the ordered labels, though they are commonly analyzed as quantitative discrete scores when averaging is intended, and the cleanest exam answer states which interpretation you are using and why. Counts are quantitative and discrete; measurements are quantitative and continuous. A final subtlety is that the same underlying idea can be measured as either type depending on how it is recorded: age can be recorded as a continuous measurement in years, or bucketed into categories ("under 18", "18 to 65", "over 65"), at which point it becomes categorical. Reading carefully how each variable was actually recorded, rather than guessing from its name, is the skill the exam rewards.
Why this language threads through the course
The categorical-versus-quantitative split is not a one-off vocabulary exercise; it reappears at every stage. In Unit 2 you explore relationships, and whether the two variables are both categorical, both quantitative, or one of each determines whether you build a two-way table, a scatterplot, or comparative boxplots. In the inference units, categorical data lead to tests for proportions and chi-square procedures, while quantitative data lead to tests for means and regression inference. So learning to classify confidently now is an investment that pays off repeatedly: a marker can tell instantly whether a student understands the data when they pick the matching display and summary, and choosing the wrong family of tools signals a conceptual gap that costs marks well beyond Unit 1.
Try this
Q1. Classify the variable "favorite social-media platform" and state how you would summarize it. [2 points]
- Cue. It is categorical, so summarize with counts or relative frequencies (proportions) and display with a bar graph; there is no mean.
Q2. Is "number of goals scored by a team in a match" discrete or continuous? Justify. [1 point]
- Cue. Discrete, because goals are counted in whole numbers () and cannot take in-between values.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2018 (style)1 marksSection I (multiple choice). For a study of cars, which of the following lists classifies the variables correctly? Variables: color, number of doors, fuel economy in miles per gallon, drivetrain (front, rear, all-wheel). (A) All four are quantitative (B) Color and drivetrain are categorical; number of doors and fuel economy are quantitative (C) All four are categorical (D) Only fuel economy is quantitativeShow worked answer →
The correct answer is (B).
Color and drivetrain take labels, not numbers used as quantities, so they are categorical. Number of doors is a count, which is a quantitative (discrete) variable, and fuel economy in miles per gallon is a measured number, a quantitative (continuous) variable.
(A) and (C) ignore the distinction. (D) wrongly treats number of doors as categorical; because it is a meaningful count you can average, it is quantitative. The test is whether arithmetic such as averaging the values is meaningful.
AP 2021 (style)3 marksSection II (free response). A hospital records, for each patient: blood type (A, B, AB, O), age in years, number of prior visits, and pain rating on a scale of 1 to 10. (a) Classify each variable as categorical or quantitative. (b) For the quantitative variables, classify each as discrete or continuous. (c) Justify why pain rating is treated the way you classified it.Show worked answer →
A 3-point classification question.
(a) (1 point) Blood type is categorical; age, number of prior visits, and pain rating are quantitative (see part c for the nuance on pain rating).
(b) (1 point) Among the quantitative variables: age is continuous (measured on a continuous scale, even if reported as whole years); number of prior visits is discrete (a count); pain rating is discrete (it takes only the whole values 1 to 10).
(c) (1 point) Pain rating is an ordered scale with equally spaced whole-number labels; because the numbers carry order and are treated as a count-like score on which arithmetic such as averaging is conventionally performed, it is treated as quantitative and discrete. Award the point for a coherent justification, noting it is sometimes described as ordinal.
Markers reward correct categorical-versus-quantitative classification, a correct discrete-versus-continuous split, and a reasoned justification for the borderline variable.
Related dot points
- Topic 1.1 Introducing Statistics - What Can We Learn from Data?: identify questions to be answered, based on variation in one-variable data, and recognize what a data set can and cannot tell us.
A focused answer to AP Statistics Topic 1.1, on how variation in data raises statistical questions, what kinds of question data can answer, and the limits of what a single data set reveals, with worked examples.
- Topic 1.3 Representing a Categorical Variable with Tables: build and interpret frequency and relative frequency tables for a single categorical variable, and read proportions and percentages from them.
A focused answer to AP Statistics Topic 1.3, on building frequency and relative frequency tables for one categorical variable, converting between counts, proportions, and percentages, and interpreting them in context, with worked tables.
- Topic 1.4 Representing a Categorical Variable with Graphs: choose, construct, and interpret bar graphs and other displays of a single categorical variable, and describe the distribution of categories.
A focused answer to AP Statistics Topic 1.4, on displaying one categorical variable with bar graphs (frequency and relative frequency) and pie charts, reading and describing them, and the pitfalls of misleading scales, with worked examples.
- Topic 1.5 Representing a Quantitative Variable with Graphs: construct and interpret dotplots, stem-and-leaf plots, and histograms for a quantitative variable, and choose an appropriate display.
A focused answer to AP Statistics Topic 1.5, on displaying a quantitative variable with dotplots, stem-and-leaf plots, and histograms, choosing bin widths, and reading the displays, with a worked histogram construction.
- Topic 1.6 Describing the Distribution of a Quantitative Variable: describe a quantitative distribution by its shape, center, spread, and unusual features (outliers, gaps, clusters) in context.
A focused answer to AP Statistics Topic 1.6, the SOCS framework for describing a quantitative distribution by shape, outliers, center, and spread, with the vocabulary of skew, modality, and clusters, and worked descriptions.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)