Skip to main content
United StatesStatisticsSyllabus dot point

How do we classify variables, and why does the type of variable decide everything we do next?

Topic 1.2 The Language of Variation - Variables: classify variables as categorical or quantitative, and quantitative variables as discrete or continuous, and explain why the type determines the appropriate graphs and statistics.

A focused answer to AP Statistics Topic 1.2, classifying variables as categorical or quantitative (and discrete or continuous), with the consequences for which displays and summaries are valid, plus worked classification examples.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Two kinds of variable
  3. Discrete versus continuous
  4. The classification decides the tools
  5. Getting the borderline cases right
  6. Why this language threads through the course
  7. Try this

What this topic is asking

The College Board (Topic 1.2) wants you to classify a variable as categorical or quantitative, to split quantitative variables into discrete and continuous, and to understand that this classification dictates which graphs and summary statistics are valid. The vocabulary here is the foundation for the rest of the unit.

Two kinds of variable

The acid test is whether arithmetic makes sense. You can average heights, so height is quantitative. Averaging blood types ("the mean blood type is AB-ish") is nonsense, so blood type is categorical. Be alert to numbers that are really labels: a postcode or a player's jersey number is written with digits but is categorical, because adding or averaging those numbers is meaningless.

Discrete versus continuous

A count of text messages sent in a day is discrete; the time spent on the phone is continuous. In practice continuous variables are recorded to a finite precision (say, 1.73 m1.73\ \text{m}), but the underlying quantity could in principle take any value in a range, which is what makes it continuous.

The classification decides the tools

The reason this topic comes second in the whole course is that every later choice depends on it:

  • Categorical variables are summarized with frequency and relative frequency (counts and proportions), and displayed with bar graphs and (for two categories together) two-way tables. There is no "mean category."
  • Quantitative variables are summarized with center (mean, median) and spread (range, IQR, standard deviation), and displayed with dotplots, stem-and-leaf plots, histograms, and boxplots.

Choosing a tool that does not match the variable type, for example trying to draw a histogram of brand of phone, is a classic error the exam punishes.

Getting the borderline cases right

Most exam classification questions hinge on a small number of slippery cases, and it pays to rehearse them. Numbers used as labels (postcodes, ID numbers, jersey numbers, area codes) are categorical despite being written with digits, because the numerals only identify a group and arithmetic on them is meaningless. Ordinal scales (a satisfaction rating from 1 to 5, a pain score from 1 to 10, a class rank) carry order but unequal or merely conventional spacing; the AP course treats these as categorical when the focus is on the ordered labels, though they are commonly analyzed as quantitative discrete scores when averaging is intended, and the cleanest exam answer states which interpretation you are using and why. Counts are quantitative and discrete; measurements are quantitative and continuous. A final subtlety is that the same underlying idea can be measured as either type depending on how it is recorded: age can be recorded as a continuous measurement in years, or bucketed into categories ("under 18", "18 to 65", "over 65"), at which point it becomes categorical. Reading carefully how each variable was actually recorded, rather than guessing from its name, is the skill the exam rewards.

Why this language threads through the course

The categorical-versus-quantitative split is not a one-off vocabulary exercise; it reappears at every stage. In Unit 2 you explore relationships, and whether the two variables are both categorical, both quantitative, or one of each determines whether you build a two-way table, a scatterplot, or comparative boxplots. In the inference units, categorical data lead to tests for proportions and chi-square procedures, while quantitative data lead to tests for means and regression inference. So learning to classify confidently now is an investment that pays off repeatedly: a marker can tell instantly whether a student understands the data when they pick the matching display and summary, and choosing the wrong family of tools signals a conceptual gap that costs marks well beyond Unit 1.

Try this

Q1. Classify the variable "favorite social-media platform" and state how you would summarize it. [2 points]

  • Cue. It is categorical, so summarize with counts or relative frequencies (proportions) and display with a bar graph; there is no mean.

Q2. Is "number of goals scored by a team in a match" discrete or continuous? Justify. [1 point]

  • Cue. Discrete, because goals are counted in whole numbers (0,1,2,0, 1, 2, \dots) and cannot take in-between values.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). For a study of cars, which of the following lists classifies the variables correctly? Variables: color, number of doors, fuel economy in miles per gallon, drivetrain (front, rear, all-wheel). (A) All four are quantitative (B) Color and drivetrain are categorical; number of doors and fuel economy are quantitative (C) All four are categorical (D) Only fuel economy is quantitative
Show worked answer →

The correct answer is (B).

Color and drivetrain take labels, not numbers used as quantities, so they are categorical. Number of doors is a count, which is a quantitative (discrete) variable, and fuel economy in miles per gallon is a measured number, a quantitative (continuous) variable.

(A) and (C) ignore the distinction. (D) wrongly treats number of doors as categorical; because it is a meaningful count you can average, it is quantitative. The test is whether arithmetic such as averaging the values is meaningful.

AP 2021 (style)3 marksSection II (free response). A hospital records, for each patient: blood type (A, B, AB, O), age in years, number of prior visits, and pain rating on a scale of 1 to 10. (a) Classify each variable as categorical or quantitative. (b) For the quantitative variables, classify each as discrete or continuous. (c) Justify why pain rating is treated the way you classified it.
Show worked answer →

A 3-point classification question.

(a) (1 point) Blood type is categorical; age, number of prior visits, and pain rating are quantitative (see part c for the nuance on pain rating).
(b) (1 point) Among the quantitative variables: age is continuous (measured on a continuous scale, even if reported as whole years); number of prior visits is discrete (a count); pain rating is discrete (it takes only the whole values 1 to 10).
(c) (1 point) Pain rating is an ordered scale with equally spaced whole-number labels; because the numbers carry order and are treated as a count-like score on which arithmetic such as averaging is conventionally performed, it is treated as quantitative and discrete. Award the point for a coherent justification, noting it is sometimes described as ordinal.

Markers reward correct categorical-versus-quantitative classification, a correct discrete-versus-continuous split, and a reasoned justification for the borderline variable.

Related dot points

Sources & how we know this