How do you choose the correct inference procedure for a categorical-data scenario?
Topic 8.7 Skills Focus: Selecting an Appropriate Inference Procedure for Categorical Data: choose among the one-proportion, two-proportion, and chi-square (goodness of fit, homogeneity, independence) procedures based on the scenario.
A focused answer to AP Statistics Topic 8.7, on choosing among one-proportion, two-proportion, and chi-square (goodness of fit, homogeneity, independence) procedures for categorical data, based on the number of variables, categories, and samples, with a worked decision.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 8.7) is a skills focus: given a categorical-data scenario, choose the correct procedure among the one-proportion z, two-proportion z, and the three chi-square tests (goodness of fit, homogeneity, independence), and justify the choice. It synthesizes the categorical inference of Units 6 and 8.
The categorical decision tree
These questions partition every categorical scenario. The trickiest boundaries are: (a) two-proportion z versus homogeneity, both compare groups, but two-proportion z is for a two-category variable across two groups, while homogeneity handles two or more categories and two or more groups (a homogeneity test and a two-proportion two-sided z-test actually agree); and (b) homogeneity versus independence, the same table arithmetic, distinguished by several samples (homogeneity) versus one sample, two variables (independence).
Why goodness of fit is different
Goodness of fit stands apart: it tests one categorical variable's distribution against a claimed distribution (equal, a ratio, or stated percentages), using a one-way list of counts, not a two-way table. The cue is a sentence like "the company claims colors appear in a ratio", a single variable measured against an external claim. There are no groups to compare and no second variable; that is what separates it from homogeneity and independence.
Implement and communicate
Implementation details follow the choice: a proportion test uses and the large-counts condition with counts of ; a chi-square test uses , expected counts, the condition, and degrees of freedom ( for goodness of fit, for a table). The conclusion wording must match: "the proportion differs," "the distributions differ across groups" (homogeneity), or "the variables are associated" (independence). As always, association is not causation, and statistical significance is not practical importance.
Try this
Q1. Separate random samples from four hospitals are classified by infection outcome (yes/no). Which procedure? [1 point]
- Cue. Chi-square test of homogeneity (several separate samples, one variable across groups).
Q2. What single cue separates goodness of fit from the two-way chi-square tests? [1 point]
- Cue. Goodness of fit tests one variable against a claimed distribution (a one-way list of counts), with no second variable and no separate groups.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2019 (style)1 marksSection I (multiple choice). A single random sample is classified by two categorical variables, each with three levels, to see if they are related. The correct procedure is (A) one-proportion z-test (B) chi-square goodness of fit (C) chi-square test of independence (D) two-proportion z-testShow worked answer →
The correct answer is (C).
One sample with two categorical variables (each multi-level) calls for a chi-square test of independence (are the variables associated?).
(A) and (D) are for one or two proportions (two-category situations). (B) is for one variable against a claimed distribution, not two variables.
AP 2021 (style)4 marksSection II (free response). For each scenario, name the appropriate categorical inference procedure and justify the choice. (a) Testing whether a single coin's heads proportion differs from . (b) Comparing the proportion of two independent groups that prefer a product. (c) Testing whether one die's six face counts match a uniform distribution. (d) Taking one sample of people and testing whether handedness and eye dominance are related.Show worked answer →
A 4-point procedure-selection question.
(a) (1 point) One-proportion z-test: one categorical variable with two outcomes, testing .
(b) (1 point) Two-proportion z-test (or interval): two independent groups, comparing a proportion.
(c) (1 point) Chi-square goodness-of-fit test: one variable, six categories, against a claimed (uniform) distribution.
(d) (1 point) Chi-square test of independence: one sample, two categorical variables, testing association.
Markers reward distinguishing two-category proportion procedures from multi-category chi-square ones, and goodness of fit (one variable vs a distribution) from independence (two variables, one sample).
Related dot points
- Topic 8.6 Carrying Out a Chi-Square Test for Homogeneity or Independence: compute the chi-square statistic from a two-way table, find the P-value using (rows minus 1)(columns minus 1) degrees of freedom, and state a conclusion in context.
A focused answer to AP Statistics Topic 8.6, on computing the chi-square statistic from a two-way table, finding the P-value with (r minus 1)(c minus 1) degrees of freedom, and stating a conclusion in context, with a full worked test.
- Topic 8.3 Carrying Out a Chi-Square Test for Goodness of Fit: compute the chi-square statistic from observed and expected counts, find the P-value using k minus 1 degrees of freedom, and state a conclusion in context.
A focused answer to AP Statistics Topic 8.3, on computing the chi-square statistic from observed and expected counts, finding the P-value with k minus 1 degrees of freedom, and stating a conclusion in context, with a full worked test.
- Topic 8.5 Setting Up a Chi-Square Test for Homogeneity or Independence: distinguish a test of homogeneity from a test of independence based on the design, state the appropriate hypotheses, and check the conditions.
A focused answer to AP Statistics Topic 8.5, on distinguishing a chi-square test of homogeneity (several groups, same variable) from a test of independence (one sample, two variables), stating the right hypotheses, and checking the conditions.
- Topic 6.11 Carrying Out a Test for the Difference of Two Population Proportions: compute the two-sample z test statistic using the pooled standard error, find the P-value, and state a conclusion in context.
A focused answer to AP Statistics Topic 6.11, on computing the two-sample z statistic with the pooled standard error, finding the P-value, and stating a conclusion in context, with a full worked two-proportion test.
- Topic 7.10 Skills Focus: Selecting, Implementing, and Communicating Inference Procedures: identify the appropriate confidence interval or significance test for a scenario (proportion or mean, one or two samples, paired or independent), and carry it out and communicate the result correctly.
A focused answer to AP Statistics Topic 7.10, on choosing the correct inference procedure (proportion vs mean, one vs two samples, paired vs independent, interval vs test) for a scenario and implementing and communicating it correctly, with a worked decision and procedure.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)