How do you state the hypotheses and decide between a chi-square test of homogeneity and one of independence?
Topic 8.5 Setting Up a Chi-Square Test for Homogeneity or Independence: distinguish a test of homogeneity from a test of independence based on the design, state the appropriate hypotheses, and check the conditions.
A focused answer to AP Statistics Topic 8.5, on distinguishing a chi-square test of homogeneity (several groups, same variable) from a test of independence (one sample, two variables), stating the right hypotheses, and checking the conditions.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 8.5) wants you to distinguish a chi-square test of homogeneity from one of independence based on the design, state the appropriate hypotheses for each, and check the conditions.
The design decides the test
This is the central decision of the topic. The chi-square arithmetic is identical for the two tests (same expected counts, same statistic, same degrees of freedom), but the design, hypotheses, and conclusion wording differ. Homogeneity compares separate groups ("do these populations have the same distribution?"); independence examines one population ("are these two traits associated?"). Identify the sampling design before writing anything.
Hypotheses for each test
The wording must match the design. For homogeneity, name the variable and the groups ("the distribution of sport participation is the same across the three schools"). For independence, name the two variables ("education level and employment status are independent"). Hypotheses are stated in words about the categorical relationship, not as numeric parameter equalities, which is a notable contrast with the mean and proportion tests.
Checking the conditions
The condition check uses the expected counts (each ), exactly as in goodness of fit and as computed in Topic 8.4. The randomness requirement is read according to the design: homogeneity needs each group to be a random sample (or randomly assigned in an experiment); independence needs the single sample to be random. Verifying these is the gate to the test in Topic 8.6.
Try this
Q1. One sample of voters is classified by gender and party preference. Which test? [1 point]
- Cue. Independence (one sample, two categorical variables); : gender and party preference are independent.
Q2. State the homogeneity null when comparing a response across four separate groups. [1 point]
- Cue. : the distribution of the response variable is the same across all four groups.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2019 (style)1 marksSection I (multiple choice). A researcher takes one random sample and records two categorical variables on each person. The appropriate chi-square test is (A) goodness of fit (B) homogeneity (C) independence (D) a two-proportion z-testShow worked answer →
The correct answer is (C).
One sample with two categorical variables recorded on each unit calls for a chi-square test of independence (are the two variables associated?).
(A) is for one variable against a claimed distribution. (B) is for comparing one variable across several separate samples/groups. (D) compares only two proportions, not a full table.
AP 2021 (style)3 marksSection II (free response). Design 1: separate random samples of students from each of three schools are classified by whether they participate in sport (yes/no). Design 2: one random sample of adults is classified by both education level (three levels) and employment status (two levels). (a) Name the appropriate chi-square test for each design. (b) State the hypotheses for each. (c) Explain the design difference that determines the choice.Show worked answer →
A 3-point design-and-setup question.
(a) (1 point) Design 1: chi-square test of homogeneity (several separate samples, one variable). Design 2: chi-square test of independence (one sample, two variables).
(b) (1 point) Homogeneity: : the distribution of sport participation is the same across the three schools; : it is not the same for all schools. Independence: : education level and employment status are independent; : they are associated (not independent).
(c) (1 point) Homogeneity uses multiple separate samples with a fixed size from each group, comparing one variable's distribution; independence uses one sample and measures two variables on each unit.
Markers reward matching design to test, the distinct hypothesis wordings, and the multiple-samples versus one-sample distinction.
Related dot points
- Topic 8.6 Carrying Out a Chi-Square Test for Homogeneity or Independence: compute the chi-square statistic from a two-way table, find the P-value using (rows minus 1)(columns minus 1) degrees of freedom, and state a conclusion in context.
A focused answer to AP Statistics Topic 8.6, on computing the chi-square statistic from a two-way table, finding the P-value with (r minus 1)(c minus 1) degrees of freedom, and stating a conclusion in context, with a full worked test.
- Topic 8.4 Expected Counts in Two-Way Tables: compute the expected count for each cell of a two-way table under the null hypothesis using the row total times column total divided by the grand total.
A focused answer to AP Statistics Topic 8.4, on computing expected counts in a two-way table under the null of no association, using row total times column total over the grand total, and why this formula encodes independence.
- Topic 8.2 Setting Up a Chi-Square Goodness of Fit Test: state the hypotheses for a goodness-of-fit test, compute expected counts from a claimed distribution, and verify the conditions.
A focused answer to AP Statistics Topic 8.2, on stating the hypotheses for a goodness-of-fit test, computing expected counts from a claimed distribution, and checking the random, large-counts (expected at least 5), and 10% conditions.
- Topic 2.2 Representing Two Categorical Variables: construct and interpret two-way (contingency) tables and segmented or side-by-side bar graphs for two categorical variables.
A focused answer to AP Statistics Topic 2.2, on building and reading two-way tables and segmented or side-by-side bar graphs for two categorical variables, with marginal totals and a worked table.
- Topic 8.7 Skills Focus: Selecting an Appropriate Inference Procedure for Categorical Data: choose among the one-proportion, two-proportion, and chi-square (goodness of fit, homogeneity, independence) procedures based on the scenario.
A focused answer to AP Statistics Topic 8.7, on choosing among one-proportion, two-proportion, and chi-square (goodness of fit, homogeneity, independence) procedures for categorical data, based on the number of variables, categories, and samples, with a worked decision.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)