Skip to main content
United StatesStatisticsSyllabus dot point

How do you construct a confidence interval for the difference between two population proportions?

Topic 6.8 Confidence Intervals for the Difference of Two Proportions: check the conditions and construct a two-sample z-interval for the difference between two population proportions, using the unpooled standard error.

A focused answer to AP Statistics Topic 6.8, on building a two-sample z-interval for the difference of two population proportions - checking conditions for both samples and using the unpooled standard error - with a full worked interval.

Generated by Claude Opus 4.811 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. The interval and the unpooled standard error
  3. Checking the conditions for two samples
  4. Interpreting and using the interval
  5. Try this

What this topic is asking

The College Board (Topic 6.8) wants you to construct a two-sample z-interval for the difference of two population proportions p1βˆ’p2p_1 - p_2: check the conditions for both samples and use the unpooled standard error (each sample's own p^\hat{p}).

The interval and the unpooled standard error

The point estimate is the observed difference p^1βˆ’p^2\hat{p}_1 - \hat{p}_2. The standard error follows the Topic 5.6 rule: variances of independent quantities add, even for a difference. Each sample contributes p^i(1βˆ’p^i)/ni\hat{p}_i(1-\hat{p}_i)/n_i; you sum these and take the square root. This is unpooled because an interval has no common hypothesized proportion, so each sample is estimated separately. (A two-proportion test, Topic 6.10, pools the proportions; the interval does not. Keeping these apart is essential.)

Checking the conditions for two samples

Everything from the one-sample interval is doubled. There are four large-counts checks (successes and failures in each sample), and an extra independence requirement between samples. The large-counts check uses the observed p^1\hat{p}_1 and p^2\hat{p}_2 (it is an interval, so there is no null value to use).

Interpreting and using the interval

Interpret as usual: "We are C%C\% confident the difference p1βˆ’p2p_1 - p_2 is between [low] and [high]." The decisive feature is whether the interval contains 00. If the interval is entirely positive, p1>p2p_1 > p_2 is supported; entirely negative, p1<p2p_1 < p_2; and if it straddles 00, then 00 is a plausible difference, so the data do not establish that the two proportions differ. This zero-check is the two-sample analogue of the one-sample plausible-value test and previews the two-proportion significance test. The order of subtraction matters for the sign: state clearly which group is group 11 so a positive interval is interpreted correctly.

Try this

Q1. Write the standard error for a two-proportion interval and name why it is "unpooled." [2 points]

  • Cue. SE=p^1(1βˆ’p^1)n1+p^2(1βˆ’p^2)n2SE = \sqrt{\dfrac{\hat{p}_1(1-\hat{p}_1)}{n_1} + \dfrac{\hat{p}_2(1-\hat{p}_2)}{n_2}}; unpooled because each sample uses its own p^i\hat{p}_i (no common proportion is assumed).

Q2. A 95%95\% interval for p1βˆ’p2p_1 - p_2 is (0.04,0.19)(0.04, 0.19). Is there evidence of a difference? [1 point]

  • Cue. Yes; the interval excludes 00, so 00 is not plausible and there is convincing evidence the proportions differ.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). For a two-sample interval for p1βˆ’p2p_1 - p_2, the standard error is (A) p^(1βˆ’p^)(1/n1+1/n2)\sqrt{\hat{p}(1-\hat{p})(1/n_1 + 1/n_2)} (B) p^1(1βˆ’p^1)n1+p^2(1βˆ’p^2)n2\sqrt{\dfrac{\hat{p}_1(1-\hat{p}_1)}{n_1} + \dfrac{\hat{p}_2(1-\hat{p}_2)}{n_2}} (C) p^1(1βˆ’p^1)n1βˆ’p^2(1βˆ’p^2)n2\sqrt{\dfrac{\hat{p}_1(1-\hat{p}_1)}{n_1} - \dfrac{\hat{p}_2(1-\hat{p}_2)}{n_2}} (D) p^1βˆ’p^2n1+n2\dfrac{\hat{p}_1 - \hat{p}_2}{\sqrt{n_1 + n_2}}
Show worked answer β†’

The correct answer is (B).

A confidence interval for p1βˆ’p2p_1 - p_2 uses the unpooled standard error: each sample contributes its own p^i(1βˆ’p^i)/ni\hat{p}_i(1-\hat{p}_i)/n_i, and the two are added (variances add for independent samples), then square-rooted.

(A) is the pooled standard error, used only in a two-proportion test, not an interval. (C) subtracts the variances (wrong). (D) is not a valid standard error.

AP 2022 (style)4 marksSection II (free response). In a random sample of 150150 men, 9090 own a bicycle; in an independent random sample of 200200 women, 9696 own a bicycle. (a) Check the conditions for a two-sample z-interval for p1βˆ’p2p_1 - p_2, the difference in proportions (men minus women). (b) Construct a 95%95\% confidence interval. (c) Justify in context whether there is convincing evidence of a difference in bicycle ownership.
Show worked answer β†’

A 4-point two-proportion interval.

(a) (1 point) Random and independent: two independent random samples. Large counts: men 9090 and 6060, women 9696 and 104104, all β‰₯10\ge 10. 10%10\%: both samples plausibly under 10%10\% of their populations.
(b) (2 points) p^1=90/150=0.60\hat{p}_1 = 90/150 = 0.60, p^2=96/200=0.48\hat{p}_2 = 96/200 = 0.48. Difference =0.12= 0.12. SE=0.60(0.40)150+0.48(0.52)200=0.0016+0.001248=0.002848=0.05337SE = \sqrt{\dfrac{0.60(0.40)}{150} + \dfrac{0.48(0.52)}{200}} = \sqrt{0.0016 + 0.001248} = \sqrt{0.002848} = 0.05337. Interval =0.12Β±1.96(0.05337)=0.12Β±0.1046=(0.015,Β 0.225)= 0.12 \pm 1.96(0.05337) = 0.12 \pm 0.1046 = (0.015,\ 0.225).
(c) (1 point) The entire interval is above 00, so 00 is not a plausible value for p1βˆ’p2p_1 - p_2; there is convincing evidence that the proportion owning a bicycle differs (men higher than women).

Markers reward checking conditions for both samples, the unpooled standard error, the zβˆ—=1.96z^{*} = 1.96 interval, and judging the difference against 00.

Related dot points

Sources & how we know this