
Why does the approximately normal sampling distribution of a sample proportion make inference about a population proportion possible?

Topic 6.1 Introducing Statistics: Why Be Normal?: explain how the approximately normal sampling distribution of a sample proportion lets us quantify uncertainty and make inferences about an unknown population proportion.

A focused answer to AP Statistics Topic 6.1, on why the approximately normal sampling distribution of a sample proportion is the engine that lets us build confidence intervals and significance tests about an unknown population proportion.

Generated by Claude Opus 4.8. 9 min answer. Updated 2026-06-04.

Reviewed by: AI editorial process; not yet individually human-reviewed



What this topic is asking

The College Board (Topic 6.1) opens Unit 6 with the idea behind inference: because the sampling distribution of a sample proportion $\hat{p}$ is approximately normal under known conditions, a single sample lets us make a quantified statement about the unknown population proportion $p$ . This topic is conceptual; it sets up the confidence intervals and significance tests that fill the rest of the unit.

From one sample to a statement about the population

You never see $p$ directly. You see one $\hat{p}$ , which would change if you took a different sample. The breakthrough idea, built across Unit 5, is that this variation is not chaotic: the collection of all possible $\hat{p}$ values forms a sampling distribution with a predictable center, spread, and shape. Topic 6.1 turns that fact into a tool. Because the sampling distribution is centered at $p$ , your single $\hat{p}$ is a sensible estimate; because its spread follows a known formula, you can say how far off it is likely to be; because its shape is approximately normal, you can convert that distance into approximate probabilities.
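The claim that $\hat{p}$ varies in a predictable way can be checked with a small simulation. The sketch below is illustrative only: the values $p = 0.30$ and $n = 120$ are hypothetical, the helper name `simulate_p_hats` is invented for this example, and each sample is drawn as independent yes/no outcomes (an assumption that matches a very large population).

```python
import random
import statistics

def simulate_p_hats(p, n, num_samples, seed=0):
    """Draw num_samples random samples of size n; return each sample proportion."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    p_hats = []
    for _ in range(num_samples):
        successes = sum(1 for _ in range(n) if rng.random() < p)
        p_hats.append(successes / n)
    return p_hats

# Hypothetical values chosen only for illustration.
p_true, n = 0.30, 120
p_hats = simulate_p_hats(p_true, n, num_samples=5000)

center = statistics.mean(p_hats)            # should sit close to p
spread = statistics.pstdev(p_hats)          # should sit close to the formula below
theory_sd = (p_true * (1 - p_true) / n) ** 0.5
within_2sd = sum(abs(ph - p_true) <= 2 * theory_sd for ph in p_hats) / len(p_hats)

print(f"mean of p-hats: {center:.4f} (p = {p_true})")
print(f"SD of p-hats:   {spread:.4f} (formula: {theory_sd:.4f})")
print(f"share within 2 SD of p: {within_2sd:.3f}")
```

The simulated center and spread should land near $p$ and $\sqrt{p(1-p)/n}$ , and roughly $95\%$ of the simulated $\hat{p}$ values should fall within two standard deviations of $p$ , which is the bell-shaped behavior the paragraph describes.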

Why normality is the key

Without an approximately normal sampling distribution, a single $\hat{p}$ would be just a number with no attached uncertainty. Normality supplies the ruler: the $68$ - $95$ - $99.7$ pattern and z-scores translate "how many standard deviations is $\hat{p}$ from a value of $p$ " into a probability. That is why every proportion procedure in Unit 6 begins by checking the large-counts and $10\%$ conditions: large counts justifies the approximately normal shape, and the $10\%$ condition justifies the standard deviation formula. Together they earn the normal model.
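To see where the $68$ - $95$ - $99.7$ pattern comes from, the sketch below uses `NormalDist` from Python's standard library to turn "within $k$ standard deviations" into a probability. The function name `prob_within` is invented for this illustration.

```python
from statistics import NormalDist

standard_normal = NormalDist(mu=0, sigma=1)

def prob_within(k):
    """Probability that a normal value lies within k standard deviations of its mean."""
    return standard_normal.cdf(k) - standard_normal.cdf(-k)

for k in (1, 2, 3):
    print(f"within {k} SD: {prob_within(k):.4f}")
```

The three printed values are approximately $0.68$ , $0.95$ , and $0.997$ . The same `cdf` call works for any z-score, not only whole numbers of standard deviations.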

The two questions inference answers

Inference about $p$ takes two complementary forms, both powered by the normal sampling distribution.

Estimation (confidence intervals). We have no claimed value of $p$ ; we want a plausible range. Center an interval at $\hat{p}$ and extend it by a margin of error built from the normal model: $\hat{p} \pm z^{*}\sqrt{\hat{p}(1-\hat{p})/n}$ .
Testing (significance tests). Someone claims a specific $p$ (for example $p = 0.5$ ). We ask how surprising our $\hat{p}$ would be if that claim were true, using a z-statistic and a P-value.

Both rest on the same picture: $\hat{p}$ is one point on a known bell curve. Recognizing that a single statistic is a draw from a distribution, not the truth itself, is the central reasoning move of the entire inference half of the course, and it is why Topic 6.1 is framed as an idea rather than a calculation.
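The two forms of inference can be sketched side by side. This is a minimal illustration, not the course's required layout: the function names are invented, the test shown is two-sided, and the inputs ( $\hat{p} = 0.52$ from a hypothetical sample of $n = 1000$ , tested against a claimed $p = 0.5$ ) are made up for the example.

```python
from statistics import NormalDist

def confidence_interval(p_hat, n, confidence=0.95):
    """One-sample z interval for a proportion: p_hat +/- z* sqrt(p_hat(1-p_hat)/n)."""
    z_star = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    margin = z_star * (p_hat * (1 - p_hat) / n) ** 0.5
    return p_hat - margin, p_hat + margin

def z_test(p_hat, n, p0):
    """Two-sided one-proportion z-test of the claim p = p0."""
    sd_under_claim = (p0 * (1 - p0) / n) ** 0.5  # spread uses the claimed p0
    z = (p_hat - p0) / sd_under_claim
    p_value = 2 * (1 - NormalDist().cdf(abs(z)))
    return z, p_value

# Hypothetical sample, for illustration only.
low, high = confidence_interval(p_hat=0.52, n=1000)
z, p_value = z_test(p_hat=0.52, n=1000, p0=0.5)

print(f"95% interval: ({low:.3f}, {high:.3f})")
print(f"z = {z:.2f}, P-value = {p_value:.3f}")
```

Note the one design difference: the interval builds its spread from $\hat{p}$ because no value of $p$ is claimed, while the test builds its spread from the claimed value because it asks what would happen if the claim were true.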

A caution built into the idea

Inference quantifies uncertainty; it does not remove it. A confidence interval can miss $p$ , and a test can reach the wrong conclusion, because $\hat{p}$ genuinely varies. The normal model tells you how often that happens (the confidence level, the error rates of Topic 6.7), which is the honest alternative to pretending one sample reveals the truth exactly. This is the mindset every later topic depends on.
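A simulation makes "how often that happens" concrete. The sketch below assumes a hypothetical true proportion of $0.30$ and samples of size $120$ , builds a $95\%$ interval from each simulated sample, and counts how often the interval captures $p$ . The name `coverage_rate` is invented for this example.

```python
import random
from statistics import NormalDist

def coverage_rate(p, n, confidence=0.95, num_samples=2000, seed=1):
    """Fraction of simulated intervals p_hat +/- z* sqrt(p_hat(1-p_hat)/n) that contain p."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    z_star = NormalDist().inv_cdf(1 - (1 - confidence) / 2)
    hits = 0
    for _ in range(num_samples):
        successes = sum(1 for _ in range(n) if rng.random() < p)
        p_hat = successes / n
        margin = z_star * (p_hat * (1 - p_hat) / n) ** 0.5
        if p_hat - margin <= p <= p_hat + margin:
            hits += 1
    return hits / num_samples

# Hypothetical true proportion and sample size.
rate = coverage_rate(p=0.30, n=120)
print(f"share of intervals that captured p: {rate:.3f}")
```

The printed share should be close to, but not exactly, $0.95$ : some intervals miss, and the confidence level describes roughly how often.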

Why one sample is enough to say something

A school will estimate the proportion $p$ of its $1500$ students who walk to school, using a single random sample of $n = 120$ . Explain, with reference to the sampling distribution of $\hat{p}$ , why this one sample supports a reasonable statement about $p$ .

Step 1. Name the parameter and statistic

The unknown parameter is $p$ , the true proportion of all $1500$ students who walk. The statistic is $\hat{p}$ , the sample proportion from the $120$ students, which will vary from sample to sample.

Step 2. Describe the sampling distribution

Across all possible random samples of size $120$ , $\hat{p}$ has mean $\mu_{\hat{p}} = p$ (it is unbiased) and standard deviation $\sqrt{p(1-p)/120}$ . So a single $\hat{p}$ is centered on the truth and has a known, modest spread.

Step 3. Justify the normal shape

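The sample is random. The large-counts condition, $np \ge 10$ and $n(1-p) \ge 10$ , holds with $n = 120$ for any $p$ between about $0.09$ and $0.91$ , so it fails only if walking is very rare or nearly universal. The $10\%$ condition is met because $120$ is $8\%$ of $1500$ , under the $10\%$ limit, so the standard deviation formula is acceptable. Thus the sampling distribution of $\hat{p}$ is approximately normal.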
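The two numeric checks in this step can be written as a short function. This is a sketch with an invented name, `check_conditions`, and because $p$ is unknown in practice, the values of $p$ passed in below are hypothetical trial values rather than data from the school.

```python
def check_conditions(n, population_size, p):
    """Return (large_counts_ok, ten_percent_ok) for a sample of size n."""
    large_counts_ok = n * p >= 10 and n * (1 - p) >= 10
    ten_percent_ok = 10 * n <= population_size  # n is at most 10% of the population
    return large_counts_ok, ten_percent_ok

# School example: n = 120 from 1500 students, with hypothetical values of p.
print(check_conditions(120, 1500, p=0.30))  # a moderate p
print(check_conditions(120, 1500, p=0.05))  # an extreme p: expected walkers = 6
```

With a moderate $p$ both checks pass; with $p = 0.05$ the expected count of walkers is only $6$ , so large counts fails even though the $10\%$ condition still holds.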

Step 4. Connect to a statement about $p$

Because $\hat{p}$ is approximately normal and centered at $p$ , we can attach a margin of error: most samples land within about $2\sqrt{p(1-p)/120}$ of $p$ . A confidence interval built around the observed $\hat{p}$ therefore captures $p$ with a known success rate, so one sample, properly analyzed, yields a defensible statement about all $1500$ students.
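To get a feel for the size of that margin, take $p = 0.5$ as a purely illustrative value (it is not given in the problem). It is the value that makes $p(1-p)$ , and therefore the spread, as large as possible:

$$2\sqrt{\frac{p(1-p)}{120}} = 2\sqrt{\frac{(0.5)(0.5)}{120}} \approx 2(0.0456) \approx 0.091$$

Under that assumption, most samples of $120$ students give a $\hat{p}$ within about $9$ percentage points of $p$ , and for any other value of $p$ the margin is smaller.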

Try this

Q1. State the parameter and the statistic in a study estimating the proportion of voters who support a measure. [1 point]

Cue. Parameter: $p$ , the true proportion of all voters who support it. Statistic: $\hat{p}$ , the sample proportion who support it.

Q2. Why does proportion inference require the large-counts condition? [1 point]

Cue. It is what makes the sampling distribution of $\hat{p}$ approximately normal, and normality is what lets us attach probabilities (margins of error, P-values) to a single $\hat{p}$ .

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style). 1 mark. Section I (multiple choice). A pollster takes one random sample and reports $\hat{p} = 0.52$ . Which statement best describes why this single value still allows a statement about the unknown population proportion $p$ ?

(A) $\hat{p}$ always equals $p$
(B) $\hat{p}$ comes from a known, approximately normal sampling distribution centered at $p$


The correct answer is (B).

Inference works because the sample proportion $\hat{p}$ is one draw from a sampling distribution that, when conditions hold, is approximately normal with mean $p$ and known spread $\sqrt{p(1-p)/n}$ . That known shape lets us quantify how far $\hat{p}$ is likely to fall from $p$ .

(A) is false: $\hat{p}$ varies from sample to sample. (C) is false: uncertainty shrinks but never vanishes. (D) confuses the population shape with the sampling distribution of $\hat{p}$ .

AP 2021 (style). 3 marks. Section II (free response). A researcher will estimate the proportion $p$ of a town's adults who recycle from a single random sample of $n = 200$ .

(a) Explain why the researcher does not need to sample everyone to make a reasonable statement about $p$ .
(b) Identify the feature of the sampling distribution of $\hat{p}$ that makes this possible, and justify in context why that feature applies here.


A 3-point conceptual question that previews the whole unit.

(a) (1 point) Because $\hat{p}$ behaves predictably across samples: its sampling distribution is centered at the true $p$ , so a single sample proportion is a reasonable estimate and its likely error can be quantified.
(b) (2 points) The key feature is that the sampling distribution of $\hat{p}$ is approximately normal (1 point) with mean $p$ and standard deviation $\sqrt{p(1-p)/n}$ . This applies here because the sample is random and, with $n = 200$ , the large-counts condition holds for any $p$ between $0.05$ and $0.95$ (since $200(0.05) = 10$ ), which covers any plausible proportion of adults who recycle, so we can use the normal model to attach a margin of error or a P-value to the estimate (1 point).

Markers reward connecting the single sample to a known, approximately normal sampling distribution and justifying normality in context.


Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)