Why does the approximately normal sampling distribution of a sample proportion make inference about a population proportion possible?
Topic 6.1 Introducing Statistics: Why Be Normal?: explain how the approximately normal sampling distribution of a sample proportion lets us quantify uncertainty and make inferences about an unknown population proportion.
A focused answer to AP Statistics Topic 6.1, on why the approximately normal sampling distribution of a sample proportion is the engine that lets us build confidence intervals and significance tests about an unknown population proportion.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 6.1) opens Unit 6 with the idea behind inference: because the sampling distribution of a sample proportion is approximately normal under known conditions, a single sample lets us make a quantified statement about the unknown population proportion . This topic is conceptual; it sets up the confidence intervals and significance tests that fill the rest of the unit.
From one sample to a statement about the population
You never see directly. You see one , which would change if you took a different sample. The breakthrough idea, built across Unit 5, is that this variation is not chaotic: the collection of all possible values forms a sampling distribution with a predictable center, spread, and shape. Topic 6.1 turns that fact into a tool. Because the sampling distribution is centered at , your single is a sensible estimate; because its spread is known, you can say how far off it is likely to be; because its shape is approximately normal, you can convert that into exact percentages.
Why normality is the key
Without an approximately normal sampling distribution, a single would be just a number with no attached uncertainty. Normality supplies the ruler: the -- pattern and z-scores translate "how many standard deviations is from a value of " into a probability. That is why every proportion procedure in Unit 6 begins by checking the large-counts and conditions; they are exactly the conditions that earn the normal model.
The two questions inference answers
Inference about takes two complementary forms, both powered by the normal sampling distribution.
- Estimation (confidence intervals). We have no claimed value of ; we want a plausible range. Center an interval at and extend it by a margin of error built from the normal model: .
- Testing (significance tests). Someone claims a specific (for example ). We ask how surprising our would be if that claim were true, using a z-statistic and a P-value.
Both rest on the same picture: is one point on a known bell curve. Recognizing that a single statistic is a draw from a distribution, not the truth itself, is the central reasoning move of the entire inference half of the course, and it is why Topic 6.1 is framed as an idea rather than a calculation.
A caution built into the idea
Inference quantifies uncertainty; it does not remove it. A confidence interval can miss , and a test can reach the wrong conclusion, because genuinely varies. The normal model tells you how often that happens (the confidence level, the error rates of Topic 6.7), which is the honest alternative to pretending one sample reveals the truth exactly. This is the mindset every later topic depends on.
Try this
Q1. State the parameter and the statistic in a study estimating the proportion of voters who support a measure. [1 point]
- Cue. Parameter: , the true proportion of all voters who support it. Statistic: , the sample proportion who support it.
Q2. Why does proportion inference require the large-counts condition? [1 point]
- Cue. It is what makes the sampling distribution of approximately normal, and normality is what lets us attach probabilities (margins of error, P-values) to a single .
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2019 (style)1 marksSection I (multiple choice). A pollster takes one random sample and reports . Which statement best describes why this single value still allows a statement about the unknown population proportion ? (A) always equals (B) comes from a known, approximately normal sampling distribution centered at (C) larger samples remove all uncertainty (D) the population is normalShow worked answer β
The correct answer is (B).
Inference works because the sample proportion is one draw from a sampling distribution that, when conditions hold, is approximately normal with mean and known spread . That known shape lets us quantify how far is likely to fall from .
(A) is false: varies from sample to sample. (C) is false: uncertainty shrinks but never vanishes. (D) confuses the population shape with the sampling distribution of .
AP 2021 (style)3 marksSection II (free response). A researcher will estimate the proportion of a town's adults who recycle from a single random sample of . (a) Explain why the researcher does not need to sample everyone to make a reasonable statement about . (b) Identify the feature of the sampling distribution of that makes this possible, and justify in context why that feature applies here.Show worked answer β
A 3-point conceptual question that previews the whole unit.
(a) (1 point) Because behaves predictably across samples: its sampling distribution is centered at the true , so a single sample proportion is a reasonable estimate and its likely error can be quantified.
(b) (2 points) The key feature is that the sampling distribution of is approximately normal (1 point) with mean and standard deviation . This applies here because the sample is random and, with , the large-counts condition will hold for any plausible , so we can use the normal model to attach a margin of error or a P-value to the estimate (1 point).
Markers reward connecting the single sample to a known, approximately normal sampling distribution and justifying normality in context.
Related dot points
- Topic 6.2 Constructing a Confidence Interval for a Population Proportion: identify the conditions, compute the point estimate, critical value, standard error, and margin of error, and construct and interpret a one-sample z-interval for a proportion.
A focused answer to AP Statistics Topic 6.2, on building a one-sample z-interval for a population proportion - checking conditions, finding the critical value, standard error, and margin of error - with a full worked interval and contextual interpretation.
- Topic 6.4 Setting Up a Test for a Population Proportion: state null and alternative hypotheses about a population proportion, identify the significance level, and verify the conditions for a one-sample z-test.
A focused answer to AP Statistics Topic 6.4, on writing the null and alternative hypotheses for a population proportion, choosing the significance level, and checking the random, large-counts (using the null value), and 10% conditions for a one-sample z-test.
- Topic 5.5 Sampling Distributions for Sample Proportions: describe the mean, standard deviation, and shape of the sampling distribution of a sample proportion, and check the conditions (10% and large counts) for the normal model.
A focused answer to AP Statistics Topic 5.5, on the mean, standard deviation, and approximately normal shape of the sampling distribution of a sample proportion, the 10% and large-counts conditions, and finding probabilities, with full worked calculations.
- Topic 5.3 The Central Limit Theorem: state and apply the central limit theorem, that the sampling distribution of the sample mean becomes approximately normal as the sample size grows, regardless of the population's shape.
A focused answer to AP Statistics Topic 5.3, on the central limit theorem, why the sample mean's distribution becomes normal as n grows regardless of population shape, the large-sample guideline, and its role in inference, with a worked application.
- Topic 5.1 Introducing Statistics: Why Is My Sample Not Like Yours? Recognize sampling variability, the difference between a parameter and a statistic, and that a statistic varies from sample to sample in a predictable way.
A focused answer to AP Statistics Topic 5.1, on sampling variability, the parameter-versus-statistic distinction, and why a statistic varies predictably from sample to sample, motivating the idea of a sampling distribution.
Sources & how we know this
- AP Statistics Course and Exam Description β College Board (2020)