Skip to main content
United StatesStatisticsSyllabus dot point

What are the center, spread, and shape of the sampling distribution of a sample proportion?

Topic 5.5 Sampling Distributions for Sample Proportions: describe the mean, standard deviation, and shape of the sampling distribution of a sample proportion, and check the conditions (10% and large counts) for the normal model.

A focused answer to AP Statistics Topic 5.5, on the mean, standard deviation, and approximately normal shape of the sampling distribution of a sample proportion, the 10% and large-counts conditions, and finding probabilities, with full worked calculations.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Center and spread of the sample proportion
  3. The two conditions
  4. Why the shape becomes normal
  5. Using the distribution
  6. Try this

What this topic is asking

The College Board (Topic 5.5) wants you to describe the mean, standard deviation, and shape of the sampling distribution of a sample proportion p^\hat{p}, and to check the conditions (the 10%10\% condition and the large-counts condition) that allow the normal model.

Center and spread of the sample proportion

The mean result, μp^=p\mu_{\hat{p}} = p, says p^\hat{p} is unbiased: its sampling distribution centers on the true proportion. The standard deviation formula comes straight from the binomial: the count of successes has standard deviation np(1p)\sqrt{np(1-p)}, and dividing the count by nn to get a proportion divides the standard deviation by nn, giving np(1p)n=p(1p)n\dfrac{\sqrt{np(1-p)}}{n} = \sqrt{\dfrac{p(1-p)}{n}}. The key feature is the n\sqrt{n} in the denominator: larger samples give a smaller spread, so p^\hat{p} clusters more tightly around pp.

The two conditions

The two conditions do different jobs. The 10%10\% condition protects the standard deviation formula (independence). The large-counts condition protects the shape (normality). A full answer checks both, because using the normal model without large counts, or the standard deviation formula on a too-large fraction of the population, is unjustified.

Why the shape becomes normal

The large-counts condition is the proportion's version of the central limit theorem. A sample proportion is really a sample mean of 00s and 11s (failure/success), so the same averaging logic applies: with enough trials, the distribution of p^\hat{p} smooths into a bell shape. When pp is near 00 or 11, the underlying binomial is skewed, so you need a larger nn before the normal approximation is good, which is exactly what np10np \ge 10 and n(1p)10n(1-p) \ge 10 enforce (they demand enough expected successes and failures). When both counts are at least about 1010, the skew has washed out and the normal model is accurate. This is why the condition is symmetric in successes and failures: a rare event (pp small) needs a bigger sample to accumulate 1010 expected successes, and a near-certain event (pp large) needs a bigger sample to accumulate 1010 expected failures. Recognizing that the proportion is a mean in disguise links Topic 5.5 to the CLT and explains why the same "large enough sample" theme recurs.

Using the distribution

Once the conditions confirm that p^\hat{p} is approximately normal with mean pp and standard deviation p(1p)n\sqrt{\dfrac{p(1-p)}{n}}, every probability question about p^\hat{p} is a normal-model calculation: standardize with z=p^pp(1p)/nz = \dfrac{\hat{p} - p}{\sqrt{p(1-p)/n}} and read the area from the standard normal. This is the direct foundation of inference for proportions in Unit 6: a confidence interval for pp uses this same standard deviation (estimated as a standard error), and a significance test computes exactly this z-score under an assumed pp. So Topic 5.5 is not an isolated calculation but the template for the proportion procedures that follow. A complete exam answer states the center and spread, checks both conditions, standardizes, finds the area, and interprets the result in context (for example, "there is about a 6.7%6.7\% chance a sample of this size gives a proportion above 0.460.46"), the same structured workflow used throughout the inference units.

Try this

Q1. For n=400n = 400 and p=0.5p = 0.5, find the mean and standard deviation of p^\hat{p}. [2 points]

  • Cue. μp^=0.5\mu_{\hat{p}} = 0.5; σp^=0.5×0.5400=0.000625=0.025\sigma_{\hat{p}} = \sqrt{\dfrac{0.5 \times 0.5}{400}} = \sqrt{0.000625} = 0.025.

Q2. State the large-counts condition and what it guarantees. [1 point]

  • Cue. np10np \ge 10 and n(1p)10n(1-p) \ge 10; it guarantees the sampling distribution of p^\hat{p} is approximately normal.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). For samples of size n=100n = 100 from a population with p=0.3p = 0.3, the standard deviation of the sampling distribution of p^\hat{p} is (A) 0.30.3 (B) 0.0210.021 (C) 0.0460.046 (D) 0.210.21
Show worked answer →

The correct answer is (C).

The standard deviation of p^\hat{p} is p(1p)n=0.3×0.7100=0.00210.046\sqrt{\dfrac{p(1-p)}{n}} = \sqrt{\dfrac{0.3 \times 0.7}{100}} = \sqrt{0.0021} \approx 0.046.

(A) is pp, not the standard deviation. (D) is p(1p)p(1-p), not divided by nn or square-rooted. (B) misplaces a factor. The correct formula gives about 0.0460.046.

AP 2022 (style)4 marksSection II (free response). In a population, 40%40\% of adults exercise daily. A random sample of n=150n = 150 adults is taken; let p^\hat{p} be the sample proportion who exercise daily. (a) Find the mean and standard deviation of the sampling distribution of p^\hat{p}. (b) Verify the conditions for the normal model. (c) Find the probability that p^\hat{p} exceeds 0.460.46, and interpret in context.
Show worked answer →

A 4-point question on the sampling distribution of a proportion.

(a) (2 points) μp^=p=0.40\mu_{\hat{p}} = p = 0.40 (1 point); σp^=p(1p)n=0.4×0.6150=0.0016=0.04\sigma_{\hat{p}} = \sqrt{\dfrac{p(1-p)}{n}} = \sqrt{\dfrac{0.4 \times 0.6}{150}} = \sqrt{0.0016} = 0.04 (1 point).
(b) (1 point) Large counts: np=150(0.4)=6010np = 150(0.4) = 60 \ge 10 and n(1p)=150(0.6)=9010n(1-p) = 150(0.6) = 90 \ge 10; assuming the sample is under 10%10\% of the adult population, both conditions hold, so p^\hat{p} is approximately normal.
(c) (1 point) z=0.460.400.04=1.5z = \dfrac{0.46 - 0.40}{0.04} = 1.5, so P(p^>0.46)=P(Z>1.5)0.0668P(\hat{p} > 0.46) = P(Z > 1.5) \approx 0.0668; about a 6.7%6.7\% chance the sample proportion exceeds 0.460.46.

Markers reward the correct mean and standard deviation, verifying the large-counts (and 10%) conditions, and the probability with a contextual interpretation.

Related dot points

Sources & how we know this