United StatesStatisticsSyllabus dot point

Why do different random samples from the same population give different statistics?

Topic 5.1 Introducing Statistics: Why Is My Sample Not Like Yours? Recognize sampling variability, the difference between a parameter and a statistic, and that a statistic varies from sample to sample in a predictable way.

A focused answer to AP Statistics Topic 5.1, on sampling variability, the parameter-versus-statistic distinction, and why a statistic varies predictably from sample to sample, motivating the idea of a sampling distribution.

Generated by Claude Opus 4.89 min answerUpdated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this topic is asking
Parameter versus statistic
Sampling variability
Why predictable variability is the key
Connecting to what came before
Try this

What this topic is asking

The College Board (Topic 5.1) wants you to recognize sampling variability, distinguish a parameter from a statistic, and understand that a statistic varies from sample to sample in a predictable way, the idea that motivates the whole unit.

Parameter versus statistic

The notation encodes the distinction: Greek-style or unadorned symbols ( $\mu$ , $p$ , $\sigma$ ) for population parameters, and English letters or hatted symbols ( $\bar{x}$ , $\hat{p}$ , $s$ ) for sample statistics. The whole point of inference is that the parameter is fixed but unknown, while the statistic is known but variable, so we use the variable statistic to estimate the fixed parameter, accounting for the variability.

Sampling variability

This reframes "why is my sample not like yours?" The answer is that two honest random samples should differ, because they are different subsets of the population. The achievement of the unit is to show that this variability is lawful: a statistic does not bounce around arbitrarily but follows a describable distribution, so we can say how close to the parameter a typical sample statistic falls.

Why predictable variability is the key

The reason Topic 5.1 opens the unit is that predictable variability is what makes inference possible at all. If a sample statistic varied wildly and unpredictably, a single sample would tell us nothing reliable about the population. But because the sampling distribution has a known center (typically the parameter itself, for an unbiased statistic), a known spread (which shrinks as $n$ grows), and often a known shape (approximately normal, by the central limit theorem), we can quantify how far a sample statistic is likely to be from the truth. That quantification is exactly what a confidence interval and a significance test do. So the apparently humble observation that samples differ is the seed of everything that follows: every interval and every test in Units 6 through 9 is, at heart, a statement about where a statistic falls in its sampling distribution. Understanding that a statistic is itself a random variable with a distribution, rather than a single fixed number, is the conceptual leap Topic 5.1 asks you to make.

Connecting to what came before

Two earlier threads converge here. From Unit 3, random sampling is what makes the sampling distribution well-behaved: only when samples are drawn randomly is the statistic's variation governed by known probability, so the whole unit assumes random selection. From Unit 4, a statistic computed from a random sample is a random variable (its value depends on the chance of which sample is drawn), so the mean, standard deviation, and shape ideas of Topic 4.7 to 4.9 apply directly to it. The sampling distribution of $\bar{x}$ or $\hat{p}$ is just the probability distribution of that random variable. Seeing Unit 5 as "apply the random-variable machinery of Unit 4 to a statistic computed from a random sample of Unit 3" ties the course together and explains why the formulas in the coming topics, the mean and standard deviation of $\hat{p}$ and $\bar{x}$ , look like the combining rules you already know.

Identifying parameters, statistics, and variability

A factory's machines fill bottles. The true mean fill is unknown. An inspector takes three separate random samples of $25$ bottles and gets sample means $498$ ml, $502$ ml, and $500$ ml. (a) Identify the parameter and the statistics. (b) Explain why the three means differ. (c) Describe what many such samples would show.

step 1 Parameter and statistics (part a)

The parameter is the true mean fill of all bottles, a fixed unknown value (call it $\mu$ ). The three sample means, $498$ , $502$ , and $500$ ml, are statistics (each a $\bar{x}$ from a sample of $25$ ).

step 2 Why they differ (part b)

Each random sample of $25$ bottles contains different bottles, so each sample mean reflects a slightly different set of fills. The variation among $498$ , $502$ , and $500$ is sampling variability, the expected result of sampling, not a defect in the machines or the sampling.

step 3 What many samples reveal (part c)

If the inspector took hundreds of random samples of $25$ and recorded every $\bar{x}$ , the values would form the sampling distribution of the sample mean: clustered around the true mean $\mu$ , with a predictable spread (smaller than the bottle-to-bottle spread) and an approximately normal shape.

step 4 Interpret

No single sample mean equals $\mu$ exactly, and that is expected. The power of the sampling distribution is that the means cluster predictably around $\mu$ , so one sample mean of, say, $500$ ml is a usable estimate of the unknown true mean, with quantifiable uncertainty.

Try this

Q1. Distinguish a parameter from a statistic, with an example of each. [2 points]

Cue. A parameter describes a population and is fixed (for example the true proportion $p$ ); a statistic is computed from a sample and varies (for example the sample proportion $\hat{p}$ ).

Q2. Two random samples from the same population give different proportions. Is this a problem? Explain. [1 point]

Cue. No; it is sampling variability, the expected result of different samples containing different individuals, and it follows a predictable sampling distribution.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). A poll of

500

voters finds

52\%

support a candidate. The

52\%

is best described as a (A) parameter (B) statistic (C) population proportion (D) census result

Show worked answer →

The correct answer is (B).

The $52\%$ comes from a sample of $500$ voters, so it is a statistic (a number computed from sample data). A parameter would describe the whole population.

(A) and (C) describe the fixed population value, which is unknown here. (D) a census measures everyone, not a sample of $500$ . A value from a sample is a statistic.

AP 2021 (style)4 marksSection II (free response). Two students each take a separate random sample of

40

students from the same school and compute the sample mean study time. Student A gets

5.2

hours; student B gets

4.8

hours. (a) Explain why the two means differ even though they sampled the same school. (b) Identify which quantities are parameters and which are statistics. (c) Explain what taking many such samples would reveal, justifying in context.

Show worked answer →

A 4-point question on sampling variability.

(a) (1 point) Each random sample contains different students, so the sample mean varies from sample to sample; this is sampling variability, expected even with good random sampling.
(b) (1 point) The true mean study time for all students at the school is a parameter (fixed, unknown); each sample mean ( $5.2$ and $4.8$ hours) is a statistic computed from a sample.
(c) (2 points) Taking many random samples and recording each mean (1 point) would build the sampling distribution of the sample mean, showing how the statistic varies and centering near the true parameter (1 point, in context).

Markers reward attributing the difference to sampling variability, correctly labelling the parameter and statistics, and recognizing that repeated sampling reveals the sampling distribution.

Related dot points

Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)