Skip to main content
United StatesStatisticsSyllabus dot point

Why do different random samples from the same population give different statistics?

Topic 5.1 Introducing Statistics: Why Is My Sample Not Like Yours? Recognize sampling variability, the difference between a parameter and a statistic, and that a statistic varies from sample to sample in a predictable way.

A focused answer to AP Statistics Topic 5.1, on sampling variability, the parameter-versus-statistic distinction, and why a statistic varies predictably from sample to sample, motivating the idea of a sampling distribution.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Parameter versus statistic
  3. Sampling variability
  4. Why predictable variability is the key
  5. Connecting to what came before
  6. Try this

What this topic is asking

The College Board (Topic 5.1) wants you to recognize sampling variability, distinguish a parameter from a statistic, and understand that a statistic varies from sample to sample in a predictable way, the idea that motivates the whole unit.

Parameter versus statistic

The notation encodes the distinction: Greek-style or unadorned symbols (μ\mu, pp, σ\sigma) for population parameters, and English letters or hatted symbols (xˉ\bar{x}, p^\hat{p}, ss) for sample statistics. The whole point of inference is that the parameter is fixed but unknown, while the statistic is known but variable, so we use the variable statistic to estimate the fixed parameter, accounting for the variability.

Sampling variability

This reframes "why is my sample not like yours?" The answer is that two honest random samples should differ, because they are different subsets of the population. The achievement of the unit is to show that this variability is lawful: a statistic does not bounce around arbitrarily but follows a describable distribution, so we can say how close to the parameter a typical sample statistic falls.

Why predictable variability is the key

The reason Topic 5.1 opens the unit is that predictable variability is what makes inference possible at all. If a sample statistic varied wildly and unpredictably, a single sample would tell us nothing reliable about the population. But because the sampling distribution has a known center (typically the parameter itself, for an unbiased statistic), a known spread (which shrinks as nn grows), and often a known shape (approximately normal, by the central limit theorem), we can quantify how far a sample statistic is likely to be from the truth. That quantification is exactly what a confidence interval and a significance test do. So the apparently humble observation that samples differ is the seed of everything that follows: every interval and every test in Units 6 through 9 is, at heart, a statement about where a statistic falls in its sampling distribution. Understanding that a statistic is itself a random variable with a distribution, rather than a single fixed number, is the conceptual leap Topic 5.1 asks you to make.

Connecting to what came before

Two earlier threads converge here. From Unit 3, random sampling is what makes the sampling distribution well-behaved: only when samples are drawn randomly is the statistic's variation governed by known probability, so the whole unit assumes random selection. From Unit 4, a statistic computed from a random sample is a random variable (its value depends on the chance of which sample is drawn), so the mean, standard deviation, and shape ideas of Topic 4.7 to 4.9 apply directly to it. The sampling distribution of xˉ\bar{x} or p^\hat{p} is just the probability distribution of that random variable. Seeing Unit 5 as "apply the random-variable machinery of Unit 4 to a statistic computed from a random sample of Unit 3" ties the course together and explains why the formulas in the coming topics, the mean and standard deviation of p^\hat{p} and xˉ\bar{x}, look like the combining rules you already know.

Try this

Q1. Distinguish a parameter from a statistic, with an example of each. [2 points]

  • Cue. A parameter describes a population and is fixed (for example the true proportion pp); a statistic is computed from a sample and varies (for example the sample proportion p^\hat{p}).

Q2. Two random samples from the same population give different proportions. Is this a problem? Explain. [1 point]

  • Cue. No; it is sampling variability, the expected result of different samples containing different individuals, and it follows a predictable sampling distribution.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). A poll of 500500 voters finds 52%52\% support a candidate. The 52%52\% is best described as a (A) parameter (B) statistic (C) population proportion (D) census result
Show worked answer →

The correct answer is (B).

The 52%52\% comes from a sample of 500500 voters, so it is a statistic (a number computed from sample data). A parameter would describe the whole population.

(A) and (C) describe the fixed population value, which is unknown here. (D) a census measures everyone, not a sample of 500500. A value from a sample is a statistic.

AP 2021 (style)4 marksSection II (free response). Two students each take a separate random sample of 4040 students from the same school and compute the sample mean study time. Student A gets 5.25.2 hours; student B gets 4.84.8 hours. (a) Explain why the two means differ even though they sampled the same school. (b) Identify which quantities are parameters and which are statistics. (c) Explain what taking many such samples would reveal, justifying in context.
Show worked answer →

A 4-point question on sampling variability.

(a) (1 point) Each random sample contains different students, so the sample mean varies from sample to sample; this is sampling variability, expected even with good random sampling.
(b) (1 point) The true mean study time for all students at the school is a parameter (fixed, unknown); each sample mean (5.25.2 and 4.84.8 hours) is a statistic computed from a sample.
(c) (2 points) Taking many random samples and recording each mean (1 point) would build the sampling distribution of the sample mean, showing how the statistic varies and centering near the true parameter (1 point, in context).

Markers reward attributing the difference to sampling variability, correctly labelling the parameter and statistics, and recognizing that repeated sampling reveals the sampling distribution.

Related dot points

Sources & how we know this