Why do different random samples from the same population give different statistics?
Topic 5.1 Introducing Statistics: Why Is My Sample Not Like Yours? Recognize sampling variability, the difference between a parameter and a statistic, and that a statistic varies from sample to sample in a predictable way.
A focused answer to AP Statistics Topic 5.1, on sampling variability, the parameter-versus-statistic distinction, and why a statistic varies predictably from sample to sample, motivating the idea of a sampling distribution.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 5.1) wants you to recognize sampling variability, distinguish a parameter from a statistic, and understand that a statistic varies from sample to sample in a predictable way, the idea that motivates the whole unit.
Parameter versus statistic
The notation encodes the distinction: Greek-style or unadorned symbols (, , ) for population parameters, and English letters or hatted symbols (, , ) for sample statistics. The whole point of inference is that the parameter is fixed but unknown, while the statistic is known but variable, so we use the variable statistic to estimate the fixed parameter, accounting for the variability.
Sampling variability
This reframes "why is my sample not like yours?" The answer is that two honest random samples should differ, because they are different subsets of the population. The achievement of the unit is to show that this variability is lawful: a statistic does not bounce around arbitrarily but follows a describable distribution, so we can say how close to the parameter a typical sample statistic falls.
Why predictable variability is the key
The reason Topic 5.1 opens the unit is that predictable variability is what makes inference possible at all. If a sample statistic varied wildly and unpredictably, a single sample would tell us nothing reliable about the population. But because the sampling distribution has a known center (typically the parameter itself, for an unbiased statistic), a known spread (which shrinks as grows), and often a known shape (approximately normal, by the central limit theorem), we can quantify how far a sample statistic is likely to be from the truth. That quantification is exactly what a confidence interval and a significance test do. So the apparently humble observation that samples differ is the seed of everything that follows: every interval and every test in Units 6 through 9 is, at heart, a statement about where a statistic falls in its sampling distribution. Understanding that a statistic is itself a random variable with a distribution, rather than a single fixed number, is the conceptual leap Topic 5.1 asks you to make.
Connecting to what came before
Two earlier threads converge here. From Unit 3, random sampling is what makes the sampling distribution well-behaved: only when samples are drawn randomly is the statistic's variation governed by known probability, so the whole unit assumes random selection. From Unit 4, a statistic computed from a random sample is a random variable (its value depends on the chance of which sample is drawn), so the mean, standard deviation, and shape ideas of Topic 4.7 to 4.9 apply directly to it. The sampling distribution of or is just the probability distribution of that random variable. Seeing Unit 5 as "apply the random-variable machinery of Unit 4 to a statistic computed from a random sample of Unit 3" ties the course together and explains why the formulas in the coming topics, the mean and standard deviation of and , look like the combining rules you already know.
Try this
Q1. Distinguish a parameter from a statistic, with an example of each. [2 points]
- Cue. A parameter describes a population and is fixed (for example the true proportion ); a statistic is computed from a sample and varies (for example the sample proportion ).
Q2. Two random samples from the same population give different proportions. Is this a problem? Explain. [1 point]
- Cue. No; it is sampling variability, the expected result of different samples containing different individuals, and it follows a predictable sampling distribution.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2019 (style)1 marksSection I (multiple choice). A poll of voters finds support a candidate. The is best described as a (A) parameter (B) statistic (C) population proportion (D) census resultShow worked answer →
The correct answer is (B).
The comes from a sample of voters, so it is a statistic (a number computed from sample data). A parameter would describe the whole population.
(A) and (C) describe the fixed population value, which is unknown here. (D) a census measures everyone, not a sample of . A value from a sample is a statistic.
AP 2021 (style)4 marksSection II (free response). Two students each take a separate random sample of students from the same school and compute the sample mean study time. Student A gets hours; student B gets hours. (a) Explain why the two means differ even though they sampled the same school. (b) Identify which quantities are parameters and which are statistics. (c) Explain what taking many such samples would reveal, justifying in context.Show worked answer →
A 4-point question on sampling variability.
(a) (1 point) Each random sample contains different students, so the sample mean varies from sample to sample; this is sampling variability, expected even with good random sampling.
(b) (1 point) The true mean study time for all students at the school is a parameter (fixed, unknown); each sample mean ( and hours) is a statistic computed from a sample.
(c) (2 points) Taking many random samples and recording each mean (1 point) would build the sampling distribution of the sample mean, showing how the statistic varies and centering near the true parameter (1 point, in context).
Markers reward attributing the difference to sampling variability, correctly labelling the parameter and statistics, and recognizing that repeated sampling reveals the sampling distribution.
Related dot points
- Topic 5.2 The Normal Distribution, Revisited: revisit the normal model and z-scores in the context of distributions of statistics, finding proportions and using the standard normal as the basis for later inference.
A focused answer to AP Statistics Topic 5.2, revisiting the normal model and z-scores for distributions of statistics, finding proportions and percentiles, and setting up the standard normal as the engine of sampling-distribution calculations.
- Topic 5.3 The Central Limit Theorem: state and apply the central limit theorem, that the sampling distribution of the sample mean becomes approximately normal as the sample size grows, regardless of the population's shape.
A focused answer to AP Statistics Topic 5.3, on the central limit theorem, why the sample mean's distribution becomes normal as n grows regardless of population shape, the large-sample guideline, and its role in inference, with a worked application.
- Topic 5.4 Biased and Unbiased Point Estimates: define an unbiased estimator (its sampling distribution centers on the parameter), and distinguish the bias of an estimator from its variability.
A focused answer to AP Statistics Topic 5.4, defining unbiased estimators whose sampling distributions center on the parameter, distinguishing bias from variability, and why both matter when choosing an estimator, with a worked comparison.
- Topic 5.5 Sampling Distributions for Sample Proportions: describe the mean, standard deviation, and shape of the sampling distribution of a sample proportion, and check the conditions (10% and large counts) for the normal model.
A focused answer to AP Statistics Topic 5.5, on the mean, standard deviation, and approximately normal shape of the sampling distribution of a sample proportion, the 10% and large-counts conditions, and finding probabilities, with full worked calculations.
- Topic 5.7 Sampling Distributions for Sample Means: describe the mean, standard deviation, and shape of the sampling distribution of a sample mean, using the central limit theorem and the standard deviation formula sigma over root n.
A focused answer to AP Statistics Topic 5.7, on the mean, standard deviation, and shape of the sampling distribution of a sample mean, the sigma-over-root-n formula, the conditions for normality, and finding probabilities, with full worked calculations.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)