What makes a statistic an unbiased estimator, and how do bias and variability differ?
Topic 5.4 Biased and Unbiased Point Estimates: define an unbiased estimator (its sampling distribution centers on the parameter), and distinguish the bias of an estimator from its variability.
A focused answer to AP Statistics Topic 5.4, defining unbiased estimators whose sampling distributions center on the parameter, distinguishing bias from variability, and why both matter when choosing an estimator, with a worked comparison.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 5.4) wants you to define an unbiased estimator, a statistic whose sampling distribution centers on the parameter, and to distinguish the bias of an estimator from its variability.
Unbiased estimators
Unbiasedness is a statement about the center of the sampling distribution, not about any single estimate. No individual sample mean need equal (and usually none does), but if the sample means from all possible samples average to , then is unbiased. This is why the previous topics established that and : those equalities are precisely the statements that and are unbiased estimators.
Bias versus variability
The classic image is a target. Bias is whether your shots cluster around the bullseye (unbiased) or around a point off to the side (biased). Variability is how tightly your shots group, regardless of where they center. You want shots that are both on target (unbiased) and tightly grouped (low variability). The two problems, wrong aim versus inconsistent aim, are fixed by different things, and conflating them is the error this topic guards against.
Why both properties matter
A good estimator needs both low bias and low variability, and recognizing the trade-off between them is the heart of Topic 5.4. An unbiased estimator that is wildly variable can be far from the parameter on any given sample, even though it is right on average, so unbiasedness alone does not guarantee a useful single estimate. A very precise estimator that is biased will be reliably wrong, consistently missing the parameter in the same direction, so low variability alone is no comfort either. The exam often presents two estimators, one unbiased but imprecise, one precise but slightly biased, and asks you to compare them, which requires you to evaluate both properties rather than declaring a winner on one. The practical resolution depends on context and on the size of the bias: a tiny bias with much lower variability can beat a large variability with no bias. Crucially, because bias cannot be reduced by collecting more data (only the variability shrinks with ), an unbiased estimator has a real advantage: you can drive its variability down with a larger sample and trust that it is converging on the right value. This is exactly why and , being unbiased, are the estimators used throughout inference.
Connecting to sampling and inference
Topic 5.4 ties the descriptive idea of bias from Unit 3 to the formal sampling-distribution view. In Unit 3, a biased sampling method systematically missed the truth; here, a biased estimator systematically centers off the parameter, and again, more data does not cure it. The link to inference is direct: confidence intervals and significance tests assume the statistic is an unbiased estimator, so that the interval is centered on a value that, on average, equals the parameter. The standard error, the estimated standard deviation of the sampling distribution, that drives the width of every interval is a measure of the estimator's variability, and the behavior from the previous topics is why larger samples give narrower, more precise intervals around an unbiased center. So understanding bias and variability separately is essential preparation: it tells you that inference works because the estimators are unbiased (right on average) and that precision improves with sample size (variability shrinks), the two facts every later procedure depends on.
Try this
Q1. Define an unbiased estimator in terms of its sampling distribution. [2 points]
- Cue. An estimator whose sampling distribution is centered on the parameter, that is, the mean of the sampling distribution equals the parameter, so it does not systematically over- or under-estimate.
Q2. Does increasing the sample size reduce an estimator's bias? Explain. [1 point]
- Cue. No; a larger sample reduces variability (the spread of the sampling distribution) but not bias, which is a systematic centering problem unaffected by sample size.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2019 (style)1 marksSection I (multiple choice). A statistic is an unbiased estimator of a parameter if (A) it always equals the parameter (B) its sampling distribution is centered at the parameter (C) it has the smallest possible variability (D) it comes from a large sampleShow worked answer →
The correct answer is (B).
An estimator is unbiased if the mean of its sampling distribution equals the parameter, so on average it neither overestimates nor underestimates. It need not equal the parameter on any single sample.
(A) is too strong; no single estimate need equal the parameter. (C) describes low variability, a separate property. (D) sample size affects variability, not bias. Centered at the parameter defines unbiased.
AP 2021 (style)4 marksSection II (free response). Two methods estimate a population proportion. Method A's estimates center exactly on the true value but vary widely; method B's estimates are tightly clustered but center slightly below the true value. (a) Which method is unbiased? Justify. (b) Describe the variability of each. (c) Explain the trade-off and which you might prefer, justifying in context.Show worked answer →
A 4-point question on bias and variability.
(a) (1 point) Method A is unbiased: its estimates center on the true value, so the mean of its sampling distribution equals the parameter.
(b) (1 point) Method A has high variability (estimates spread widely); method B has low variability (estimates tightly clustered).
(c) (2 points) There is a trade-off: A is unbiased but imprecise, B is precise but biased low (1 point); one might prefer A for an honest center, or B if its bias is small and its precision makes estimates reliably close, depending on context (1 point).
Markers reward identifying the unbiased method by its centering, contrasting the variability, and discussing the bias-variability trade-off in context.
Related dot points
- Topic 5.1 Introducing Statistics: Why Is My Sample Not Like Yours? Recognize sampling variability, the difference between a parameter and a statistic, and that a statistic varies from sample to sample in a predictable way.
A focused answer to AP Statistics Topic 5.1, on sampling variability, the parameter-versus-statistic distinction, and why a statistic varies predictably from sample to sample, motivating the idea of a sampling distribution.
- Topic 5.3 The Central Limit Theorem: state and apply the central limit theorem, that the sampling distribution of the sample mean becomes approximately normal as the sample size grows, regardless of the population's shape.
A focused answer to AP Statistics Topic 5.3, on the central limit theorem, why the sample mean's distribution becomes normal as n grows regardless of population shape, the large-sample guideline, and its role in inference, with a worked application.
- Topic 5.5 Sampling Distributions for Sample Proportions: describe the mean, standard deviation, and shape of the sampling distribution of a sample proportion, and check the conditions (10% and large counts) for the normal model.
A focused answer to AP Statistics Topic 5.5, on the mean, standard deviation, and approximately normal shape of the sampling distribution of a sample proportion, the 10% and large-counts conditions, and finding probabilities, with full worked calculations.
- Topic 5.7 Sampling Distributions for Sample Means: describe the mean, standard deviation, and shape of the sampling distribution of a sample mean, using the central limit theorem and the standard deviation formula sigma over root n.
A focused answer to AP Statistics Topic 5.7, on the mean, standard deviation, and shape of the sampling distribution of a sample mean, the sigma-over-root-n formula, the conditions for normality, and finding probabilities, with full worked calculations.
- Topic 3.4 Potential Problems with Sampling: identify undercoverage, voluntary response, convenience, nonresponse, and response bias, explain how each distorts results, and recognize that bias is not reduced by a larger sample.
A focused answer to AP Statistics Topic 3.4, identifying undercoverage, voluntary response, convenience, nonresponse, and response bias, the direction each pushes results, and why bias persists no matter how large the sample.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)