Skip to main content
United StatesStatisticsSyllabus dot point

Why is the slope of a least-squares regression line a statistic with its own sampling distribution, and what does that allow us to infer?

Topic 9.1 Introducing Statistics: Do Those Points Align?: explain why a sample regression slope varies from sample to sample, motivating inference about the true population slope of a linear model.

A focused answer to AP Statistics Topic 9.1, on why a sample regression slope is a statistic that varies across samples, motivating confidence intervals and tests about the true population slope of a linear model.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. The sample slope is a statistic
  3. Why a non-zero slope is not proof
  4. The parameter and the tools, previewed
  5. The mindset for the unit
  6. Try this

What this topic is asking

The College Board (Topic 9.1) opens Unit 9 with the idea behind slope inference: the slope bb of a least-squares line fitted to a sample is a statistic that varies from sample to sample, so it estimates, but rarely equals, the true population slope β\beta. This motivates confidence intervals and tests about β\beta.

The sample slope is a statistic

Units 2 fit least-squares lines descriptively; Unit 9 treats the slope as a quantity with sampling variability. Just as a sample mean xˉ\bar{x} estimates μ\mu and varies across samples, the sample slope bb estimates β\beta and varies across samples. Take a new random sample, refit the line, and you get a slightly different slope. The collection of all possible sample slopes forms a sampling distribution centered (under the model's conditions) at the true slope β\beta.

Why a non-zero slope is not proof

This is the central caution of the unit, and the reason inference is needed. The question "do those points align?" is really "is the observed slope larger than sampling variability alone would typically produce if β=0\beta = 0?" A small, easily-explained-by-chance slope is consistent with no relationship; a slope too large to be chance is evidence of a real linear association. Distinguishing the two requires the sampling distribution of bb, not just its observed value.

The parameter and the tools, previewed

The parameter of interest is the population slope β\beta. Unit 9 builds the two familiar tools on it, mirroring Units 6 and 7.

  1. Confidence interval for β\beta (Topics 9.2 to 9.3): estimate the true slope with b±t(standard error of b)b \pm t^{*}(\text{standard error of } b), and judge claims (including whether 00 is plausible).
  2. Significance test for β\beta (Topics 9.4 to 9.5): test H0:β=0H_0: \beta = 0 (no linear relationship) with a t-statistic and P-value.

Both use a tt-distribution with n2n - 2 degrees of freedom (two are spent estimating the intercept and slope), and both rely on regression conditions about the residuals. Topic 9.1 plants the idea that bb is a variable estimate of a fixed β\beta; the later topics supply the machinery.

The mindset for the unit

As in every inference unit, the key move is to see the observed slope as one draw from a distribution, not the truth. The fitted line summarizes one sample; the population line is fixed but unknown. Inference about β\beta is the disciplined way to ask whether an observed trend is real or could be a fluke of sampling, the precise meaning of "do those points align?"

Try this

Q1. What parameter does the sample slope bb estimate, and why does bb vary? [2 points]

  • Cue. bb estimates the population slope β\beta; it varies because each random sample yields a different fitted line (sampling variability).

Q2. Why is a non-zero sample slope not proof of a relationship? [1 point]

  • Cue. Even if β=0\beta = 0, chance variation across samples can produce a non-zero bb; inference is needed to rule that out.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). Two analysts each fit a least-squares line to a different random sample from the same population and get different slopes. This is best explained by (A) one made an error (B) the population slope changed (C) sampling variability in the sample slope (D) the data are not linear
Show worked answer →

The correct answer is (C).

A sample slope bb is a statistic estimating the true slope β\beta; different samples give different slopes because of sampling variability. The slope has its own sampling distribution centered at β\beta.

(A) needs no error. (B) the population slope β\beta is fixed. (D) the data can be linear and still give different sample slopes from different samples.

AP 2021 (style)3 marksSection II (free response). A biologist fits a least-squares line predicting plant height from rainfall, using one random sample, and obtains a positive sample slope. (a) Explain why a positive sample slope does not by itself prove that rainfall and height are truly related in the population. (b) Identify the parameter the biologist should make inferences about. (c) State, in general terms, how the biologist could decide whether the relationship is real.
Show worked answer →

A 3-point conceptual question.

(a) (1 point) The sample slope bb varies from sample to sample; even if the true slope β\beta were 00 (no relationship), a single random sample could produce a non-zero slope by chance, so a positive bb alone is not proof of a real relationship.
(b) (1 point) The true population slope β\beta of the linear model relating height to rainfall.
(c) (1 point) Build a confidence interval for β\beta or run a significance test of H0:β=0H_0: \beta = 0; if 00 is implausible (outside the interval, or a small P-value), there is evidence of a real linear relationship.

Markers reward recognizing sampling variability in bb, naming β\beta as the parameter, and naming a test or interval as the decision tool.

Related dot points

Sources & how we know this