Why is the slope of a least-squares regression line a statistic with its own sampling distribution, and what does that allow us to infer?

Topic 9.1 Introducing Statistics: Do Those Points Align?: explain why a sample regression slope varies from sample to sample, motivating inference about the true population slope of a linear model.

A focused answer to AP Statistics Topic 9.1, on why a sample regression slope is a statistic that varies across samples, motivating confidence intervals and tests about the true population slope of a linear model.

Generated by Claude Opus 4.8 | 9 min answer | Updated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

What this topic is asking

The College Board (Topic 9.1) opens Unit 9 with the idea behind slope inference: the slope $b$ of a least-squares line fitted to a sample is a statistic that varies from sample to sample, so it estimates, but rarely equals, the true population slope $\beta$ . This motivates confidence intervals and tests about $\beta$ .

The sample slope is a statistic

Unit 2 fits least-squares lines descriptively; Unit 9 treats the slope as a quantity with sampling variability. Just as a sample mean $\bar{x}$ estimates $\mu$ and varies across samples, the sample slope $b$ estimates $\beta$ and varies across samples. Take a new random sample, refit the line, and you get a slightly different slope. The collection of all possible sample slopes forms a sampling distribution centered (under the model's conditions) at the true slope $\beta$.
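The idea can be seen in a small simulation. The sketch below is illustrative only: the population line (intercept 2, slope 0.5), the noise level, and the sample size of 30 are all made-up assumptions, not values from the course. It draws many random samples, refits the least-squares line each time, and summarizes the resulting slopes.

```python
import random
import statistics

def ls_slope(xs, ys):
    """Least-squares slope: sum((x - xbar)(y - ybar)) / sum((x - xbar)^2)."""
    x_bar = statistics.fmean(xs)
    y_bar = statistics.fmean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sxy / sxx

def sample_slope(rng, n, alpha, beta, sigma):
    """Draw one random sample from y = alpha + beta*x + noise and refit the line."""
    xs = [rng.uniform(0, 10) for _ in range(n)]
    ys = [alpha + beta * x + rng.gauss(0, sigma) for x in xs]
    return ls_slope(xs, ys)

rng = random.Random(1)   # fixed seed so the run is repeatable
TRUE_BETA = 0.5          # hypothetical population slope (an assumption)

# 2000 independent samples of size 30, one fitted slope per sample
slopes = [sample_slope(rng, n=30, alpha=2.0, beta=TRUE_BETA, sigma=3.0)
          for _ in range(2000)]

print("first five sample slopes:", [round(b, 3) for b in slopes[:5]])
print("mean of sample slopes:   ", round(statistics.fmean(slopes), 3))
print("SD of sample slopes:     ", round(statistics.stdev(slopes), 3))
```

Individual slopes differ from sample to sample, while their average should land close to the assumed `TRUE_BETA`. That is what "a sampling distribution centered at $\beta$" means in practice.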

Why a non-zero slope is not proof

A non-zero sample slope does not by itself show that $\beta \neq 0$. This is the central caution of the unit, and the reason inference is needed. The question "do those points align?" is really "is the observed slope farther from zero than sampling variability alone would typically produce if $\beta = 0$?" A slope small enough to be explained by chance is consistent with no linear relationship; a slope too far from zero to be plausibly due to chance is evidence of a real linear association. Distinguishing the two requires the sampling distribution of $b$, not just its observed value.
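A rough way to see this is to simulate a population in which $\beta = 0$ by construction and look at the slopes that chance alone produces. The sketch below assumes made-up values (samples of 20, noise standard deviation 3); the helper `share_at_least` is a hypothetical name introduced only for this illustration.

```python
import random
import statistics

def ls_slope(xs, ys):
    """Least-squares slope of y on x."""
    x_bar = statistics.fmean(xs)
    y_bar = statistics.fmean(ys)
    sxy = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys))
    sxx = sum((x - x_bar) ** 2 for x in xs)
    return sxy / sxx

rng = random.Random(7)
N, REPS = 20, 5000   # sample size and number of simulated samples (assumptions)

null_slopes = []
for _ in range(REPS):
    xs = [rng.uniform(0, 10) for _ in range(N)]
    ys = [5.0 + rng.gauss(0, 3.0) for _ in xs]   # y ignores x, so beta = 0
    null_slopes.append(ls_slope(xs, ys))

def share_at_least(observed):
    """Fraction of beta = 0 slopes at least as far from 0 as `observed`."""
    return sum(abs(b) >= abs(observed) for b in null_slopes) / REPS

print("share of slopes exactly 0:", sum(b == 0 for b in null_slopes) / REPS)
print("share with |b| >= 0.1:    ", share_at_least(0.1))
print("share with |b| >= 0.8:    ", share_at_least(0.8))
```

In this made-up setting essentially no sample gives a slope of exactly zero, small slopes are common, and slopes far from zero are rare. An observed slope is only persuasive when it falls in the rare region.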

The parameter and the tools, previewed

The parameter of interest is the population slope $\beta$ . Unit 9 builds the two familiar tools on it, mirroring Units 6 and 7.

Confidence interval for $\beta$ (Topics 9.2 to 9.3): estimate the true slope with $b \pm t^{*}(\text{standard error of } b)$ , and judge claims (including whether $0$ is plausible).
Significance test for $\beta$ (Topics 9.4 to 9.5): test $H_0: \beta = 0$ (no linear relationship) with a t-statistic and P-value.

Both use a $t$ -distribution with $n - 2$ degrees of freedom (two are spent estimating the intercept and slope), and both rely on regression conditions about the residuals. Topic 9.1 plants the idea that $b$ is a variable estimate of a fixed $\beta$ ; the later topics supply the machinery.
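For reference, the two tools can be written out as follows. This is a preview using the standard formulas for simple linear regression; the symbols $s$ (standard deviation of the residuals), $s_x$ (sample standard deviation of the explanatory variable), and $\text{SE}_b$ are notation introduced here and are developed in the later topics.

$$
s = \sqrt{\frac{\sum (y_i - \hat{y}_i)^2}{n - 2}}, \qquad \text{SE}_b = \frac{s}{s_x \sqrt{n - 1}}
$$

$$
\text{interval: } b \pm t^{*}\,\text{SE}_b, \qquad \text{test of } H_0: \beta = 0: \; t = \frac{b - 0}{\text{SE}_b}, \qquad \text{df} = n - 2
$$

The $n - 2$ in the denominator of $s$ is the same $n - 2$ that sets the degrees of freedom.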

The mindset for the unit

As in every inference unit, the key move is to see the observed slope as one draw from a distribution, not the truth. The fitted line summarizes one sample; the population line is fixed but unknown. Inference about $\beta$ is the disciplined way to ask whether an observed trend is real or could be a fluke of sampling, which is the precise meaning of "do those points align?"

Why a sample slope needs inference

An economist fits a least-squares line predicting spending from income on one random sample of $40$ households and gets a sample slope of $b = 0.32$ . Explain why this positive slope does not on its own confirm a real relationship, and what the economist should do.

Step 1: Identify the statistic and parameter

The statistic is the sample slope $b = 0.32$ , from one sample. The parameter is the true population slope $\beta$ of the line relating spending to income.

Step 2: Recognize the variability

Across all possible random samples of $40$ households, $b$ varies, forming a sampling distribution centered at $\beta$ . So $0.32$ is one draw; a different sample would give a different slope.

Step 3: See why it is not proof

Even if $\beta = 0$ (income and spending were truly unrelated), sampling variability could still produce a non-zero $b$ like $0.32$ in a given sample. A positive sample slope is therefore consistent both with a real relationship and with no relationship plus chance variation.

Step 4: State the remedy

The economist should carry out inference about $\beta$: build a confidence interval for $\beta$ or test $H_0: \beta = 0$. If $0$ is implausible (outside the interval, or a small P-value), there is convincing evidence of a real linear relationship between income and spending; if not, the observed slope could be chance.
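The remedy in Step 4 can be sketched in code. The example gives no raw data, so the sketch below uses simulated, made-up household figures (not the economist's sample) purely to show the mechanics. The function name `slope_inference` is hypothetical, and the critical value $t^{*} = 2.024$ for 95% confidence with $40 - 2 = 38$ degrees of freedom is taken from a $t$-table rather than computed.

```python
import math
import random
import statistics

def slope_inference(xs, ys, t_star):
    """Return the sample slope, its standard error, the t-statistic for
    H0: beta = 0, the degrees of freedom, and a confidence interval."""
    n = len(xs)
    x_bar, y_bar = statistics.fmean(xs), statistics.fmean(ys)
    sxx = sum((x - x_bar) ** 2 for x in xs)
    b = sum((x - x_bar) * (y - y_bar) for x, y in zip(xs, ys)) / sxx
    a = y_bar - b * x_bar                          # fitted intercept
    sse = sum((y - (a + b * x)) ** 2 for x, y in zip(xs, ys))
    s = math.sqrt(sse / (n - 2))                   # SD of the residuals
    se_b = s / math.sqrt(sxx)                      # same as s / (s_x * sqrt(n - 1))
    t_stat = (b - 0) / se_b
    ci = (b - t_star * se_b, b + t_star * se_b)
    return {"b": b, "se_b": se_b, "t": t_stat, "df": n - 2, "ci": ci}

# Simulated stand-in data: 40 households, income and spending in thousands.
# The generating slope 0.3 and noise SD 8 are assumptions for illustration.
rng = random.Random(40)
income = [rng.uniform(30, 120) for _ in range(40)]
spending = [10 + 0.3 * x + rng.gauss(0, 8) for x in income]

T_STAR = 2.024   # 95% critical value for df = 38, from a t-table
result = slope_inference(income, spending, T_STAR)
lo, hi = result["ci"]
zero_plausible = lo <= 0 <= hi

print(f"b = {result['b']:.3f}, SE = {result['se_b']:.3f}, df = {result['df']}")
print(f"t = {result['t']:.2f}")
print(f"95% CI for beta: ({lo:.3f}, {hi:.3f})")
print("0 is a plausible value for beta:", zero_plausible)
```

The interval and the test agree by construction: $0$ lies inside the interval exactly when $|t| \le t^{*}$. A P-value would additionally need the $t$-distribution with 38 degrees of freedom, which this sketch leaves to a table or calculator.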

Try this

Q1. What parameter does the sample slope $b$ estimate, and why does $b$ vary? [2 points]

Cue. $b$ estimates the population slope $\beta$ ; it varies because each random sample yields a different fitted line (sampling variability).

Q2. Why is a non-zero sample slope not proof of a relationship? [1 point]

Cue. Even if $\beta = 0$ , chance variation across samples can produce a non-zero $b$ ; inference is needed to rule that out.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style), 1 mark, Section I (multiple choice)

Two analysts each fit a least-squares line to a different random sample from the same population and get different slopes. This is best explained by

(A) one made an error
(B) the population slope changed
(C) sampling variability in the sample slope
(D) the data are not linear

The correct answer is (C).

A sample slope $b$ is a statistic estimating the true slope $\beta$ ; different samples give different slopes because of sampling variability. The slope has its own sampling distribution centered at $\beta$ .

(A) No error is needed to explain the difference. (B) The population slope $\beta$ is fixed. (D) The data can be linear and still give different sample slopes from different samples.

AP 2021 (style), 3 marks, Section II (free response)

A biologist fits a least-squares line predicting plant height from rainfall, using one random sample, and obtains a positive sample slope.

(a) Explain why a positive sample slope does not by itself prove that rainfall and height are truly related in the population.
(b) Identify the parameter the biologist should make inferences about.
(c) State, in general terms, how the biologist could decide whether the relationship is real.

A 3-point conceptual question.

(a) (1 point) The sample slope $b$ varies from sample to sample; even if the true slope $\beta$ were $0$ (no relationship), a single random sample could produce a non-zero slope by chance, so a positive $b$ alone is not proof of a real relationship.
(b) (1 point) The true population slope $\beta$ of the linear model relating height to rainfall.
(c) (1 point) Build a confidence interval for $\beta$ or run a significance test of $H_0: \beta = 0$ ; if $0$ is implausible (outside the interval, or a small P-value), there is evidence of a real linear relationship.

Markers reward recognizing sampling variability in $b$ , naming $\beta$ as the parameter, and naming a test or interval as the decision tool.

Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)