Skip to main content
United StatesStatisticsSyllabus dot point

How does a linear regression model predict one variable from another, and how do we interpret its slope and intercept?

Topic 2.6 Linear Regression Models: write, interpret, and use a least-squares regression equation to predict a response, interpreting the slope and intercept in context, and recognizing the danger of extrapolation.

A focused answer to AP Statistics Topic 2.6, on the form of a regression equation, interpreting slope and intercept in context, making predictions, and the danger of extrapolation, with a worked prediction and interpretation.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. The regression equation
  3. Interpreting the slope
  4. Interpreting the intercept
  5. Prediction and the danger of extrapolation
  6. Try this

What this topic is asking

The College Board (Topic 2.6) wants you to work with a least-squares regression model: write the prediction equation, interpret the slope and intercept in context, use the model to predict a response, and recognize the danger of extrapolation beyond the data.

The regression equation

The hat notation distinguishes the model's prediction from reality. An actual data point has an observed yy; the line gives y^\hat{y} for the same xx; the gap between them is the residual of the next topic. Writing y^\hat{y} rather than yy in your equation is a small notation habit the exam rewards and penalizes its absence.

Interpreting the slope

The slope is a rate of change, not a value. It must say "predicted" because the line gives averages, not guarantees, and it must name both variables' units. Two recurring errors are dropping "predicted" (which wrongly implies every individual changes by exactly bb) and reversing the variables (interpreting the slope as a change in xx per unit yy).

Interpreting the intercept

The intercept aa is the predicted response when x=0x = 0. Sometimes this is meaningful (if x=0x = 0 is a sensible, observed value), but very often it is not, because x=0x = 0 lies far outside the data or is physically impossible. A regression of weight on height has an intercept at height 00 cm, which is nonsense; you should interpret it as "the predicted weight when height is 00 is aa, but since no one is 00 cm tall this is an extrapolation and not meaningful." Saying this, rather than pretending the intercept is a real prediction, demonstrates the understanding the exam is checking.

Prediction and the danger of extrapolation

To predict, substitute an xx-value into the equation and compute y^\hat{y}. This is reliable only within the range of the observed data, where you have evidence the linear pattern holds. Extrapolation, predicting for an xx outside that range, is risky because there is no data to confirm that the relationship stays linear (or stays at all) out there. Ice-cream sales may rise linearly with temperature from 1010 to 3535 degrees, but predicting sales at 5050 degrees assumes a pattern you have never observed and that may break down (people might stay indoors). The exam frequently sets up an extrapolation trap, giving a model with a stated valid range and then asking for a prediction outside it; the expected answer computes the value if asked but flags that it is an unreliable extrapolation. This is the regression version of "know the limits of your data," and it connects back to Topic 1.1's theme of what data can and cannot tell you. Because slope and intercept interpretation, prediction, and the extrapolation caution together account for most regression marks on the exam, drilling the exact phrasing, "predicted," "per one-unit," units, and "outside the range of the data," turns these into reliable points.

Try this

Q1. A line predicting test score from hours studied is y^=40+8x\hat{y} = 40 + 8x. Interpret the slope. [2 points]

  • Cue. For each additional hour studied, the predicted test score increases by 88 points, on average.

Q2. Why is predicting outside the range of the data (extrapolation) unreliable? [1 point]

  • Cue. There is no data out there to confirm the linear pattern continues, so the prediction rests on an untested assumption and may be badly wrong.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). A regression line predicting weight (kg) from height (cm) is y^=50+0.6x\hat{y} = -50 + 0.6x. How is the slope 0.60.6 interpreted? (A) A person 00 cm tall weighs 0.60.6 kg (B) For each additional cm of height, predicted weight increases by 0.60.6 kg on average (C) Weight is 0.60.6 times height (D) The correlation is 0.60.6
Show worked answer →

The correct answer is (B).

The slope is the predicted change in the response per one-unit increase in the explanatory variable: for each extra cm of height, predicted weight rises by 0.60.6 kg, on average. Slope interpretation must include "predicted," "per one-unit increase," and units.

(A) describes the intercept (and meaninglessly here). (C) misreads the model as proportional. (D) confuses slope with correlation. The slope is a rate of change in the predicted response.

AP 2021 (style)4 marksSection II (free response). The least-squares line relating ice-cream sales (\,, y)totemperature(degreesC,) to temperature (degrees C, x)foroneshopis) for one shop is \hat{y} = 30 + 12x,validfortemperaturesfrom, valid for temperatures from 10to to 35degrees.(a)Interprettheslopeincontext.(b)Interprettheintercept,andcommentonwhetheritismeaningful.(c)Predictsalesat degrees. (a) Interpret the slope in context. (b) Interpret the intercept, and comment on whether it is meaningful. (c) Predict sales at 20degrees,andexplainwhypredictingsalesat degrees, and explain why predicting sales at 50$ degrees would be unwise.
Show worked answer →

A 4-point regression-interpretation question.

(a) (1 point) Slope: for each additional degree C of temperature, predicted ice-cream sales increase by $12, on average.
(b) (1 point) Intercept: at 00 degrees C, the model predicts \30insales;thisisanextrapolation(0isoutsidethedatarangeof30 in sales; this is an extrapolation (0 is outside the data range of 10to to 35$ degrees), so it is not a meaningful prediction, just where the line crosses the axis.
(c) (2 points) Prediction at 2020 degrees: \hat{y} = 30 + 12(20) = 30 + 240 = \270(1point).Predictingat (1 point). Predicting at 50degreesisunwisebecause degrees is unwise because 50isfaroutsidethedatarange( is far outside the data range (10to to 35$); this is extrapolation, and the linear pattern may not hold there (1 point).

Markers reward a slope interpretation with "predicted," "per one-unit," and units; an intercept interpretation noting the extrapolation; a correct prediction; and the extrapolation caution.

Related dot points

Sources & how we know this