How do you fit a line to data, and how do you interpret its slope and intercept and use it to predict?
Fit a line of best fit (linear regression) to two-variable data, interpret the slope and y-intercept in context, and use the line to make predictions (A.DSR, Data and Statistical Reasoning).
A Georgia Milestones Algebra: Concepts & Connections answer on lines of best fit and linear regression, interpreting the slope as a rate and the y-intercept as a starting value in context, using the line to predict, and the difference between interpolation and extrapolation.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
This Data and Statistical Reasoning (A.DSR) standard asks you to fit a line to two-variable data (a line of best fit, found by linear regression), to interpret its slope and y-intercept in context, and to use it to predict. The Georgia Milestones EOC tests interpretation heavily, because a fitted line is only useful if you can say what its slope and intercept mean for the situation, and it tests prediction by substituting an -value. It connects directly to the linear-function work: a line of best fit is a linear model, written (or ).
What a line of best fit is
A line of best fit is the straight line that comes closest to the points in a scatterplot, summarizing a linear association. It is computed by linear regression, which finds the line minimizing the overall distance from the points (technically the sum of squared vertical distances). On the EOC and its calculator tool, you are usually given the line or generate it with technology, then interpret and use it. The fitted value is written ("y-hat") to distinguish a prediction from an actual data value.
Interpreting the slope and intercept
Because a line of best fit is a linear model, its parameters carry the same meanings as any line, stated in context.
- Slope : the predicted change in for each one-unit increase in , a rate. For (hours studied versus score), the slope 4 means each extra hour is associated with about a 4-point higher predicted score.
- y-intercept : the predicted when . Here, 60 is the predicted score for 0 hours studied.
Making predictions
To predict, substitute an -value into the line.
Interpolation versus extrapolation
Predictions are trustworthy only near the data.
- Interpolation: predicting within the range of the observed -values. Generally reliable.
- Extrapolation: predicting far outside the range. Risky, because the linear trend may not continue.
If the data covered 0 to 8 hours, predicting at 6 hours is interpolation (safe), but predicting at 40 hours is extrapolation (the model likely breaks down).
How the Milestones examines this topic
- Multiple choice. Identify what the slope or y-intercept of a fitted line represents.
- Numeric entry. Predict a -value by substituting an -value into the line.
- Constructed response. Interpret slope and intercept in context and make a prediction, noting reliability.
Why "associated with," not "causes"
The careful wording of a regression interpretation is itself tested, and the reason is important. A line of best fit describes an association in the data, not a cause-and-effect mechanism. Saying the slope means each extra hour "is associated with" a 4-point higher score is accurate; saying it "causes" a 4-point increase claims more than the data support, because other factors (prior knowledge, sleep, motivation) could be driving both. This is the same correlation-is-not-causation caution covered in the next topic, and the EOC enforces it by rewarding hedged language ("predicted," "associated with," "on average") and penalizing causal overstatement. Building the habit of describing what the line predicts, rather than what it makes happen, keeps your interpretations defensible.
Why extrapolation is dangerous
Extrapolation fails because a line of best fit is only known to summarize the trend where there is data. Beyond that range, nothing guarantees the relationship stays linear: a plant's height might grow linearly for a few weeks but then level off, so a line fitted to early data would absurdly predict a giant plant after a year. The model has no information about what happens outside the observed values, so a prediction far beyond the data is an unjustified guess. On the EOC, recognizing that a prediction at an extreme -value is extrapolation, and therefore unreliable, is a common reasoning point, and it is why interpolation (staying within the data) is the safe kind of prediction.
Try this
Q1. For (years experience versus salary in thousands), interpret the slope. [1 point]
- Cue. Each additional year of experience is associated with about a $3000 higher predicted salary.
Q2. Data cover from 1 to 10. Is predicting at interpolation or extrapolation? [1 point]
- Cue. Extrapolation (far outside the data), so unreliable.
Exam-style practice questions
Practice questions written in the style of GaDOE exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
Milestones (style)2 marksConstructed response. A line of best fit for hours studied versus test score is . Interpret the slope and the y-intercept, and predict the score for 6 hours.Show worked answer →
Slope 4: each additional hour studied is associated with about a 4-point higher score. y-intercept 60: the predicted score for 0 hours studied is 60. Prediction for 6 hours: .
The slope is the rate (points per hour) and the y-intercept is the predicted value at . Substitute to predict. Full credit needs both interpretations in context plus the predicted value.
Milestones (style)1 marksMultiple choice. A line of best fit is . What does the slope 2.5 represent? (A) the starting value (B) the predicted increase in per unit increase in (C) the correlation (D) the y-interceptShow worked answer →
The correct answer is (B).
The slope of a line of best fit is the predicted change in for each one-unit increase in . Here, is predicted to increase by 2.5 for every increase of 1 in . The starting value and the y-intercept (option A and D describe the same thing, the constant 10), and the correlation is a separate measure.
Related dot points
- Represent two-variable quantitative data with scatterplots and describe the association by its form, direction, strength, and any outliers (A.DSR, Data and Statistical Reasoning).
A Georgia Milestones Algebra: Concepts & Connections answer on scatterplots and two-variable quantitative data, describing the association by its form (linear or nonlinear), direction (positive or negative), strength, and outliers or clusters.
- Interpret the correlation coefficient, distinguish correlation from causation, and use residuals and a residual plot to judge how well a linear model fits (A.DSR, Data and Statistical Reasoning).
A Georgia Milestones Algebra: Concepts & Connections answer on the correlation coefficient r, why correlation does not imply causation, computing a residual as actual minus predicted, and reading a residual plot to judge whether a linear model is appropriate.
- Represent one-variable quantitative data with dot plots, histograms, and box plots, and describe the shape of a distribution (A.DSR, Data and Statistical Reasoning).
A Georgia Milestones Algebra: Concepts & Connections answer on displaying one-variable quantitative data with dot plots, histograms, and box plots, reading the five-number summary from a box plot, and describing the shape of a distribution as symmetric, skewed, or having outliers.
- Compute and interpret measures of center (mean, median) and spread (range, IQR, standard deviation), and compare two distributions using center, spread, and shape (A.DSR, Data and Statistical Reasoning).
A Georgia Milestones Algebra: Concepts & Connections answer on measures of center (mean and median) and spread (range, IQR, standard deviation), choosing mean or median based on skew and outliers, and comparing two distributions by their center, spread, and shape.
Sources & how we know this
- Georgia's K-12 Mathematics Standards (Algebra: Concepts & Connections) — Georgia Department of Education (2023)
- Georgia Milestones Assessment System — Georgia Department of Education (2024)