Skip to main content
New YorkMathsSyllabus dot point

How do you fit and interpret a regression model, and how do residual plots help you choose the right one?

Fit linear, exponential, and other regression models to data; interpret the parameters and the correlation coefficient in context; use a residual plot to judge whether a model is appropriate; and use a model to make predictions.

A NY Regents Algebra II answer on regression: fitting linear and exponential models, interpreting parameters and the correlation coefficient, reading a residual plot to judge model fit, and making predictions.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Fitting a regression model
  3. The correlation coefficient
  4. Residual plots
  5. Predicting and reasoning for credit
  6. Try this

What this topic is asking

The Regents Algebra II exam (the Interpreting Data, S-ID, and Making Inferences, S-IC, clusters) wants you to fit regression models (linear, exponential, and others), interpret their parameters and the correlation coefficient in context, use a residual plot to judge whether a model is appropriate, and use a model to make predictions. This extends the Algebra I line of best fit to nonlinear models and adds the residual-plot diagnostic.

Fitting a regression model

Regression finds the function that best fits a data set. Algebra II extends beyond the linear case to exponential and other models, all produced by a graphing calculator's regression functions. You interpret the parameters in context just as with a line of best fit:

  • Linear y^=mx+b\hat{y} = mx + b: slope mm is the predicted change in yy per unit of xx; intercept bb is the predicted value at x=0x = 0.
  • Exponential y^=abx\hat{y} = ab^x: aa is the initial value and bb is the growth factor (decay if 0<b<10 < b < 1).

The correlation coefficient

The correlation coefficient rr measures how well a linear model fits.

So r=0.95r = -0.95 is a strong negative linear relationship, and r=0.1r = 0.1 is essentially no linear relationship. As in Algebra I, a strong rr supports prediction but never proves causation.

Residual plots

A residual is actual minus predicted, and a residual plot graphs these residuals against the xx-values. It is the key diagnostic for whether a model is the right type.

Predicting and reasoning for credit

Once a model is chosen, predict by substituting an xx-value into the equation. A clarifying point worth stressing is that the residual plot, not just rr, decides the model type: a fairly high rr can still accompany a curved residual pattern that reveals a linear model is wrong for nonlinear data. The Regents commonly pairs a prediction (substitute and evaluate) with an explanation of how a residual plot supports the chosen model, and the explanation credits require describing the pattern (random scatter good, systematic pattern bad). A second habit is to keep interpreting the correlation coefficient correctly: magnitude is strength, sign is direction, and correlation is never causation.

Try this

Q1. A regression has r=0.88r = 0.88. Describe the relationship. [1 credit]

  • Cue. Strong positive linear relationship (magnitude near 1, positive sign).

Q2. An exponential model is y^=20(1.5)x\hat{y} = 20(1.5)^x. Predict yy at x=3x = 3. [2 credits]

  • Cue. 20(1.5)3=20(3.375)=67.520(1.5)^3 = 20(3.375) = 67.5.

Exam-style practice questions

Practice questions written in the style of NYSED exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Regents (style)2 marksPart I (multiple choice). A regression gives a correlation coefficient of r=0.95r = -0.95. What does this indicate? (1) a weak positive relationship (2) a strong negative relationship (3) no relationship (4) a strong positive relationship
Show worked answer →

The correct answer is (2).

The correlation coefficient rr ranges from 1-1 to 11. A value of 0.95-0.95 is close to 1-1, indicating a strong relationship, and the negative sign means it is negative (as one variable increases, the other decreases). The magnitude near 1 signals strength; the sign signals direction.

Regents (style)4 marksPart III (constructed response). A bacteria population over hours is modeled by the exponential regression y=50(1.8)xy = 50(1.8)^x. (a) Predict the population at hour 4, to the nearest whole number. (b) Explain how a residual plot would help you decide whether an exponential model fits better than a linear one.
Show worked answer →

A 4-credit question: credit for the prediction and the residual-plot reasoning.

(a) y=50(1.8)4=50(10.4976)525y = 50(1.8)^4 = 50(10.4976) \approx 525 bacteria.
(b) A residual plot graphs the residuals (actual minus predicted) against xx. If the exponential model fits well, the residuals scatter randomly around zero with no pattern. If a linear model were forced on exponential data, the residual plot would show a clear curved pattern, signalling a poor fit. So a patternless residual plot supports the exponential model. Giving the prediction without addressing the residual-plot pattern loses the explanation credits.

Related dot points

Sources & how we know this