What does the correlation coefficient measure, and why does correlation not prove causation?
Interpret the correlation coefficient of a linear fit and distinguish correlation from causation (LA A1: S-ID.C.8, S-ID.C.9).
A Louisiana LEAP 2025 Algebra I answer on correlation (LA A1: S-ID.C.8, C.9): the correlation coefficient r and what its sign and size mean, strength of fit, and why correlation does not imply causation.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
Standards A1: S-ID.C.8 and S-ID.C.9 ask you to interpret the correlation coefficient of a linear fit and to distinguish correlation from causation. On LEAP 2025 these are Type I and Type II items in the Additional and Supporting Content category. The calculator reports on the calculator sessions; you interpret its sign, size, and meaning.
What the correlation coefficient measures
The correlation coefficient summarizes how well a straight line fits a scatter plot. It lives between and .
So is a strong positive linear relationship, a strong negative one, and almost no linear relationship.
Reading r from a scatter plot
A loose, shapeless cloud would have near ; a perfectly straight line would have .
Correlation is not causation
A strong correlation means two variables move together, not that one causes the other. S-ID.C.9 stresses three reasons a correlation can mislead:
- Lurking variable: a hidden third factor drives both (ice cream sales and drownings both rise with hot weather).
- Coincidence: unrelated trends happen to track each other over a period.
- Reverse causation: the supposed effect actually drives the supposed cause.
To establish causation, you need a controlled experiment that isolates the variable, observational correlation alone cannot.
How LEAP examines this topic
- Multiple choice. Interpret the sign and size of , or match to a scatter plot.
- Type II reasoning. Explain why a correlation does not prove causation, naming a lurking variable.
- Equation response. Read or compare correlation coefficients from calculator output.
A clarifying idea: measures linear fit only. A scatter plot with a strong curved pattern can have an near , because the relationship, though real, is not linear.
Why correlation cannot establish causation
The gap between correlation and causation is one of the most important ideas in all of statistics, and S-ID.C.9 places it in Algebra I because it is so easily misused. Correlation is a purely descriptive measure: it quantifies how tightly two variables move together in the data you happened to observe. It says nothing about why they move together, and there are several distinct explanations that all produce the same strong . A lurking variable is the classic culprit: hot weather raises both ice cream sales and swimming (and thus drownings), so the two outcomes correlate strongly even though neither causes the other. Reverse causation can masquerade as causation in the wrong direction. And with enough variables compared, some will correlate by sheer coincidence. The only reliable way to establish that one variable causes another is a controlled experiment, where you deliberately change the suspected cause while holding other factors fixed and observe the effect, which observational data simply cannot do. This is why careful writing uses "associated with" rather than "causes," and why a headline claiming a cause from a mere correlation should be treated skeptically. Understanding this protects you from a pervasive error and is exactly the reasoning the standard wants you to articulate, often by naming a plausible lurking variable for a given correlation.
Try this
Q1. A linear fit has . Describe the relationship. [2 points]
- Cue. A strong positive linear relationship (close to , positive sign).
Q2. Shoe size and reading ability are positively correlated in children. Name a lurking variable. [1 point]
- Cue. Age: older children have bigger feet and read better; age drives both.
Exam-style practice questions
Practice questions written in the style of LDOE exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
LA LEAP 2025 Math (style)2 marksMultiple choice. A linear fit has a correlation coefficient of . What does this indicate? (A) a strong negative linear relationship (B) a weak negative relationship (C) a strong positive relationship (D) no relationshipShow worked answer →
The correct answer is (A).
The correlation coefficient ranges from to . The sign gives the direction: negative means as increases, tends to decrease. The size (closeness to or ) gives the strength: is close to , so the linear relationship is strong. Thus is a strong negative linear relationship. A value near would indicate little or no linear relationship.
LA LEAP 2025 Math (style)2 marksIce cream sales and drowning incidents are strongly positively correlated. Does this mean ice cream causes drownings? Explain.Show worked answer →
No. Correlation does not imply causation.
Both ice cream sales and drownings rise in hot summer weather, which is a lurking variable (a third factor) driving both. So the two are correlated without one causing the other. S-ID.C.9 is exactly this point: a strong correlation shows the variables move together, but a hidden common cause, coincidence, or reverse causation can explain it. Establishing causation requires a controlled experiment, not just a correlation.
Related dot points
- Fit a linear model to a scatter plot and interpret the slope and intercept in context, using the line to predict (LA A1: S-ID.B.6, S-ID.C.7).
A Louisiana LEAP 2025 Algebra I answer on scatter plots and linear models (LA A1: S-ID.B.6, C.7): describing association, fitting a line of best fit, interpreting its slope and intercept, and predicting with it.
- Represent data with dot plots, histograms, and box plots, and read the shape of a distribution from them (LA A1: S-ID.A.1).
A Louisiana LEAP 2025 Algebra I answer on representing data (LA A1: S-ID.A.1): dot plots, histograms, and box plots, the five-number summary behind a box plot, and reading shape, skew, and spread.
- Use measures of center (mean, median) and spread (range, interquartile range) to describe and compare data sets, and account for the effect of outliers (LA A1: S-ID.A.2, S-ID.A.3).
A Louisiana LEAP 2025 Algebra I answer on center and spread (LA A1: S-ID.A.2, A.3): mean versus median, range and interquartile range, comparing two data sets, and how outliers shift the mean.
- Summarize categorical data in a two-way frequency table and interpret joint, marginal, and conditional relative frequencies (LA A1: S-ID.B.5).
A Louisiana LEAP 2025 Algebra I answer on two-way frequency tables (LA A1: S-ID.B.5): reading rows and columns, the totals, and computing joint, marginal, and conditional relative frequencies.
- Distinguish linear, quadratic, and exponential functions by their rate of change and recognize that a quantity growing by a constant factor eventually exceeds one growing linearly (LA A1: F-LE.A.1, F-LE.A.3).
A Louisiana LEAP 2025 Algebra I answer on comparing function families (LA A1: F-LE.A.1, A.3): constant difference versus constant ratio versus constant second difference, and why exponential growth overtakes linear.
Sources & how we know this
- Louisiana Student Standards for Mathematics — Louisiana Department of Education (2025)
- LEAP 2025 Assessment Guide for Algebra I — Louisiana Department of Education (2025)