Skip to main content
OhioMathsSyllabus dot point

What does the correlation coefficient r tell you about a linear relationship, and why does correlation not prove causation?

Interpret the correlation coefficient of a linear fit and distinguish correlation from causation, recognizing lurking variables (Ohio S-ID.8, S-ID.9).

An Ohio Algebra I answer on correlation (S-ID.8, S-ID.9): what the correlation coefficient r measures, reading its sign and strength, and why a strong correlation does not prove one variable causes the other.

Generated by Claude Opus 4.811 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Reading the correlation coefficient
  3. Strength versus direction
  4. Correlation is not causation
  5. How Ohio examines this topic
  6. Why magnitude is strength and sign is direction
  7. Why correlation cannot prove causation
  8. Try this

What this topic is asking

Ohio standards S-ID.8 and S-ID.9 ask you to interpret the correlation coefficient rr of a linear fit and to distinguish correlation from causation. The coefficient rr measures how well a line fits the data; causation is a separate, stronger claim that data alone rarely proves. This is a frequent, high-yield Statistics item, and a classic reasoning trap.

Reading the correlation coefficient

A single number, rr, captures both the direction and the tightness of a linear trend.

So r=0.9r = 0.9 is a strong positive fit, r=0.2r = -0.2 a weak negative one, and r=0r = 0 no linear relationship. Algebra I computes rr with technology and focuses on interpreting it.

Strength versus direction

Keep the two pieces of information separate.

Correlation is not causation

This is the headline idea, and the most tested.

The classic example: ice-cream sales and drowning rise together, but hot weather (the lurking variable) drives both, neither causes the other.

How Ohio examines this topic

  • Multiple choice and multiple-select. Interpret the sign and strength of rr, or compare two values.
  • Reasoning items. Choose the best conclusion about a correlation, recognizing lurking variables and the correlation-causation distinction.
  • Numeric response. Match an rr value to a described scatter plot.

Why magnitude is strength and sign is direction

The correlation coefficient packs two separate facts into one number, and keeping them apart prevents the most common errors. The sign answers "which way does the trend go?", a positive rr goes with a line of positive slope (up-right), a negative rr with a line of negative slope (down-right). The magnitude r|r| answers "how closely do the points follow that line?", values near 11 mean the points cluster tightly around the line (strong), values near 00 mean they scatter widely (weak). Because these are independent, a strong relationship can be either positive or negative: r=0.9r = -0.9 is stronger than r=0.3r = 0.3 even though it is negative. Comparing strength means comparing r|r|, while the sign only tells direction, which is exactly the distinction reasoning items probe.

Why correlation cannot prove causation

The gap between correlation and causation is the deepest idea in the standard, and it rests on what observational data can and cannot rule out. A correlation tells you two variables move together, but movement-together is consistent with several stories: the first variable might cause the second, the second might cause the first, or a hidden third variable might cause both while neither affects the other. Observational data, where you simply record what happens, cannot distinguish these, because it does not control the other factors that might be responsible. Only a controlled experiment, which deliberately changes one variable while holding others fixed, can isolate cause and effect. This is why "correlation does not imply causation" is a rule rather than a slogan: a strong rr is genuine evidence of a relationship, but the explanation for that relationship requires more than the correlation itself.

Try this

Q1. Which is a stronger linear relationship, r=0.6r = 0.6 or r=0.8r = -0.8? [1 point]

  • Cue. Compare r|r|: 0.8>0.60.8 > 0.6, so r=0.8r = -0.8 is stronger.

Q2. Sleep and test scores are positively correlated. Does more sleep cause higher scores? [1 point]

  • Cue. Not necessarily; correlation alone does not prove causation (a lurking variable or reverse direction is possible).

Exam-style practice questions

Practice questions written in the style of ODEW exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

Ohio Algebra I EOC (style)2 marksMultiple choice. A correlation coefficient is r=0.95r = -0.95. What does this indicate? (A) a strong negative linear relationship (B) a weak negative relationship (C) a strong positive relationship (D) no relationship
Show worked answer →

The correct answer is (A).

The sign of rr gives the direction: negative means as xx increases, yy tends to decrease. The magnitude (how close r|r| is to 11) gives the strength: 0.95=0.95|{-0.95}| = 0.95 is very close to 11, so the linear relationship is strong. Together, r=0.95r = -0.95 indicates a strong negative linear relationship. A value near 00 would mean no linear relationship, and the sign would still tell the direction.

Ohio Algebra I EOC (style)2 marksMultiple choice. Ice-cream sales and drowning deaths are strongly correlated. What is the best conclusion? (A) a third variable (hot weather) likely affects both (B) ice cream causes drowning (C) drowning causes ice-cream sales (D) the correlation must be a calculation error
Show worked answer →

The correct answer is (A).

A strong correlation does not prove causation. Here a lurking variable, hot weather, plausibly drives both ice-cream sales and swimming (hence drowning), creating a correlation without either causing the other. Concluding that ice cream causes drowning (B) or the reverse (C) confuses correlation with causation. The correlation is real, not an error (D); it simply has a common-cause explanation.

Related dot points

Sources & how we know this