Skip to main content
OhioMaths

Ohio Algebra I: a complete guide to statistics and probability

A deep-dive Ohio Algebra I guide to statistics and probability, a smaller but reliable reporting category. Covers representing data with dot plots, histograms, and box plots, comparing center and spread, two-way frequency tables, scatter plots and lines of best fit, and the correlation coefficient with the correlation-causation distinction.

Generated by Claude Opus 4.816 min readS-ID.1, S-ID.2, S-ID.3, S-ID.5, S-ID.6, S-ID.7, S-ID.8, S-ID.9

Reviewed by: AI editorial process; not yet individually human-reviewed

Jump to a section
  1. What this category demands
  2. Representing data
  3. Center and spread
  4. Two-way frequency tables
  5. Scatter plots, lines of best fit, and correlation
  6. How this category is examined
  7. Check your knowledge

What this category demands

This guide covers statistics and probability, a smaller but reliable Ohio Algebra I reporting category drawn entirely from S-ID (interpreting categorical and quantitative data). It rewards reading data, computing summaries, and reasoning carefully about relationships. Each dot-point page has its own practice: representing data distributions, comparing center and spread, two-way frequency tables, scatter plots and linear models, and correlation and causation.

Representing data

Show one-variable data with a dot plot (every value, small sets), a histogram (interval counts, shows shape), or a box plot (the five-number summary). Describe shape by the tail: a long tail right is skewed right, left is skewed left, matching tails are symmetric.

Center and spread

Center: the mean (sum over count) or the median (middle value). Spread: the range (max minus min), the IQR (Q3Q1Q3 - Q1, middle 50%50\%), or informally standard deviation. The mean and range are sensitive to outliers; the median and IQR are resistant.

Two-way frequency tables

A two-way table cross-classifies two categorical variables. Relative frequencies differ by denominator: joint (cell over grand total), marginal (row/column total over grand total), conditional (cell over its row/column total). Compare conditional relative frequencies to judge association.

Scatter plots, lines of best fit, and correlation

A scatter plot plots paired data; a line of best fit y^=mx+b\hat{y} = mx + b summarizes a linear trend, with slope as a rate and intercept as a baseline. The correlation coefficient rr (1-1 to 11) gives direction (sign) and strength (r|r| near 11 strong, near 00 weak). Correlation is not causation: a lurking variable may drive both.

How this category is examined

  • Numeric response. Compute a summary statistic (mean, median, IQR), a relative frequency, or a prediction.
  • Multiple choice and multiple-select. Describe shape, choose a resistant measure, interpret rr, or pick the best conclusion about a correlation.
  • Tables and graphs. Complete a two-way table, build a box plot, or read a scatter plot.

Check your knowledge

Work these as you would for credit on the Ohio test.

  1. Find the mean and median of 3,5,5,7,203, 5, 5, 7, 20. (2 points)
  2. A distribution has a long tail to the left. Name its shape. (1 point)
  3. Find the range and IQR of 4,6,9,10,13,154, 6, 9, 10, 13, 15. (2 points)
  4. Of 4040 apartment dwellers, 2424 own a pet. What is that conditional relative frequency? (1 point)
  5. A line of best fit is y^=2x+50\hat{y} = -2x + 50. Interpret the slope. (2 points)
  6. Predict yy from y^=3x+8\hat{y} = 3x + 8 when x=9x = 9. (1 point)
  7. Which is a stronger linear relationship, r=0.7r = 0.7 or r=0.9r = -0.9? (1 point)

Sources & how we know this

  • mathematics
  • oh-eoc
  • algebra-i
  • statistics
  • data
  • correlation