Skip to main content
North CarolinaMaths

NC Math 1: a complete guide to descriptive statistics

A deep-dive NC Math 1 EOC guide to descriptive statistics (Statistics and Probability, about 18 to 20 percent of the test). Covers representing one-variable data with dot plots, histograms, and box plots, comparing center and spread, two-way frequency tables, scatter plots and lines of best fit, and the correlation coefficient with the correlation-versus-causation caution.

Generated by Claude Opus 4.814 min readNC.M1.S-ID

Reviewed by: AI editorial process; not yet individually human-reviewed

Jump to a section
  1. What this strand demands
  2. Representing one-variable data
  3. Comparing center and spread
  4. Two-way frequency tables
  5. Scatter plots and lines of best fit
  6. Correlation and causation
  7. How this strand is examined
  8. Check your knowledge

What this strand demands

This guide covers descriptive statistics on the NC Math 1 EOC, drawing on Interpreting Categorical and Quantitative Data (NC.M1.S-ID). The Statistics and Probability category is about 18 to 20 percent of the test, a reliable block that rewards careful reading of displays and clear interpretation. Each dot-point page has its own practice: representing data distributions, comparing center and spread, two-way frequency tables, scatter plots and linear models, and correlation and causation.

Representing one-variable data

Three displays show data on a single variable. A dot plot stacks a dot per value; a histogram groups values into bins and shows frequency; a box plot displays the five-number summary (minimum, Q1Q_1, median, Q3Q_3, maximum) with the box spanning the IQR. Shape is symmetric, skewed right (high-value tail), or skewed left (low-value tail), and may include outliers.

Comparing center and spread

Center is the mean (sensitive to outliers) or median (resistant). Spread is the range (max minus min) or IQR (Q3βˆ’Q1Q_3 - Q_1, resistant). For skewed data or outliers, prefer the median and IQR. To compare two data sets, report both a center and a spread using the same measures for each.

Two-way frequency tables

A two-way table cross-classifies two categorical variables. A joint frequency is one inner cell over the grand total; a marginal frequency is a margin total over the grand total; a conditional frequency divides a cell by its row or column total ("of those who..."). Comparing conditional frequencies across groups reveals association.

Scatter plots and lines of best fit

A scatter plot shows two numerical variables. Describe the direction (positive or negative), form (linear or not), and strength. Fit a line of best fit y=mx+by = mx + b to predict; interpret the slope as the predicted change in yy per unit of xx and the y-intercept as the predicted yy at x=0x = 0.

Correlation and causation

The correlation coefficient rr (from βˆ’1-1 to 11) measures the strength and direction of a linear relationship: sign for direction, size for strength. A strong correlation does not prove causation, a lurking variable, reverse causation, or coincidence can explain it. A residual (observed minus predicted) checks fit: small random residuals mean a good line.

How this strand is examined

  • Gridded response. Compute a mean, median, IQR, relative frequency, or prediction. Exact-match scoring.
  • Multiple choice and multiple select. Identify shape, choose a measure, interpret slope or rr, or spot a correlation-causation error.
  • Technology-enhanced. Build a box plot or table, or match scatter plots to descriptions.

Check your knowledge

Work these as you would for credit on the EOC.

  1. A five-number summary is 5,8,11,16,255, 8, 11, 16, 25. Find the IQR. (1 point)
  2. Find the mean and median of 3,4,4,5,393, 4, 4, 5, 39, and say which is more representative. (2 points)
  3. A histogram has a long tail toward high values. Name the shape. (1 point)
  4. Of 6060 students, 2424 play music and also a sport. What is this joint relative frequency? (1 point)
  5. Of 3030 musicians, 2424 play a sport. What is this conditional relative frequency? (1 point)
  6. A line of best fit is y=5x+12y = 5x + 12. Interpret the slope. (1 point)
  7. Using y=2x+8y = 2x + 8, predict yy when x=6x = 6. (1 point)
  8. What does r=βˆ’0.92r = -0.92 indicate? (1 point)

Sources & how we know this

  • mathematics
  • nc-eoc
  • nc-math-1
  • statistics
  • data-displays
  • scatter-plots
  • correlation