Skip to main content
United StatesStatisticsSyllabus dot point

How do we find the long-run average and spread of a random variable from its distribution?

Topic 4.8 Mean and Standard Deviation of Random Variables: calculate and interpret the mean (expected value), variance, and standard deviation of a discrete random variable from its probability distribution.

A focused answer to AP Statistics Topic 4.8, on the expected value (mean), variance, and standard deviation of a discrete random variable, the weighted-average idea, and interpreting expected value as a long-run mean, with full worked calculations.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. The mean as a weighted average
  3. Variance and standard deviation
  4. Interpreting expected value correctly
  5. Why these parameters matter downstream
  6. Try this

What this topic is asking

The College Board (Topic 4.8) wants you to calculate and interpret the mean (expected value), variance, and standard deviation of a discrete random variable from its probability distribution, and to read the mean as a long-run average.

The mean as a weighted average

Expected value generalizes the ordinary average. An ordinary mean weights every data value equally; the expected value weights each possible value by how likely it is. So values that occur more often pull the mean toward them more strongly. The result is the value the long-run average of the random variable settles on, by the law of large numbers, exactly the "fair price" of a game or the average defects per item over a long production run.

Variance and standard deviation

The structure mirrors Unit 1's standard deviation but with probabilities as weights. You find the mean, take each value's deviation from it, square the deviation, weight by the probability, sum to get the variance, and square-root to get the standard deviation. The squaring (as in Unit 1) prevents positive and negative deviations from cancelling and emphasizes larger departures; the square root returns the spread to the original units so it is interpretable.

Interpreting expected value correctly

The most tested idea is what the expected value means. It is a long-run average, not a prediction of any single outcome, and it need not be an attainable value: a random variable that takes only whole numbers can have a fractional mean like 0.70.7 defects, which simply says that over many items the defects average 0.70.7 each. In a game of chance, the expected payout is the average you would win per play over a very long run, which is why it is the right notion of a "fair" stake: a game is fair if the expected net gain is 00. Misreading the expected value as "the most likely outcome" or "what will happen next" is a classic error, the expected value of a single die roll is 3.53.5, which never appears on any single roll. Holding the long-run-average meaning firmly lets you interpret expected value in any context, payouts, defects, waiting times, and it is the meaning the binomial mean npnp (Topic 4.11) and all later expected values inherit.

Why these parameters matter downstream

The mean and standard deviation of a random variable are the parameters that the rest of the course revolves around. Combining random variables (Topic 4.9) is entirely a set of rules for how means and standard deviations behave when variables are added or scaled. The binomial and geometric distributions come with shortcut formulas for exactly these two parameters. And in the sampling-distribution unit, a sample mean and a sample proportion are themselves random variables whose mean and standard deviation (the standard error) drive every confidence interval and significance test. So the calculations here, weighted-average mean, weighted squared-deviation variance, are not isolated arithmetic but the foundation of the inferential machinery to come. Mastering them now, and especially interpreting the expected value as a long-run mean, pays dividends across half the course.

Try this

Q1. XX has P(1)=0.4P(1) = 0.4, P(2)=0.4P(2) = 0.4, P(5)=0.2P(5) = 0.2. Find E(X)E(X). [2 points]

  • Cue. E(X)=1(0.4)+2(0.4)+5(0.2)=0.4+0.8+1.0=2.2E(X) = 1(0.4) + 2(0.4) + 5(0.2) = 0.4 + 0.8 + 1.0 = 2.2.

Q2. Explain why an expected value of 2.22.2 can be valid even though XX never equals 2.22.2. [1 point]

  • Cue. The expected value is a long-run average over many trials, not a single outcome, so it need not be a value XX can actually take.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). A game pays \5withprobability with probability 0.2and and \00 with probability 0.80.8. What is the expected payout? (A) \1.00(B) (B) \2.502.50 (C) \5.00(D) (D) \0.200.20
Show worked answer →

The correct answer is (A).

The expected value is the sum of value times probability: E(X) = 5(0.2) + 0(0.8) = 1.0 + 0 = \1.00$.

(B) is the midpoint, not weighted by probability. (C) is the maximum payout. (D) confuses probability with value. Weighting each payout by its probability gives an expected payout of \1.00$.

AP 2022 (style)4 marksSection II (free response). A random variable XX (number of defects per item) has P(0)=0.5P(0) = 0.5, P(1)=0.3P(1) = 0.3, P(2)=0.2P(2) = 0.2. (a) Find the mean (expected value) of XX and interpret it. (b) Find the standard deviation of XX. (c) Explain why the expected value 0.70.7 is not a value XX can actually take, and what it means.
Show worked answer →

A 4-point question on expected value and standard deviation.

(a) (2 points) μX=0(0.5)+1(0.3)+2(0.2)=0+0.3+0.4=0.7\mu_X = 0(0.5) + 1(0.3) + 2(0.2) = 0 + 0.3 + 0.4 = 0.7 (1 point); interpret: over many items, the average number of defects per item is about 0.70.7 (1 point, in context).
(b) (1 point) σX2=(0−0.7)2(0.5)+(1−0.7)2(0.3)+(2−0.7)2(0.2)=0.245+0.027+0.338=0.61\sigma_X^2 = (0-0.7)^2(0.5) + (1-0.7)^2(0.3) + (2-0.7)^2(0.2) = 0.245 + 0.027 + 0.338 = 0.61, so σX=0.61≈0.781\sigma_X = \sqrt{0.61} \approx 0.781.
(c) (1 point) The expected value is a long-run average, not a single outcome; an item has 00, 11, or 22 defects, but averaged over many items the mean is 0.70.7 defects per item.

Markers reward the correct expected value with a long-run interpretation, the variance and standard deviation, and the insight that the mean need not be an attainable value.

Related dot points

Sources & how we know this