
How do we find the long-run average and spread of a random variable from its distribution?

Topic 4.8 Mean and Standard Deviation of Random Variables: calculate and interpret the mean (expected value), variance, and standard deviation of a discrete random variable from its probability distribution.

A focused answer to AP Statistics Topic 4.8, on the expected value (mean), variance, and standard deviation of a discrete random variable, the weighted-average idea, and interpreting expected value as a long-run mean, with full worked calculations.

Generated by Claude Opus 4.8 · 10 min answer · Updated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed


Jump to a section

What this topic is asking
The mean as a weighted average
Variance and standard deviation
Interpreting expected value correctly
Why these parameters matter downstream
Try this

What this topic is asking

The College Board (Topic 4.8) wants you to calculate and interpret the mean (expected value), variance, and standard deviation of a discrete random variable from its probability distribution, and to read the mean as a long-run average.

The mean as a weighted average

Expected value generalizes the ordinary average. An ordinary mean weights every data value equally; the expected value weights each possible value by its probability, so more likely values pull the mean toward them more strongly. By the law of large numbers, the long-run average of the random variable settles on this value. That is why it gives the "fair price" of a game, or the average number of defects per item over a long production run.
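In symbols, the weighted-average idea can be sketched as follows. The notation is an assumption introduced here for illustration: $x_1, \dots, x_k$ are the possible values of $X$ and $p_i = P(X = x_i)$ are their probabilities.

$$\mu_X = E(X) = \sum_{i=1}^{k} x_i \, p_i = x_1 p_1 + x_2 p_2 + \cdots + x_k p_k$$

When every $p_i$ equals $1/k$, this reduces to the ordinary mean of the $k$ values. For example, a fair six-sided die has $\mu_X = (1 + 2 + 3 + 4 + 5 + 6)/6 = 3.5$.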

Variance and standard deviation

The structure mirrors Unit 1's standard deviation but with probabilities as weights. You find the mean, take each value's deviation from it, square the deviation, weight by the probability, sum to get the variance, and square-root to get the standard deviation. The squaring (as in Unit 1) prevents positive and negative deviations from cancelling and emphasizes larger departures; the square root returns the spread to the original units so it is interpretable.
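The steps above can be sketched in a few lines of Python. This is a minimal illustration, not an official method: the function name `mean_var_sd` and the dictionary layout (value mapped to probability) are assumptions made for this example. The distribution used is the defects-per-item distribution from the practice question in this guide.

```python
import math

def mean_var_sd(dist):
    """Return (mean, variance, standard deviation) of a discrete random variable.

    `dist` maps each possible value to its probability (hypothetical layout).
    """
    total = sum(dist.values())
    if not math.isclose(total, 1.0):
        raise ValueError(f"probabilities sum to {total}, not 1")
    # Mean: each value weighted by its probability
    mu = sum(x * p for x, p in dist.items())
    # Variance: squared deviation from the mean, weighted by probability
    var = sum((x - mu) ** 2 * p for x, p in dist.items())
    # Standard deviation: square root returns the spread to the original units
    return mu, var, math.sqrt(var)

# Defects per item: 0, 1, or 2 defects
defects = {0: 0.5, 1: 0.3, 2: 0.2}
mu, var, sd = mean_var_sd(defects)
print(f"mean = {mu:.3f}, variance = {var:.3f}, sd = {sd:.3f}")
# prints: mean = 0.700, variance = 0.610, sd = 0.781
```

Checking that the probabilities sum to 1 before computing anything catches the most common data-entry slip when copying a distribution table.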

Interpreting expected value correctly

The most tested idea is what the expected value means. It is a long-run average, not a prediction of any single outcome, and it need not be an attainable value: a random variable that takes only whole numbers can have a fractional mean like $0.7$ defects, which simply says that over many items the defects average $0.7$ each. In a game of chance, the expected payout is the average you would win per play over a very long run, which is why it is the right notion of a "fair" stake: a game is fair if the expected net gain is $0$. Misreading the expected value as "the most likely outcome" or "what will happen next" is a classic error: the expected value of a single die roll is $3.5$, which never appears on any single roll. Holding the long-run-average meaning firmly lets you interpret expected value in any context (payouts, defects, waiting times), and it is the meaning the binomial mean $np$ (Topic 4.11) and all later expected values inherit.
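A small simulation can make the long-run-average reading concrete. The sketch below is illustrative only: the function name `average_of_die_rolls`, the seed, and the roll counts are arbitrary choices made for this example. It rolls a fair die many times and reports the average, which should drift toward $3.5$ as the number of rolls grows even though no single roll is ever $3.5$.

```python
import random

def average_of_die_rolls(n_rolls, seed=0):
    """Average of `n_rolls` simulated rolls of a fair six-sided die."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    total = 0
    for _ in range(n_rolls):
        total += rng.randint(1, 6)  # each face 1..6 equally likely
    return total / n_rolls

# Short runs can sit far from 3.5; long runs settle close to it
for n in (10, 1_000, 100_000):
    print(f"{n:>7} rolls: average = {average_of_die_rolls(n):.3f}")
```

The exact averages depend on the seed, so none are quoted here; the point is the pattern of the average tightening around $3.5$ as the run gets longer.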

Why these parameters matter downstream

The mean and standard deviation of a random variable are the parameters that the rest of the course revolves around. Combining random variables (Topic 4.9) is entirely a set of rules for how means and standard deviations behave when variables are added or scaled. The binomial and geometric distributions come with shortcut formulas for exactly these two parameters. And in the sampling-distribution unit, a sample mean and a sample proportion are themselves random variables whose mean and standard deviation (the standard error) drive every confidence interval and significance test. So the calculations here (the weighted-average mean and the weighted squared-deviation variance) are not isolated arithmetic but the foundation of the inferential machinery to come. Mastering them now, and especially interpreting the expected value as a long-run mean, pays dividends across half the course.

Expected value and standard deviation of a game

A carnival game has net winnings $X$ (in dollars) per play, with $P(X = 10) = 0.1$, $P(X = 2) = 0.3$, and $P(X = -1) = 0.6$. (a) Find the expected winnings and interpret. (b) Find the standard deviation. (c) Comment on whether the game favors the player.

step 1 Expected value (part a)

$\mu_X = 10(0.1) + 2(0.3) + (-1)(0.6) = 1.0 + 0.6 - 0.6 = 1.0$. So $E(X) = \$1.00$: over many plays the player averages a $\$1$ gain per play.

step 2 Variance (part b)

Squared deviations from $\mu_X = 1$ , weighted by probability:

$$\sigma_X^2 = (10-1)^2(0.1) + (2-1)^2(0.3) + (-1-1)^2(0.6)$$

$$= 81(0.1) + 1(0.3) + 4(0.6) = 8.1 + 0.3 + 2.4 = 10.8.$$
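As a cross-check, the same variance can be reached through the equivalent shortcut $\sigma_X^2 = E(X^2) - \mu_X^2$. This identity is not part of the working above; it is offered here only as a way to verify the arithmetic.

$$E(X^2) = 10^2(0.1) + 2^2(0.3) + (-1)^2(0.6) = 10 + 1.2 + 0.6 = 11.8$$

$$\sigma_X^2 = E(X^2) - \mu_X^2 = 11.8 - 1^2 = 10.8$$

Both routes give $10.8$, which suggests the squared deviations were computed correctly.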

step 3 Standard deviation

$\sigma_X = \sqrt{10.8} \approx 3.29$. So individual outcomes typically fall about $\$3.29$ from the mean of $\$1$, reflecting the wide range from $-\$1$ to $\$10$.

step 4 Interpret

The expected winnings are $\$1$ per play, so the game **favors the player** on average (a fair game would have $E(X) = 0$). But the large standard deviation of about $\$3.29$ shows single plays are very variable: most plays lose $\$1$, with occasional larger wins driving the positive average. The mean is a long-run figure, not a guarantee for any single play.
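The contrast between a positive long-run average and a typical losing play can be checked by simulating the game. This is a rough sketch under stated assumptions: the function name `simulate_game`, the seed, and the number of plays are all choices made for this example, and the outcomes are drawn with the standard library's `random.choices`.

```python
import random

# Net winnings and their probabilities, from the worked example
values = [10, 2, -1]
probs = [0.1, 0.3, 0.6]

def simulate_game(n_plays, seed=1):
    """Return (average winnings per play, share of plays that lose $1)."""
    rng = random.Random(seed)  # fixed seed so the run is repeatable
    plays = rng.choices(values, weights=probs, k=n_plays)
    average = sum(plays) / n_plays
    share_losing = plays.count(-1) / n_plays
    return average, share_losing

average, share_losing = simulate_game(100_000)
print(f"average winnings per play: {average:.2f}")
print(f"share of plays losing $1:  {share_losing:.2f}")
```

Over a long run the average should land near the expected value of $1$ while the share of losing plays stays near $0.6$, so a player who loses on most individual plays can still come out ahead on average.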

Try this

Q1. $X$ has $P(1) = 0.4$ , $P(2) = 0.4$ , $P(5) = 0.2$ . Find $E(X)$ . [2 points]

Cue. $E(X) = 1(0.4) + 2(0.4) + 5(0.2) = 0.4 + 0.8 + 1.0 = 2.2$ .

Q2. Explain why an expected value of $2.2$ can be valid even though $X$ never equals $2.2$ . [1 point]

Cue. The expected value is a long-run average over many trials, not a single outcome, so it need not be a value $X$ can actually take.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style) · 1 mark · Section I (multiple choice). A game pays $\$5$ with probability $0.2$ and $\$0$ with probability $0.8$. What is the expected payout?

(A) $\$1.00$
(B) $\$2.50$
(C) $\$5.00$
(D) $\$0.20$


The correct answer is (A).

The expected value is the sum of value times probability: $E(X) = 5(0.2) + 0(0.8) = 1.0 + 0 = \$ 1.00$.

(B) is the midpoint, not weighted by probability. (C) is the maximum payout. (D) confuses probability with value. Weighting each payout by its probability gives an expected payout of $\$ 1.00$.

AP 2022 (style) · 4 marks · Section II (free response). A random variable $X$ (number of defects per item) has $P(0) = 0.5$, $P(1) = 0.3$, $P(2) = 0.2$. (a) Find the mean (expected value) of $X$ and interpret it. (b) Find the standard deviation of $X$. (c) Explain why the expected value $0.7$ is not a value $X$ can actually take, and what it means.


A 4-point question on expected value and standard deviation.

(a) (2 points) $\mu_X = 0(0.5) + 1(0.3) + 2(0.2) = 0 + 0.3 + 0.4 = 0.7$ (1 point); interpret: over many items, the average number of defects per item is about $0.7$ (1 point, in context).
(b) (1 point) $\sigma_X^2 = (0-0.7)^2(0.5) + (1-0.7)^2(0.3) + (2-0.7)^2(0.2) = 0.245 + 0.027 + 0.338 = 0.61$ , so $\sigma_X = \sqrt{0.61} \approx 0.781$ .
(c) (1 point) The expected value is a long-run average, not a single outcome; an item has $0$ , $1$ , or $2$ defects, but averaged over many items the mean is $0.7$ defects per item.

Markers reward the correct expected value with a long-run interpretation, the variance and standard deviation, and the insight that the mean need not be an attainable value.


Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)