How do we find the long-run average and spread of a random variable from its distribution?
Topic 4.8 Mean and Standard Deviation of Random Variables: calculate and interpret the mean (expected value), variance, and standard deviation of a discrete random variable from its probability distribution.
A focused answer to AP Statistics Topic 4.8, on the expected value (mean), variance, and standard deviation of a discrete random variable, the weighted-average idea, and interpreting expected value as a long-run mean, with full worked calculations.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 4.8) wants you to calculate and interpret the mean (expected value), variance, and standard deviation of a discrete random variable from its probability distribution, and to read the mean as a long-run average.
The mean as a weighted average
Expected value generalizes the ordinary average. An ordinary mean weights every data value equally; the expected value weights each possible value by how likely it is. So values that occur more often pull the mean toward them more strongly. The result is the value the long-run average of the random variable settles on, by the law of large numbers, exactly the "fair price" of a game or the average defects per item over a long production run.
Variance and standard deviation
The structure mirrors Unit 1's standard deviation but with probabilities as weights. You find the mean, take each value's deviation from it, square the deviation, weight by the probability, sum to get the variance, and square-root to get the standard deviation. The squaring (as in Unit 1) prevents positive and negative deviations from cancelling and emphasizes larger departures; the square root returns the spread to the original units so it is interpretable.
Interpreting expected value correctly
The most tested idea is what the expected value means. It is a long-run average, not a prediction of any single outcome, and it need not be an attainable value: a random variable that takes only whole numbers can have a fractional mean like defects, which simply says that over many items the defects average each. In a game of chance, the expected payout is the average you would win per play over a very long run, which is why it is the right notion of a "fair" stake: a game is fair if the expected net gain is . Misreading the expected value as "the most likely outcome" or "what will happen next" is a classic error, the expected value of a single die roll is , which never appears on any single roll. Holding the long-run-average meaning firmly lets you interpret expected value in any context, payouts, defects, waiting times, and it is the meaning the binomial mean (Topic 4.11) and all later expected values inherit.
Why these parameters matter downstream
The mean and standard deviation of a random variable are the parameters that the rest of the course revolves around. Combining random variables (Topic 4.9) is entirely a set of rules for how means and standard deviations behave when variables are added or scaled. The binomial and geometric distributions come with shortcut formulas for exactly these two parameters. And in the sampling-distribution unit, a sample mean and a sample proportion are themselves random variables whose mean and standard deviation (the standard error) drive every confidence interval and significance test. So the calculations here, weighted-average mean, weighted squared-deviation variance, are not isolated arithmetic but the foundation of the inferential machinery to come. Mastering them now, and especially interpreting the expected value as a long-run mean, pays dividends across half the course.
Try this
Q1. has , , . Find . [2 points]
- Cue. .
Q2. Explain why an expected value of can be valid even though never equals . [1 point]
- Cue. The expected value is a long-run average over many trials, not a single outcome, so it need not be a value can actually take.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2018 (style)1 marksSection I (multiple choice). A game pays \50.2\ with probability . What is the expected payout? (A) \1.00\ (C) \5.00\Show worked answer →
The correct answer is (A).
The expected value is the sum of value times probability: E(X) = 5(0.2) + 0(0.8) = 1.0 + 0 = \1.00$.
(B) is the midpoint, not weighted by probability. (C) is the maximum payout. (D) confuses probability with value. Weighting each payout by its probability gives an expected payout of \1.00$.
AP 2022 (style)4 marksSection II (free response). A random variable (number of defects per item) has , , . (a) Find the mean (expected value) of and interpret it. (b) Find the standard deviation of . (c) Explain why the expected value is not a value can actually take, and what it means.Show worked answer →
A 4-point question on expected value and standard deviation.
(a) (2 points) (1 point); interpret: over many items, the average number of defects per item is about (1 point, in context).
(b) (1 point) , so .
(c) (1 point) The expected value is a long-run average, not a single outcome; an item has , , or defects, but averaged over many items the mean is defects per item.
Markers reward the correct expected value with a long-run interpretation, the variance and standard deviation, and the insight that the mean need not be an attainable value.
Related dot points
- Topic 4.7 Introduction to Random Variables and Probability Distributions: define discrete random variables, represent and interpret their probability distributions, and use them to find probabilities of events.
A focused answer to AP Statistics Topic 4.7, defining discrete random variables, the requirements of a valid probability distribution, cumulative probabilities, and interpreting distributions in context, with worked probability calculations.
- Topic 4.9 Combining Random Variables: apply the rules for the mean and variance of a linear transformation and of sums and differences of random variables, adding variances (not standard deviations) for independent variables.
A focused answer to AP Statistics Topic 4.9, on transforming and combining random variables, how means and variances behave under scaling and addition, the add-the-variances rule for independence, and why variances add for differences too, with worked calculations.
- Topic 4.10 Introduction to the Binomial Distribution: identify binomial settings (BINS conditions) and use the binomial probability formula to find the probability of a given number of successes in a fixed number of trials.
A focused answer to AP Statistics Topic 4.10, on the binomial setting (the BINS conditions), the binomial probability formula, and computing exact and cumulative binomial probabilities, with full worked calculations.
- Topic 4.11 Parameters for a Binomial Distribution: calculate and interpret the mean and standard deviation of a binomial random variable using the shortcut formulas, and describe how the distribution's shape depends on n and p.
A focused answer to AP Statistics Topic 4.11, on the binomial mean np and standard deviation, why the shortcuts work, interpreting them in context, and how shape depends on n and p, with full worked calculations.
- Topic 1.7 Summary Statistics for a Quantitative Variable: calculate and interpret measures of center (mean, median) and spread (range, IQR, standard deviation, variance), and judge their resistance to outliers.
A focused answer to AP Statistics Topic 1.7, defining and computing the mean, median, range, IQR, variance, and standard deviation, explaining resistance to outliers, with full worked calculations.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)