Write the standard deviation formula for x_1 - x_2 and state why variances are added. [2 points]

What must be true for x_1 - x_2 to be approximately normal? [1 point]

United StatesStatisticsSyllabus dot point

How is the sampling distribution of the difference between two sample means described?

Topic 5.8 Sampling Distributions for Differences in Sample Means: describe the mean, standard deviation, and shape of the sampling distribution of the difference between two independent sample means, adding variances and checking the conditions for normality.

A focused answer to AP Statistics Topic 5.8, on the mean, standard deviation, and approximately normal shape of the difference between two independent sample means, the add-the-variances rule, the conditions, and finding probabilities, with full worked calculations.

Generated by Claude Opus 4.810 min answerUpdated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Quick answer

For two independent random samples from populations with means $\mu_1, \mu_2$ and standard deviations $\sigma_1, \sigma_2$ , and sizes $n_1, n_2$ , the sampling distribution of the difference $\bar{x}_1 - \bar{x}_2$ has:

mean $\mu_{\bar{x}_1 - \bar{x}_2} = \mu_1 - \mu_2$ ;
standard deviation $\sigma_{\bar{x}_1 - \bar{x}_2} = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}$ (add the two variances, then square-root);
an approximately normal shape when each sample mean is approximately normal, that is, both populations are normal or both sample sizes are large (central limit theorem), with the samples independent and each at most $10\%$ of its population.

As with proportions, the standard deviation adds variances because the samples are independent.

Jump to a section

What this topic is asking
Center and spread of the difference
Why variances add (again)
The conditions for normality
Why this matters for inference
Try this

What this topic is asking

The College Board (Topic 5.8) wants you to describe the mean, standard deviation, and shape of the sampling distribution of the difference between two independent sample means $\bar{x}_1 - \bar{x}_2$ , adding the variances and checking the conditions for normality.

Center and spread of the difference

The mean is the difference of the two population means. The standard deviation adds the two sample-mean variances $\dfrac{\sigma_1^2}{n_1}$ and $\dfrac{\sigma_2^2}{n_2}$ and square-roots the sum. Each term is the variance of one sample mean (the square of $\dfrac{\sigma}{\sqrt{n}}$ from Topic 5.7), so this is the same combining rule from Topic 4.9 applied to two independent sample means.

Why variances add (again)

This is the identical principle to Topic 5.6 for proportions: variances of independent quantities add under both addition and subtraction. The persistent error is to subtract the variances or to combine the standard deviations directly. The correct procedure is always: square each sample mean's standard deviation to get its variance, add the two variances, then take the square root.

The conditions for normality

The shape is approximately normal when each sample mean is approximately normal, so the shape conditions of Topic 5.7 must hold for both samples. Each sample mean is normal if its population is normal (any $n$ ) or its sample is large (CLT, commonly $n \ge 30$ ). In a typical two-sample problem you justify normality by stating that both populations are normal, or that both sample sizes are large, or a mix (one normal population, one large sample). In addition, the two samples must be independent of each other, and each sample should be at most $10\%$ of its population for the standard deviation formula to be valid. A complete answer checks all of this: the normality justification for each sample, the between-sample independence, and the $10\%$ condition for each. This is the two-mean analogue of the doubled conditions seen for two proportions, and it is the main way the two-sample topic extends the one-sample Topic 5.7.

Why this matters for inference

Topic 5.8 is the sampling-distribution foundation for comparing two means, the basis of the two-sample $t$ procedures in Unit 7. A confidence interval for $\mu_1 - \mu_2$ is centered at $\bar{x}_1 - \bar{x}_2$ with a width built from this added-variances standard deviation (estimated as a standard error using $s_1$ and $s_2$ , which is why a $t$ -distribution is used in practice), and a two-sample significance test computes a standardized statistic with it. Being able to state the center $\mu_1 - \mu_2$ , compute the added-variances spread, justify the normal shape for both samples, and find a probability is exactly the preparation for those procedures. As with proportions, an illuminating question asks for the probability that the observed difference is large or even has the opposite sign to $\mu_1 - \mu_2$ , which underscores that a single observed difference of sample means is one draw from a distribution of possible differences, not the true difference itself. Completing the full template here, center, added-variances spread, shape justification, standardize, interpret, rounds out Unit 5 and feeds straight into two-sample mean inference.

Difference of two sample means

Brand A batteries last a mean of $\mu_A = 40$ hours with $\sigma_A = 6$ ; brand B last $\mu_B = 36$ hours with $\sigma_B = 8$ . Both lifetime distributions are approximately normal. Independent samples of $n_A = 36$ and $n_B = 64$ are taken. Let $D = \bar{x}_A - \bar{x}_B$ . (a) Find the mean and standard deviation of $D$ . (b) Justify the shape. (c) Find $P(D < 2)$ .

step 1 Center and spread (part a)

$\mu_D = \mu_A - \mu_B = 40 - 36 = 4$ hours. Add the variances:

\sigma_D = \sqrt{\frac{\sigma_A^2}{n_A} + \frac{\sigma_B^2}{n_B}} = \sqrt{\frac{36}{36} + \frac{64}{64}} = \sqrt{1 + 1} = \sqrt{2} \approx 1.414 \text{ hours}.

step 2 Justify the shape (part b)

Both populations are approximately normal, so each sample mean is normal for any $n$ , and therefore their difference $D$ is approximately normal with mean $4$ and standard deviation about $1.414$ . (Even without normal populations, $n_A = 36$ and $n_B = 64$ are large enough for the CLT.)

step 3 Standardize and find the area (part c)

$z = \dfrac{2 - 4}{1.414} = \dfrac{-2}{1.414} \approx -1.41$ . So $P(D < 2) = P(Z < -1.41) \approx 0.0793$ , about $7.9\%$ .

step 4 Interpret

The difference in mean lifetimes is approximately normal, centered at $4$ hours with standard deviation about $1.41$ hours. There is roughly a $7.9\%$ chance the observed difference in sample means is below $2$ hours, even though brand A truly averages $4$ hours longer. The spread came from adding the two sample-mean variances and square-rooting, the essential rule for a difference of independent means.

Try this

Q1. Write the standard deviation formula for $\bar{x}_1 - \bar{x}_2$ and state why variances are added. [2 points]

Cue. $\sigma = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}$ ; variances add because the samples are independent (and add even for a difference).

Q2. What must be true for $\bar{x}_1 - \bar{x}_2$ to be approximately normal? [1 point]

Cue. Each sample mean must be approximately normal (both populations normal, or both samples large by the CLT), with the samples independent and each at most $10\%$ of its population.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). Two independent samples have sample-mean standard deviations

\sigma_{\bar{x}_1} = 3

and

\sigma_{\bar{x}_2} = 4

. The standard deviation of

\bar{x}_1 - \bar{x}_2

is (A)

1

(B)

5

(C)

7

(D)

25

Show worked answer →

The correct answer is (B).

For independent samples, variances add: $\sigma_{\bar{x}_1 - \bar{x}_2} = \sqrt{3^2 + 4^2} = \sqrt{9 + 16} = \sqrt{25} = 5$ .

(A) wrongly subtracts. (C) adds the standard deviations directly. (D) is the variance, not the standard deviation. Adding variances and rooting gives $5$ .

AP 2022 (style)4 marksSection II (free response). Population 1 has mean

\mu_1 = 70

and standard deviation

\sigma_1 = 10

; population 2 has

\mu_2 = 65

and

\sigma_2 = 8

. Independent random samples of

n_1 = 50

and

n_2 = 40

are taken. Let

D = \bar{x}_1 - \bar{x}_2

. (a) Find the mean and standard deviation of

D

. (b) Justify the shape. (c) Find the probability that

\bar{x}_1 - \bar{x}_2

is greater than

8

, and interpret in context.

Show worked answer →

A 4-point question on the difference of two means.

(a) (2 points) $\mu_D = \mu_1 - \mu_2 = 70 - 65 = 5$ (1 point); $\sigma_D = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}} = \sqrt{\dfrac{100}{50} + \dfrac{64}{40}} = \sqrt{2 + 1.6} = \sqrt{3.6} \approx 1.897$ (1 point).
(b) (1 point) Both samples are large ( $n_1 = 50$ , $n_2 = 40$ , each $\ge 30$ ), so by the central limit theorem each sample mean is approximately normal and so is their difference.
(c) (1 point) $z = \dfrac{8 - 5}{1.897} \approx 1.58$ , so $P(D > 8) = P(Z > 1.58) \approx 0.0571$ ; about a $5.7\%$ chance the difference in sample means exceeds $8$ .

Markers reward the mean and the add-the-variances standard deviation, justifying normality via the CLT for both samples, and the probability with interpretation.

Related dot points

Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)