Skip to main content
United StatesStatisticsSyllabus dot point

How is the sampling distribution of the difference between two sample means described?

Topic 5.8 Sampling Distributions for Differences in Sample Means: describe the mean, standard deviation, and shape of the sampling distribution of the difference between two independent sample means, adding variances and checking the conditions for normality.

A focused answer to AP Statistics Topic 5.8, on the mean, standard deviation, and approximately normal shape of the difference between two independent sample means, the add-the-variances rule, the conditions, and finding probabilities, with full worked calculations.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Center and spread of the difference
  3. Why variances add (again)
  4. The conditions for normality
  5. Why this matters for inference
  6. Try this

What this topic is asking

The College Board (Topic 5.8) wants you to describe the mean, standard deviation, and shape of the sampling distribution of the difference between two independent sample means xˉ1xˉ2\bar{x}_1 - \bar{x}_2, adding the variances and checking the conditions for normality.

Center and spread of the difference

The mean is the difference of the two population means. The standard deviation adds the two sample-mean variances σ12n1\dfrac{\sigma_1^2}{n_1} and σ22n2\dfrac{\sigma_2^2}{n_2} and square-roots the sum. Each term is the variance of one sample mean (the square of σn\dfrac{\sigma}{\sqrt{n}} from Topic 5.7), so this is the same combining rule from Topic 4.9 applied to two independent sample means.

Why variances add (again)

This is the identical principle to Topic 5.6 for proportions: variances of independent quantities add under both addition and subtraction. The persistent error is to subtract the variances or to combine the standard deviations directly. The correct procedure is always: square each sample mean's standard deviation to get its variance, add the two variances, then take the square root.

The conditions for normality

The shape is approximately normal when each sample mean is approximately normal, so the shape conditions of Topic 5.7 must hold for both samples. Each sample mean is normal if its population is normal (any nn) or its sample is large (CLT, commonly n30n \ge 30). In a typical two-sample problem you justify normality by stating that both populations are normal, or that both sample sizes are large, or a mix (one normal population, one large sample). In addition, the two samples must be independent of each other, and each sample should be at most 10%10\% of its population for the standard deviation formula to be valid. A complete answer checks all of this: the normality justification for each sample, the between-sample independence, and the 10%10\% condition for each. This is the two-mean analogue of the doubled conditions seen for two proportions, and it is the main way the two-sample topic extends the one-sample Topic 5.7.

Why this matters for inference

Topic 5.8 is the sampling-distribution foundation for comparing two means, the basis of the two-sample tt procedures in Unit 7. A confidence interval for μ1μ2\mu_1 - \mu_2 is centered at xˉ1xˉ2\bar{x}_1 - \bar{x}_2 with a width built from this added-variances standard deviation (estimated as a standard error using s1s_1 and s2s_2, which is why a tt-distribution is used in practice), and a two-sample significance test computes a standardized statistic with it. Being able to state the center μ1μ2\mu_1 - \mu_2, compute the added-variances spread, justify the normal shape for both samples, and find a probability is exactly the preparation for those procedures. As with proportions, an illuminating question asks for the probability that the observed difference is large or even has the opposite sign to μ1μ2\mu_1 - \mu_2, which underscores that a single observed difference of sample means is one draw from a distribution of possible differences, not the true difference itself. Completing the full template here, center, added-variances spread, shape justification, standardize, interpret, rounds out Unit 5 and feeds straight into two-sample mean inference.

Try this

Q1. Write the standard deviation formula for xˉ1xˉ2\bar{x}_1 - \bar{x}_2 and state why variances are added. [2 points]

  • Cue. σ=σ12n1+σ22n2\sigma = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}}; variances add because the samples are independent (and add even for a difference).

Q2. What must be true for xˉ1xˉ2\bar{x}_1 - \bar{x}_2 to be approximately normal? [1 point]

  • Cue. Each sample mean must be approximately normal (both populations normal, or both samples large by the CLT), with the samples independent and each at most 10%10\% of its population.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). Two independent samples have sample-mean standard deviations σxˉ1=3\sigma_{\bar{x}_1} = 3 and σxˉ2=4\sigma_{\bar{x}_2} = 4. The standard deviation of xˉ1xˉ2\bar{x}_1 - \bar{x}_2 is (A) 11 (B) 55 (C) 77 (D) 2525
Show worked answer →

The correct answer is (B).

For independent samples, variances add: σxˉ1xˉ2=32+42=9+16=25=5\sigma_{\bar{x}_1 - \bar{x}_2} = \sqrt{3^2 + 4^2} = \sqrt{9 + 16} = \sqrt{25} = 5.

(A) wrongly subtracts. (C) adds the standard deviations directly. (D) is the variance, not the standard deviation. Adding variances and rooting gives 55.

AP 2022 (style)4 marksSection II (free response). Population 1 has mean μ1=70\mu_1 = 70 and standard deviation σ1=10\sigma_1 = 10; population 2 has μ2=65\mu_2 = 65 and σ2=8\sigma_2 = 8. Independent random samples of n1=50n_1 = 50 and n2=40n_2 = 40 are taken. Let D=xˉ1xˉ2D = \bar{x}_1 - \bar{x}_2. (a) Find the mean and standard deviation of DD. (b) Justify the shape. (c) Find the probability that xˉ1xˉ2\bar{x}_1 - \bar{x}_2 is greater than 88, and interpret in context.
Show worked answer →

A 4-point question on the difference of two means.

(a) (2 points) μD=μ1μ2=7065=5\mu_D = \mu_1 - \mu_2 = 70 - 65 = 5 (1 point); σD=σ12n1+σ22n2=10050+6440=2+1.6=3.61.897\sigma_D = \sqrt{\dfrac{\sigma_1^2}{n_1} + \dfrac{\sigma_2^2}{n_2}} = \sqrt{\dfrac{100}{50} + \dfrac{64}{40}} = \sqrt{2 + 1.6} = \sqrt{3.6} \approx 1.897 (1 point).
(b) (1 point) Both samples are large (n1=50n_1 = 50, n2=40n_2 = 40, each 30\ge 30), so by the central limit theorem each sample mean is approximately normal and so is their difference.
(c) (1 point) z=851.8971.58z = \dfrac{8 - 5}{1.897} \approx 1.58, so P(D>8)=P(Z>1.58)0.0571P(D > 8) = P(Z > 1.58) \approx 0.0571; about a 5.7%5.7\% chance the difference in sample means exceeds 88.

Markers reward the mean and the add-the-variances standard deviation, justifying normality via the CLT for both samples, and the probability with interpretation.

Related dot points

Sources & how we know this