Skip to main content
United StatesStatisticsSyllabus dot point

What are the mean and standard deviation of a binomial distribution, and what shape does it take?

Topic 4.11 Parameters for a Binomial Distribution: calculate and interpret the mean and standard deviation of a binomial random variable using the shortcut formulas, and describe how the distribution's shape depends on n and p.

A focused answer to AP Statistics Topic 4.11, on the binomial mean np and standard deviation, why the shortcuts work, interpreting them in context, and how shape depends on n and p, with full worked calculations.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. The shortcut formulas
  3. Why the formulas work
  4. The shape of a binomial distribution
  5. Using the parameters to judge surprise
  6. Try this

What this topic is asking

The College Board (Topic 4.11) wants you to calculate and interpret the mean and standard deviation of a binomial random variable using the shortcut formulas, and to describe how the distribution's shape depends on nn and pp.

The shortcut formulas

The mean formula is intuitive: if each of nn trials succeeds with probability pp, you expect about npnp successes, just as 5050 free throws at 70%70\% should produce about 3535 makes. The standard deviation formula is less obvious but follows from adding the variances of nn independent trials (Topic 4.9): each trial contributes variance p(1p)p(1-p), so nn of them give np(1p)np(1-p), and the square root is the standard deviation.

Why the formulas work

Seeing the binomial as a sum of nn independent one-trial variables connects this topic back to Topic 4.9 and explains both shortcuts at once: means add to give npnp, and (because the trials are independent) variances add to give np(1p)np(1-p). This is also why you must not add standard deviations: the \sqrt{\cdot} comes from adding variances and rooting at the end. Understanding the derivation, rather than just memorizing the formulas, makes them stick and clarifies why independence is required.

The shape of a binomial distribution

The third part of the topic is shape. With p=0.5p = 0.5 the distribution is perfectly symmetric, because successes and failures are equally likely. With p<0.5p < 0.5 successes are rarer, so the distribution is right-skewed (piled up at low counts with a tail toward high counts); with p>0.5p > 0.5 it is left-skewed. But as the number of trials nn increases, the distribution becomes more bell-shaped and symmetric regardless of pp, a preview of the normal approximation. The standard rule of thumb (the large-counts condition) is that the binomial is approximately normal when both np10np \ge 10 and n(1p)10n(1-p) \ge 10, that is, when you expect at least about 1010 successes and 1010 failures. When that holds, you can use the mean npnp and standard deviation np(1p)\sqrt{np(1-p)} with the normal model to estimate probabilities, exactly the bridge that makes Unit 5's sampling distribution of a proportion approximately normal. So the shape discussion is not decorative: it tells you when the normal machinery of Topic 1.10 may be reused on a binomial count.

Using the parameters to judge surprise

A powerful exam move combines the binomial parameters with the z-score idea from Topic 1.10 to decide whether an observed count is surprising. If the number of successes you saw is several standard deviations from the mean npnp, it is far out in the tail and unlikely under the assumed pp, evidence that pp may not be what was claimed. In the defects example, 2020 defective parts is about 3.23.2 standard deviations above the expected 1010, which is extreme, so a 5%5\% defect rate looks doubtful. This is the same surprise-measuring logic from Topic 4.1, now made quantitative with μ=np\mu = np and σ=np(1p)\sigma = \sqrt{np(1-p)}, and it is the direct ancestor of the significance test for a proportion in Unit 6. Being able to compute the two parameters, interpret the mean as a long-run expected count, and use the standard deviation to gauge how unusual an outcome is, is the full skill set Topic 4.11 builds.

Try this

Q1. For XB(100,0.3)X \sim B(100, 0.3), find the mean and standard deviation. [2 points]

  • Cue. μ=np=30\mu = np = 30; σ=np(1p)=100(0.3)(0.7)=214.58\sigma = \sqrt{np(1-p)} = \sqrt{100(0.3)(0.7)} = \sqrt{21} \approx 4.58.

Q2. State the condition under which a binomial distribution is approximately normal. [1 point]

  • Cue. The large-counts condition: both np10np \ge 10 and n(1p)10n(1-p) \ge 10 (at least about 1010 expected successes and 1010 expected failures).

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2018 (style)1 marksSection I (multiple choice). For XB(50,0.2)X \sim B(50, 0.2), what is the mean of XX? (A) 55 (B) 1010 (C) 4040 (D) 0.20.2
Show worked answer →

The correct answer is (B).

The binomial mean is μ=np=50×0.2=10\mu = np = 50 \times 0.2 = 10.

(A) and (C) miscompute the product. (D) gives the probability pp, not the mean. Multiplying the number of trials by the success probability gives an expected 1010 successes.

AP 2021 (style)4 marksSection II (free response). A quality inspector checks 200200 independently produced parts, each defective with probability 0.050.05. Let XX be the number defective. (a) Find the mean and standard deviation of XX. (b) Interpret the mean in context. (c) Would 2020 defective parts be surprising? Use the mean and standard deviation to justify your answer in context.
Show worked answer →

A 4-point question on binomial parameters.

(a) (2 points) μ=np=200(0.05)=10\mu = np = 200(0.05) = 10 (1 point); σ=np(1p)=200(0.05)(0.95)=9.53.08\sigma = \sqrt{np(1-p)} = \sqrt{200(0.05)(0.95)} = \sqrt{9.5} \approx 3.08 (1 point).
(b) (1 point) On average, about 1010 of the 200200 parts are defective per batch over the long run.
(c) (1 point) 2020 defective is (2010)/3.083.2(20 - 10)/3.08 \approx 3.2 standard deviations above the mean, which is far out in the tail, so 2020 defective would be surprising under a 5%5\% defect rate (suggesting the rate may be higher).

Markers reward correct mean and standard deviation, a long-run interpretation of the mean, and using a z-score-style argument to judge that 2020 is surprising.

Related dot points

Sources & how we know this