Skip to main content
United StatesStatisticsSyllabus dot point

When should we block or pair subjects instead of using a completely randomised design?

Topic 3.6 Selecting an Experimental Design: compare completely randomised, randomised block, and matched pairs designs, and explain how blocking and pairing control a known source of variation to make treatment effects clearer.

A focused answer to AP Statistics Topic 3.6, comparing completely randomised, randomised block, and matched pairs designs, and explaining how blocking and pairing remove a known source of variation to sharpen the comparison.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Three designs
  3. What blocking does
  4. Matched pairs as the extreme
  5. Choosing the right design on the exam
  6. Try this

What this topic is asking

The College Board (Topic 3.6) wants you to compare the main experimental designs, completely randomised, randomised block, and matched pairs, and to explain how blocking and pairing control a known source of variation so that the treatment effect stands out more clearly.

Three designs

The progression is one of increasing control over a known source of variation. The completely randomised design controls nothing in advance and relies on randomisation alone to balance everything. The block design takes one large, predictable source of variation out of the comparison by handling it explicitly. The matched pairs design takes this to its limit, pairing on as much similarity as possible.

What blocking does

The purpose of blocking is not to balance the block variable (random assignment already tends to do that) but to reduce variability so a real treatment effect is easier to detect. Suppose a drug's effect is small but sex strongly affects the response. In a completely randomised design, the large male-female differences inflate the overall variation and can swamp the drug's effect. Blocking on sex compares the treatments within men and within women separately, where the sex variation is absent, so the drug's effect emerges clearly. You block on a variable you know matters; you randomise within blocks to handle everything you do not know.

Matched pairs as the extreme

A matched pairs design is the tightest form of blocking, with blocks of size two. There are two common versions. In the first, you pair similar subjects (twins, or people matched on age and weight) and randomly give one of each pair each treatment. In the second, common and powerful, each subject receives both treatments, in a randomly chosen order, and serves as their own control, as in the running-sole example where each runner wears both soles. Because the comparison is made within a pair (or within a person), all the variation between people, weight, fitness, habits, simply cancels out, leaving a much cleaner read on the treatment difference. The price is that you must still randomise within each pair (which subject or which order gets the new treatment) so that a pairing artefact, such as a left-right foot difference or an order effect, does not confound the result. This is why matched pairs questions always reward randomising the within-pair assignment, usually by a coin flip per pair.

Choosing the right design on the exam

The exam expects you to justify a design choice, so reason from the source of variation. If there is a known variable likely to affect the response and you can group units by it, a randomised block (or matched pairs) design gives a more precise comparison than a completely randomised design of the same size. If no such dominant variable is apparent, or pairing is impractical, the completely randomised design is appropriate and simplest. When each unit can sensibly receive both treatments without one affecting the other, matched pairs (within-subject) is often the most powerful choice. In every case the analysis still rests on random assignment for its validity; blocking and pairing are about sharpening the comparison, not about whether causation can be claimed. Articulating "block what is known, randomise what is not" is the reasoning Topic 3.6 wants to see.

Try this

Q1. What is the purpose of blocking in an experiment? [2 points]

  • Cue. To remove the variation due to a known variable by comparing treatments within similar groups, making the treatment effect more precise and easier to detect (not to enable causation, which randomisation already does).

Q2. In a matched pairs design where each subject takes both treatments, what must still be randomised, and why? [1 point]

  • Cue. The order (or which treatment comes first) for each subject, so an order or position effect does not systematically favor one treatment.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). An experiment on a heart drug expects men and women to respond differently. To account for this, researchers separate subjects by sex, then randomly assign treatments within each group. This is a (A) completely randomised design (B) randomised block design with sex as the block (C) matched pairs design (D) stratified sample
Show worked answer →

The correct answer is (B).

Grouping subjects by a variable expected to affect the response (sex), then randomly assigning treatments within each group, is a randomised block design with sex as the blocking variable.

(A) A completely randomised design ignores sex and assigns all subjects at once. (C) Matched pairs pairs similar individuals (or uses each subject twice), not whole groups. (D) Stratifying is a sampling idea, not a treatment-assignment design. Blocking on sex then randomising within blocks is a randomised block design.

AP 2021 (style)4 marksSection II (free response). A researcher tests whether a new sole improves running comfort. Each of 2020 runners will wear the new sole on one foot and the standard sole on the other. (a) Identify this design. (b) Explain how it controls a source of variation that a completely randomised design would not. (c) Explain how the researcher should decide which foot gets the new sole, and why, justifying in context.
Show worked answer →

A 4-point question on matched pairs design.

(a) (1 point) This is a matched pairs design: each runner is their own pair, wearing both soles.
(b) (2 points) Differences between runners (weight, gait, fitness) are a large source of variation (1 point); by comparing the two soles within the same runner, this person-to-person variation is removed, so the comparison reflects the sole difference more clearly (1 point).
(c) (1 point) Randomly assign which foot gets the new sole for each runner (for example by coin flip), so that any left-right foot difference does not systematically favor one sole, justified because otherwise a foot effect could confound the sole effect.

Markers reward identifying matched pairs, explaining that pairing removes between-subject variation, and randomising the within-pair assignment with a reason.

Related dot points

Sources & how we know this