United StatesStatisticsSyllabus dot point

When should we block or pair subjects instead of using a completely randomised design?

Topic 3.6 Selecting an Experimental Design: compare completely randomised, randomised block, and matched pairs designs, and explain how blocking and pairing control a known source of variation to make treatment effects clearer.

A focused answer to AP Statistics Topic 3.6, comparing completely randomised, randomised block, and matched pairs designs, and explaining how blocking and pairing remove a known source of variation to sharpen the comparison.

Generated by Claude Opus 4.810 min answerUpdated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this topic is asking
Three designs
What blocking does
Matched pairs as the extreme
Choosing the right design on the exam
Try this

What this topic is asking

The College Board (Topic 3.6) wants you to compare the main experimental designs, completely randomised, randomised block, and matched pairs, and to explain how blocking and pairing control a known source of variation so that the treatment effect stands out more clearly.

Three designs

The progression is one of increasing control over a known source of variation. The completely randomised design controls nothing in advance and relies on randomisation alone to balance everything. The block design takes one large, predictable source of variation out of the comparison by handling it explicitly. The matched pairs design takes this to its limit, pairing on as much similarity as possible.

What blocking does

The purpose of blocking is not to balance the block variable (random assignment already tends to do that) but to reduce variability so a real treatment effect is easier to detect. Suppose a drug's effect is small but sex strongly affects the response. In a completely randomised design, the large male-female differences inflate the overall variation and can swamp the drug's effect. Blocking on sex compares the treatments within men and within women separately, where the sex variation is absent, so the drug's effect emerges clearly. You block on a variable you know matters; you randomise within blocks to handle everything you do not know.

Matched pairs as the extreme

A matched pairs design is the tightest form of blocking, with blocks of size two. There are two common versions. In the first, you pair similar subjects (twins, or people matched on age and weight) and randomly give one of each pair each treatment. In the second, common and powerful, each subject receives both treatments, in a randomly chosen order, and serves as their own control, as in the running-sole example where each runner wears both soles. Because the comparison is made within a pair (or within a person), all the variation between people, weight, fitness, habits, simply cancels out, leaving a much cleaner read on the treatment difference. The price is that you must still randomise within each pair (which subject or which order gets the new treatment) so that a pairing artefact, such as a left-right foot difference or an order effect, does not confound the result. This is why matched pairs questions always reward randomising the within-pair assignment, usually by a coin flip per pair.

Choosing the right design on the exam

The exam expects you to justify a design choice, so reason from the source of variation. If there is a known variable likely to affect the response and you can group units by it, a randomised block (or matched pairs) design gives a more precise comparison than a completely randomised design of the same size. If no such dominant variable is apparent, or pairing is impractical, the completely randomised design is appropriate and simplest. When each unit can sensibly receive both treatments without one affecting the other, matched pairs (within-subject) is often the most powerful choice. In every case the analysis still rests on random assignment for its validity; blocking and pairing are about sharpening the comparison, not about whether causation can be claimed. Articulating "block what is known, randomise what is not" is the reasoning Topic 3.6 wants to see.

Comparing a completely randomised and a block design

A researcher tests two study methods on $40$ students for a vocabulary test. Prior GPA strongly predicts test performance. (a) Describe a completely randomised design. (b) Describe a randomised block design using GPA, and (c) explain why the block design may detect a method effect more easily.

step 1 Completely randomised design (part a)

Number the $40$ students $1$ to $40$ . Use a random number generator to assign $20$ to method A and $20$ to method B, ignoring GPA. Compare the average test scores of the two groups.

step 2 Randomised block design (part b)

Rank the $40$ students by GPA and form blocks of similar GPA, for example a "high-GPA" block of $20$ and a "low-GPA" block of $20$ . Within each block, randomly assign $10$ students to method A and $10$ to method B. Compare methods within each block, then combine.

step 3 Why blocking helps (part c)

GPA strongly affects test scores, so in the completely randomised design the high-low GPA gap inflates the overall variation, which can hide a modest method effect. The block design compares the two methods among similar-GPA students, where the GPA variation is removed, so a real method difference stands out against smaller leftover variation.

step 4 Interpret

Both designs use random assignment and so both can support a causal conclusion about study method. But by removing the known GPA variation, the block design estimates the method effect more precisely, making it easier to detect a true difference of a given size.

Try this

Q1. What is the purpose of blocking in an experiment? [2 points]

Cue. To remove the variation due to a known variable by comparing treatments within similar groups, making the treatment effect more precise and easier to detect (not to enable causation, which randomisation already does).

Q2. In a matched pairs design where each subject takes both treatments, what must still be randomised, and why? [1 point]

Cue. The order (or which treatment comes first) for each subject, so an order or position effect does not systematically favor one treatment.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). An experiment on a heart drug expects men and women to respond differently. To account for this, researchers separate subjects by sex, then randomly assign treatments within each group. This is a (A) completely randomised design (B) randomised block design with sex as the block (C) matched pairs design (D) stratified sample

Show worked answer →

The correct answer is (B).

Grouping subjects by a variable expected to affect the response (sex), then randomly assigning treatments within each group, is a randomised block design with sex as the blocking variable.

(A) A completely randomised design ignores sex and assigns all subjects at once. (C) Matched pairs pairs similar individuals (or uses each subject twice), not whole groups. (D) Stratifying is a sampling idea, not a treatment-assignment design. Blocking on sex then randomising within blocks is a randomised block design.

AP 2021 (style)4 marksSection II (free response). A researcher tests whether a new sole improves running comfort. Each of

20

runners will wear the new sole on one foot and the standard sole on the other. (a) Identify this design. (b) Explain how it controls a source of variation that a completely randomised design would not. (c) Explain how the researcher should decide which foot gets the new sole, and why, justifying in context.

Show worked answer →

A 4-point question on matched pairs design.

(a) (1 point) This is a matched pairs design: each runner is their own pair, wearing both soles.
(b) (2 points) Differences between runners (weight, gait, fitness) are a large source of variation (1 point); by comparing the two soles within the same runner, this person-to-person variation is removed, so the comparison reflects the sole difference more clearly (1 point).
(c) (1 point) Randomly assign which foot gets the new sole for each runner (for example by coin flip), so that any left-right foot difference does not systematically favor one sole, justified because otherwise a foot effect could confound the sole effect.

Markers reward identifying matched pairs, explaining that pairing removes between-subject variation, and randomising the within-pair assignment with a reason.

Related dot points

Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)