When should we block or pair subjects instead of using a completely randomised design?
Topic 3.6 Selecting an Experimental Design: compare completely randomised, randomised block, and matched pairs designs, and explain how blocking and pairing control a known source of variation to make treatment effects clearer.
A focused answer to AP Statistics Topic 3.6, comparing completely randomised, randomised block, and matched pairs designs, and explaining how blocking and pairing remove a known source of variation to sharpen the comparison.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 3.6) wants you to compare the main experimental designs, completely randomised, randomised block, and matched pairs, and to explain how blocking and pairing control a known source of variation so that the treatment effect stands out more clearly.
Three designs
The progression is one of increasing control over a known source of variation. The completely randomised design controls nothing in advance and relies on randomisation alone to balance everything. The block design takes one large, predictable source of variation out of the comparison by handling it explicitly. The matched pairs design takes this to its limit, pairing on as much similarity as possible.
What blocking does
The purpose of blocking is not to balance the block variable (random assignment already tends to do that) but to reduce variability so a real treatment effect is easier to detect. Suppose a drug's effect is small but sex strongly affects the response. In a completely randomised design, the large male-female differences inflate the overall variation and can swamp the drug's effect. Blocking on sex compares the treatments within men and within women separately, where the sex variation is absent, so the drug's effect emerges clearly. You block on a variable you know matters; you randomise within blocks to handle everything you do not know.
Matched pairs as the extreme
A matched pairs design is the tightest form of blocking, with blocks of size two. There are two common versions. In the first, you pair similar subjects (twins, or people matched on age and weight) and randomly give one of each pair each treatment. In the second, common and powerful, each subject receives both treatments, in a randomly chosen order, and serves as their own control, as in the running-sole example where each runner wears both soles. Because the comparison is made within a pair (or within a person), all the variation between people, weight, fitness, habits, simply cancels out, leaving a much cleaner read on the treatment difference. The price is that you must still randomise within each pair (which subject or which order gets the new treatment) so that a pairing artefact, such as a left-right foot difference or an order effect, does not confound the result. This is why matched pairs questions always reward randomising the within-pair assignment, usually by a coin flip per pair.
Choosing the right design on the exam
The exam expects you to justify a design choice, so reason from the source of variation. If there is a known variable likely to affect the response and you can group units by it, a randomised block (or matched pairs) design gives a more precise comparison than a completely randomised design of the same size. If no such dominant variable is apparent, or pairing is impractical, the completely randomised design is appropriate and simplest. When each unit can sensibly receive both treatments without one affecting the other, matched pairs (within-subject) is often the most powerful choice. In every case the analysis still rests on random assignment for its validity; blocking and pairing are about sharpening the comparison, not about whether causation can be claimed. Articulating "block what is known, randomise what is not" is the reasoning Topic 3.6 wants to see.
Try this
Q1. What is the purpose of blocking in an experiment? [2 points]
- Cue. To remove the variation due to a known variable by comparing treatments within similar groups, making the treatment effect more precise and easier to detect (not to enable causation, which randomisation already does).
Q2. In a matched pairs design where each subject takes both treatments, what must still be randomised, and why? [1 point]
- Cue. The order (or which treatment comes first) for each subject, so an order or position effect does not systematically favor one treatment.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2019 (style)1 marksSection I (multiple choice). An experiment on a heart drug expects men and women to respond differently. To account for this, researchers separate subjects by sex, then randomly assign treatments within each group. This is a (A) completely randomised design (B) randomised block design with sex as the block (C) matched pairs design (D) stratified sampleShow worked answer →
The correct answer is (B).
Grouping subjects by a variable expected to affect the response (sex), then randomly assigning treatments within each group, is a randomised block design with sex as the blocking variable.
(A) A completely randomised design ignores sex and assigns all subjects at once. (C) Matched pairs pairs similar individuals (or uses each subject twice), not whole groups. (D) Stratifying is a sampling idea, not a treatment-assignment design. Blocking on sex then randomising within blocks is a randomised block design.
AP 2021 (style)4 marksSection II (free response). A researcher tests whether a new sole improves running comfort. Each of runners will wear the new sole on one foot and the standard sole on the other. (a) Identify this design. (b) Explain how it controls a source of variation that a completely randomised design would not. (c) Explain how the researcher should decide which foot gets the new sole, and why, justifying in context.Show worked answer →
A 4-point question on matched pairs design.
(a) (1 point) This is a matched pairs design: each runner is their own pair, wearing both soles.
(b) (2 points) Differences between runners (weight, gait, fitness) are a large source of variation (1 point); by comparing the two soles within the same runner, this person-to-person variation is removed, so the comparison reflects the sole difference more clearly (1 point).
(c) (1 point) Randomly assign which foot gets the new sole for each runner (for example by coin flip), so that any left-right foot difference does not systematically favor one sole, justified because otherwise a foot effect could confound the sole effect.
Markers reward identifying matched pairs, explaining that pairing removes between-subject variation, and randomising the within-pair assignment with a reason.
Related dot points
- Topic 3.5 Introduction to Experimental Design: identify the components of an experiment (units, treatments, response) and apply the principles of comparison, random assignment, replication, and control, including blinding and the placebo effect.
A focused answer to AP Statistics Topic 3.5, on experimental units, treatments and factors, and the principles of comparison, random assignment, replication, and control, plus blinding and the placebo effect.
- Topic 3.2 Introduction to Planning a Study: distinguish observational studies from experiments, identify explanatory and response variables, and recognize that only an experiment with imposed treatments can support a causal conclusion.
A focused answer to AP Statistics Topic 3.2, distinguishing observational studies from experiments, identifying explanatory and response variables and confounding, and explaining why imposing treatments is what enables causal claims.
- Topic 3.7 Inference and Experiments: use the presence or absence of random selection and random assignment to determine the scope of inference, that is, whether results generalize to a population and whether a causal conclusion is justified.
A focused answer to AP Statistics Topic 3.7, on the scope of inference, using random selection (generalization) and random assignment (causation) to decide what conclusions are valid, with a worked four-quadrant analysis.
- Topic 3.1 Introducing Statistics: Do the Data We Collected Tell the Truth? Recognize that the method of data collection determines the kinds of conclusions that can be drawn, and that poorly collected data cannot be fixed by analysis.
A focused answer to AP Statistics Topic 3.1, on why the data-collection method determines what conclusions are valid, the difference between random error and bias, and why analysis cannot rescue badly collected data.
- Topic 3.3 Random Sampling and Data Collection: describe and distinguish simple random, stratified, cluster, and systematic random sampling, and explain why random selection supports generalization to a population.
A focused answer to AP Statistics Topic 3.3, describing simple random, stratified, cluster, and systematic random sampling, how each uses chance, their trade-offs, and why random selection allows generalization, with a worked SRS selection.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)