United StatesStatisticsSyllabus dot point

How do random selection and random assignment together decide what conclusions a study can support?

Topic 3.7 Inference and Experiments: use the presence or absence of random selection and random assignment to determine the scope of inference, that is, whether results generalize to a population and whether a causal conclusion is justified.

A focused answer to AP Statistics Topic 3.7, on the scope of inference, using random selection (generalization) and random assignment (causation) to decide what conclusions are valid, with a worked four-quadrant analysis.

Generated by Claude Opus 4.810 min answerUpdated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this topic is asking
The two questions, the two randomisations
The four-quadrant grid
Why each randomisation does its job
Writing a defensible conclusion
Try this

What this topic is asking

The College Board (Topic 3.7) wants you to use the presence or absence of two randomisations, random selection and random assignment, to determine the scope of inference: whether a result can be generalized to a population, and whether a causal conclusion is justified.

The two questions, the two randomisations

These are genuinely separate. A study can have one, both, or neither. Confusing them is the most common error in the topic, so it is worth fixing the pairing firmly: selection to generalize, assignment to cause. Selection is about who is in the study; assignment is about what is done to them.

The four-quadrant grid

Most real experiments use volunteers (random assignment, no random selection), so they support causation for those volunteers but not automatic generalization. Most surveys use random selection but are observational (no random assignment), so they support generalisable associations but not causation. Recognizing which quadrant a study falls in, then stating exactly the conclusion that quadrant allows, is the core skill Topic 3.7 assesses.

Why each randomisation does its job

Random selection works because a chance-chosen sample is, on average, a fair miniature of the population, so a statistic computed from it estimates the population parameter with only quantifiable random error; that is what makes extrapolation to the population legitimate. Random assignment works because distributing subjects to treatments by chance tends to make the groups alike in every variable except the treatment, so confounding is eliminated by design and a difference in response can be pinned on the treatment. The two mechanisms are independent: making the sample representative says nothing about whether treatments were imposed fairly, and balancing treatment groups says nothing about whether the subjects resemble a wider population. That independence is exactly why the conclusions they license are independent, and why you must check for each one separately before writing a conclusion.

Writing a defensible conclusion

On the exam, a conclusion about scope should do three things: state whether the result generalizes (and to whom), state whether a causal claim is justified, and tie each judgement to the presence or absence of the relevant randomisation, in context. For a volunteer experiment with random assignment, write that the treatment caused the observed difference for these subjects, but that without random selection the finding may not extend to a broader population. For a random-sample survey, write that the association can be generalized to the sampled population, but that because treatments were not assigned, a confounding variable could be responsible, so no causal claim is warranted. Disciplining yourself to name the randomisation that does (or does not) support each half of the conclusion is what separates a full-credit answer from a vague one, and it is the reasoning every later inference unit assumes you can produce.

Determining scope of inference for two studies

Study P: a researcher takes a random sample of $400$ employees from a large company and records their commute time and job satisfaction, finding longer commutes associated with lower satisfaction. Study Q: $50$ volunteer employees are randomly assigned to a shorter or longer simulated commute for a month, and the longer-commute group reports lower satisfaction.

step 1 Identify the randomisations in Study P

Study P uses random selection (a random sample of employees) but no random assignment (it only records existing commutes), so it is observational.

step 2 State Study P's scope

Because of random selection, the association between commute length and satisfaction can be generalized to all employees at the company. Because there is no random assignment, a confounder (such as seniority or distance lived from work) could explain it, so no causal claim is allowed.

step 3 Identify the randomisations in Study Q

Study Q uses random assignment (volunteers assigned to commute lengths) but no random selection (volunteers, not a random sample), so it is an experiment on a non-random group.

step 4 State Study Q's scope and interpret

Because of random assignment, Study Q can conclude that longer commutes caused lower satisfaction, but only for these $50$ volunteers; without random selection, this may not generalize to all employees. So Study P gives a broad association and Study Q gives a narrow causal result: together they show why both randomisations are needed for a generalisable causal conclusion.

Try this

Q1. A study randomly selects subjects but does not randomly assign treatments. What kind of conclusion is justified? [2 points]

Cue. A generalisable association (random selection allows generalization), but no causation, because without random assignment a confounding variable could explain the link.

Q2. Why can a volunteer experiment with random assignment still not generalize to a population? [1 point]

Cue. It lacks random selection, so the volunteers may not represent any wider population; random assignment gives causation for the subjects but not generalization.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2020 (style)1 marksSection I (multiple choice). Volunteers are randomly assigned to two exercise programs, and one program produces significantly more weight loss. The volunteers were not randomly selected from any population. What can be concluded? (A) The program causes more weight loss, and this generalizes to all adults (B) The program causes more weight loss, for these volunteers (C) The program is only associated with weight loss, with no causal claim (D) Nothing can be concluded

Show worked answer →

The correct answer is (B).

Random assignment was used, so a causal conclusion is justified: the program caused more weight loss. But because there was no random selection from a population, the result cannot be generalized beyond these volunteers.

(A) overreaches by generalizing without random selection. (C) wrongly denies causation despite random assignment. (D) is too pessimistic; causation for these subjects is valid. Random assignment gives cause; lack of random selection limits the population.

AP 2022 (style)4 marksSection II (free response). In study X, researchers randomly select

500

adults from a city and survey their exercise and cholesterol; more exercise is associated with lower cholesterol. In study Y,

60

volunteers are randomly assigned to an exercise plan or none, and the exercise group ends with lower cholesterol. (a) State what study X can and cannot conclude. (b) State what study Y can and cannot conclude. (c) Explain why the two studies reach different kinds of conclusion, justifying in context.

Show worked answer →

A 4-point question on scope of inference.

(a) (1 point) Study X used random selection but not random assignment (observational), so it can generalize an association between exercise and cholesterol to the city's adults, but cannot conclude that exercise causes lower cholesterol (confounding possible).
(b) (1 point) Study Y used random assignment but not random selection (volunteers), so it can conclude exercise causes lower cholesterol for these volunteers, but cannot generalize to all adults.
(c) (2 points) Random selection controls generalization and random assignment controls causation (1 point); study X has the first but not the second, study Y has the second but not the first, so each supports only the conclusion its randomisation licenses (1 point, in context).

Markers reward correctly pairing random selection with generalization and random assignment with causation, and explaining the trade-off in context.

Related dot points

Sources & how we know this

AP Statistics Course and Exam Description — College Board (2020)