Skip to main content
United StatesStatisticsSyllabus dot point

How do random selection and random assignment together decide what conclusions a study can support?

Topic 3.7 Inference and Experiments: use the presence or absence of random selection and random assignment to determine the scope of inference, that is, whether results generalize to a population and whether a causal conclusion is justified.

A focused answer to AP Statistics Topic 3.7, on the scope of inference, using random selection (generalization) and random assignment (causation) to decide what conclusions are valid, with a worked four-quadrant analysis.

Generated by Claude Opus 4.810 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. The two questions, the two randomisations
  3. The four-quadrant grid
  4. Why each randomisation does its job
  5. Writing a defensible conclusion
  6. Try this

What this topic is asking

The College Board (Topic 3.7) wants you to use the presence or absence of two randomisations, random selection and random assignment, to determine the scope of inference: whether a result can be generalized to a population, and whether a causal conclusion is justified.

The two questions, the two randomisations

These are genuinely separate. A study can have one, both, or neither. Confusing them is the most common error in the topic, so it is worth fixing the pairing firmly: selection to generalize, assignment to cause. Selection is about who is in the study; assignment is about what is done to them.

The four-quadrant grid

Most real experiments use volunteers (random assignment, no random selection), so they support causation for those volunteers but not automatic generalization. Most surveys use random selection but are observational (no random assignment), so they support generalisable associations but not causation. Recognizing which quadrant a study falls in, then stating exactly the conclusion that quadrant allows, is the core skill Topic 3.7 assesses.

Why each randomisation does its job

Random selection works because a chance-chosen sample is, on average, a fair miniature of the population, so a statistic computed from it estimates the population parameter with only quantifiable random error; that is what makes extrapolation to the population legitimate. Random assignment works because distributing subjects to treatments by chance tends to make the groups alike in every variable except the treatment, so confounding is eliminated by design and a difference in response can be pinned on the treatment. The two mechanisms are independent: making the sample representative says nothing about whether treatments were imposed fairly, and balancing treatment groups says nothing about whether the subjects resemble a wider population. That independence is exactly why the conclusions they license are independent, and why you must check for each one separately before writing a conclusion.

Writing a defensible conclusion

On the exam, a conclusion about scope should do three things: state whether the result generalizes (and to whom), state whether a causal claim is justified, and tie each judgement to the presence or absence of the relevant randomisation, in context. For a volunteer experiment with random assignment, write that the treatment caused the observed difference for these subjects, but that without random selection the finding may not extend to a broader population. For a random-sample survey, write that the association can be generalized to the sampled population, but that because treatments were not assigned, a confounding variable could be responsible, so no causal claim is warranted. Disciplining yourself to name the randomisation that does (or does not) support each half of the conclusion is what separates a full-credit answer from a vague one, and it is the reasoning every later inference unit assumes you can produce.

Try this

Q1. A study randomly selects subjects but does not randomly assign treatments. What kind of conclusion is justified? [2 points]

  • Cue. A generalisable association (random selection allows generalization), but no causation, because without random assignment a confounding variable could explain the link.

Q2. Why can a volunteer experiment with random assignment still not generalize to a population? [1 point]

  • Cue. It lacks random selection, so the volunteers may not represent any wider population; random assignment gives causation for the subjects but not generalization.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2020 (style)1 marksSection I (multiple choice). Volunteers are randomly assigned to two exercise programs, and one program produces significantly more weight loss. The volunteers were not randomly selected from any population. What can be concluded? (A) The program causes more weight loss, and this generalizes to all adults (B) The program causes more weight loss, for these volunteers (C) The program is only associated with weight loss, with no causal claim (D) Nothing can be concluded
Show worked answer →

The correct answer is (B).

Random assignment was used, so a causal conclusion is justified: the program caused more weight loss. But because there was no random selection from a population, the result cannot be generalized beyond these volunteers.

(A) overreaches by generalizing without random selection. (C) wrongly denies causation despite random assignment. (D) is too pessimistic; causation for these subjects is valid. Random assignment gives cause; lack of random selection limits the population.

AP 2022 (style)4 marksSection II (free response). In study X, researchers randomly select 500500 adults from a city and survey their exercise and cholesterol; more exercise is associated with lower cholesterol. In study Y, 6060 volunteers are randomly assigned to an exercise plan or none, and the exercise group ends with lower cholesterol. (a) State what study X can and cannot conclude. (b) State what study Y can and cannot conclude. (c) Explain why the two studies reach different kinds of conclusion, justifying in context.
Show worked answer →

A 4-point question on scope of inference.

(a) (1 point) Study X used random selection but not random assignment (observational), so it can generalize an association between exercise and cholesterol to the city's adults, but cannot conclude that exercise causes lower cholesterol (confounding possible).
(b) (1 point) Study Y used random assignment but not random selection (volunteers), so it can conclude exercise causes lower cholesterol for these volunteers, but cannot generalize to all adults.
(c) (2 points) Random selection controls generalization and random assignment controls causation (1 point); study X has the first but not the second, study Y has the second but not the first, so each supports only the conclusion its randomisation licenses (1 point, in context).

Markers reward correctly pairing random selection with generalization and random assignment with causation, and explaining the trade-off in context.

Related dot points

Sources & how we know this