How do we decide between an observational study and an experiment when planning to answer a question?
Topic 3.2 Introduction to Planning a Study: distinguish observational studies from experiments, identify explanatory and response variables, and recognize that only an experiment with imposed treatments can support a causal conclusion.
A focused answer to AP Statistics Topic 3.2, distinguishing observational studies from experiments, identifying explanatory and response variables and confounding, and explaining why imposing treatments is what enables causal claims.
Reviewed by: AI editorial process; not yet individually human-reviewed
Have a quick question? Jump to the Q&A page
Jump to a section
What this topic is asking
The College Board (Topic 3.2) wants you to distinguish an observational study from an experiment, identify the explanatory and response variables, and understand why only an experiment that imposes treatments can support a cause-and-effect conclusion.
Observational study versus experiment
The single test you apply is: did the researchers do something to the subjects, or did they only watch and record? Assigning people to a diet, a drug, or a teaching method is an experiment; recording the diet, drug, or method people already chose is an observational study. Surveys are observational. This one distinction governs the entire scope of what the study can conclude.
Explanatory and response variables
Identifying these correctly orients the whole analysis: the explanatory variable goes on the horizontal axis of a scatterplot, defines the groups being compared, and is the variable a causal claim would be about. In an experiment, the levels of the explanatory variable are the treatments.
Why only experiments prove causation
The reason an observational study cannot establish causation is confounding. Two variables are confounded when their effects on the response cannot be separated. In the coffee-and-blood-pressure example, people who drink a lot of coffee may also be more stressed or sleep less, and those factors raise blood pressure too, so the observed link between coffee and blood pressure might really be a link between stress and blood pressure. Because the researchers did not control who drank coffee, the groups differ in many ways at once, and any of those differences could explain the result. An experiment removes this problem by imposing the treatment and (in Topic 3.5) randomly assigning subjects to treatment groups, which tends to balance every other variable, measured or not, across the groups. Then a difference in the response can be attributed to the treatment. This is why the College Board repeats the mantra: association from an observational study, causation only from a well-designed experiment.
Planning the right study for the question
Choosing a study type is a trade-off the exam expects you to reason about. Experiments give causal answers but are not always possible or ethical: you cannot randomly assign people to smoke for twenty years, so the link between smoking and cancer rests on observational evidence and biological mechanism, not a human experiment. Observational studies are often the only feasible route for questions about harmful exposures or naturally occurring groups, and they are perfectly good for describing associations and generating hypotheses, as long as you do not overclaim causation. So the planning question is not "which study type is better?" but "which type can answer my question, and what will I be allowed to conclude?" If the goal is to establish that a treatment works, an experiment is needed; if the goal is to describe a relationship in a population you cannot manipulate, an observational study is appropriate, with the confounding caveat stated plainly.
Try this
Q1. State the one feature that distinguishes an experiment from an observational study. [1 point]
- Cue. An experiment imposes a treatment on subjects; an observational study only measures or records, without intervening.
Q2. A study links eating breakfast to better grades from survey data. Why can it not conclude breakfast causes better grades? [2 points]
- Cue. It is observational, so a confounding variable (such as a stable home life) could be linked to both breakfast and grades; the link may reflect the confounder, not breakfast.
Exam-style practice questions
Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.
AP 2018 (style)1 marksSection I (multiple choice). Researchers record the diets people already follow and track their health for ten years. This is best described as (A) an experiment with random assignment (B) an observational study (C) a census (D) a simple random sample of treatmentsShow worked answer →
The correct answer is (B).
The researchers only observe and record the diets people chose for themselves; they do not assign anyone to a diet. Because no treatment is imposed, this is an observational study, not an experiment.
(A) requires the researchers to assign diets, which they do not. (C) a census measures the whole population, not the point here. (D) is not a meaningful description. Observing existing behavior, with no imposed treatment, defines an observational study.
AP 2021 (style)4 marksSection II (free response). A study finds that people who drink more coffee tend to have higher blood pressure. The data come from a survey of adults' coffee habits and blood pressure. (a) Identify the explanatory and response variables. (b) Explain why this study cannot conclude that coffee causes higher blood pressure, naming the relevant problem. (c) Describe, in general terms, the type of study that could establish a causal link, and justify why in context.Show worked answer →
A 4-point question on study type and confounding.
(a) (1 point) Explanatory variable: amount of coffee consumed; response variable: blood pressure.
(b) (2 points) This is an observational study (1 point); a confounding variable, such as stress or smoking, may be associated with both coffee drinking and blood pressure, so the link could be due to the confounder rather than coffee (1 point, in context).
(c) (1 point) A randomised experiment that assigns adults to different coffee amounts would, through random assignment, balance confounders across groups, allowing a cause-and-effect conclusion about coffee and blood pressure.
Markers reward correct identification of the variables, naming confounding in an observational study, and recognizing that a randomised experiment is what licenses a causal claim.
Related dot points
- Topic 3.1 Introducing Statistics: Do the Data We Collected Tell the Truth? Recognize that the method of data collection determines the kinds of conclusions that can be drawn, and that poorly collected data cannot be fixed by analysis.
A focused answer to AP Statistics Topic 3.1, on why the data-collection method determines what conclusions are valid, the difference between random error and bias, and why analysis cannot rescue badly collected data.
- Topic 3.5 Introduction to Experimental Design: identify the components of an experiment (units, treatments, response) and apply the principles of comparison, random assignment, replication, and control, including blinding and the placebo effect.
A focused answer to AP Statistics Topic 3.5, on experimental units, treatments and factors, and the principles of comparison, random assignment, replication, and control, plus blinding and the placebo effect.
- Topic 3.6 Selecting an Experimental Design: compare completely randomised, randomised block, and matched pairs designs, and explain how blocking and pairing control a known source of variation to make treatment effects clearer.
A focused answer to AP Statistics Topic 3.6, comparing completely randomised, randomised block, and matched pairs designs, and explaining how blocking and pairing remove a known source of variation to sharpen the comparison.
- Topic 3.7 Inference and Experiments: use the presence or absence of random selection and random assignment to determine the scope of inference, that is, whether results generalize to a population and whether a causal conclusion is justified.
A focused answer to AP Statistics Topic 3.7, on the scope of inference, using random selection (generalization) and random assignment (causation) to decide what conclusions are valid, with a worked four-quadrant analysis.
- Topic 2.1 Introducing Statistics - Are Variables Related?: identify questions about the association between two variables, distinguish association from causation, and recognize what two-variable data can answer.
A focused answer to AP Statistics Topic 2.1, on framing questions about the association between two variables, the difference between explanatory and response variables, why association is not causation, and what two-variable data can answer, with worked examples.
Sources & how we know this
- AP Statistics Course and Exam Description — College Board (2020)