Skip to main content
United StatesStatisticsSyllabus dot point

How does a probability distribution describe all the possible values of a numerical random outcome?

Topic 4.7 Introduction to Random Variables and Probability Distributions: define discrete random variables, represent and interpret their probability distributions, and use them to find probabilities of events.

A focused answer to AP Statistics Topic 4.7, defining discrete random variables, the requirements of a valid probability distribution, cumulative probabilities, and interpreting distributions in context, with worked probability calculations.

Generated by Claude Opus 4.89 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. Random variables and their distributions
  3. What makes a distribution valid
  4. Finding probabilities of events
  5. Interpreting in context
  6. Try this

What this topic is asking

The College Board (Topic 4.7) wants you to define a discrete random variable, represent and interpret its probability distribution, and use the distribution to find probabilities of events, including cumulative ("at least," "at most," "between") probabilities.

Random variables and their distributions

The shift from Topics 4.3 to 4.6 is that we now attach numbers to outcomes and study the whole pattern of those numbers. Instead of asking "what is the probability of this event?", we describe all the values a numerical outcome can take and how likely each is, the same descriptive move as Unit 1 (shape, center, spread), but now for a theoretical distribution rather than a dataset.

What makes a distribution valid

These two conditions are exam staples. Given all but one probability, the missing one is whatever makes the total 11. Given a "distribution" whose probabilities sum to more or less than 11, or include a negative value, you declare it invalid. This validity check is the first thing to do whenever a distribution is presented.

Finding probabilities of events

Once you have a valid distribution, the probability of any event is just the sum of the probabilities of the values that make up the event. The skill is translating the words into the right set of values. "XX is at least 22" means X{2,3,4,}X \in \{2, 3, 4, \dots\}, so add P(2)+P(3)+P(2) + P(3) + \cdots. "XX is at most 11" means {0,1}\{0, 1\}. "XX is between 11 and 33 inclusive" means {1,2,3}\{1, 2, 3\}. The boundary words matter: "more than 22" excludes 22 ({3,4,}\{3, 4, \dots\}) while "at least 22" includes it. A cumulative probability such as P(Xk)P(X \le k) adds all values up to kk, and the complement is often handy, P(X2)=1P(X1)P(X \ge 2) = 1 - P(X \le 1), when the "tail" you want has more values than its opposite. Reading the inequality carefully and then summing the matching entries is the entire computational content of the topic, but it underlies everything that follows, because the mean, standard deviation, and the binomial and geometric distributions are all built on a probability distribution like this one.

Interpreting in context

As everywhere in AP Statistics, a probability is only fully answered when interpreted in context. P(X2)=0.6P(X \ge 2) = 0.6 should be read as "there is a 60%60\% chance the dealership sells at least two cars on a given day," not left as a bare 0.60.6. This habit matters more than it looks, because free-response scoring repeatedly awards a separate point for the contextual interpretation, and because it forces you to keep track of what the random variable actually counts or measures. A probability distribution is a model of a real quantity, daily car sales, number of defective items, points scored, and the point of computing its probabilities is to say something meaningful about that quantity. Pairing every numerical answer with a sentence in the situation's own terms is the discipline that turns a correct calculation into a complete response, and it carries directly into the expected-value and distribution topics next.

Try this

Q1. A distribution has P(1)=0.3P(1) = 0.3, P(2)=0.45P(2) = 0.45, P(3)=kP(3) = k. Find kk. [1 point]

  • Cue. Sum to one: k=1(0.3+0.45)=0.25k = 1 - (0.3 + 0.45) = 0.25.

Q2. For that distribution, find P(X>1)P(X > 1) and interpret it. [2 points]

  • Cue. P(X>1)=P(2)+P(3)=0.45+0.25=0.70P(X > 1) = P(2) + P(3) = 0.45 + 0.25 = 0.70; there is a 70%70\% chance XX exceeds 11.

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2019 (style)1 marksSection I (multiple choice). A discrete random variable XX has P(X=0)=0.2P(X=0) = 0.2, P(X=1)=0.5P(X=1) = 0.5, and P(X=2)=kP(X=2) = k. What is kk? (A) 0.10.1 (B) 0.20.2 (C) 0.30.3 (D) 0.70.7
Show worked answer →

The correct answer is (C).

The probabilities of a discrete distribution must sum to 11: 0.2+0.5+k=10.2 + 0.5 + k = 1, so k=10.7=0.3k = 1 - 0.7 = 0.3.

(A) and (B) make the total less than 11; (D) makes it exceed 11. Only k=0.3k = 0.3 gives a valid distribution summing to 11.

AP 2021 (style)4 marksSection II (free response). The number of cars XX a dealership sells in a day has distribution P(X=0)=0.1P(X=0)=0.1, P(X=1)=0.3P(X=1)=0.3, P(X=2)=0.35P(X=2)=0.35, P(X=3)=0.15P(X=3)=0.15, P(X=4)=0.1P(X=4)=0.1. (a) Verify this is a valid probability distribution. (b) Find P(X2)P(X \ge 2). (c) Find P(1X3)P(1 \le X \le 3) and interpret it in context.
Show worked answer →

A 4-point question on probability distributions.

(a) (1 point) All probabilities are between 00 and 11, and they sum to 0.1+0.3+0.35+0.15+0.1=10.1+0.3+0.35+0.15+0.1 = 1, so it is a valid distribution.
(b) (1 point) P(X2)=P(2)+P(3)+P(4)=0.35+0.15+0.1=0.6P(X \ge 2) = P(2) + P(3) + P(4) = 0.35 + 0.15 + 0.1 = 0.6.
(c) (2 points) P(1X3)=P(1)+P(2)+P(3)=0.3+0.35+0.15=0.8P(1 \le X \le 3) = P(1) + P(2) + P(3) = 0.3 + 0.35 + 0.15 = 0.8 (1 point); interpret: on about 80%80\% of days the dealership sells between 11 and 33 cars inclusive (1 point, in context).

Markers reward verifying validity (range and sum), correctly summing the relevant probabilities, and a contextual interpretation.

Related dot points

Sources & how we know this