Skip to main content
United StatesPsychologySyllabus dot point

How do consequences shape voluntary behavior, and how do reinforcement and punishment work?

Topic 3.8 Operant Conditioning: explain operant conditioning, including positive and negative reinforcement and punishment, primary and secondary reinforcers, shaping, and the schedules of reinforcement.

A focused answer to AP Psychology Topic 3.8, covering Thorndike's law of effect and Skinner's operant conditioning, positive and negative reinforcement, positive and negative punishment, primary and secondary reinforcers, shaping, and the four schedules of reinforcement and their response patterns.

Generated by Claude Opus 4.812 min answer

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section
  1. What this topic is asking
  2. The foundations
  3. The four-cell grid
  4. Reinforcers and shaping
  5. Schedules of reinforcement
  6. Try this

What this topic is asking

Topic 3.8 covers operant conditioning, learning from consequences. The College Board wants you to distinguish positive and negative reinforcement, positive and negative punishment, primary and secondary reinforcers, the technique of shaping, and the four schedules of reinforcement with their characteristic response patterns.

The foundations

The four-cell grid

The single most tested distinction is positive/negative crossed with reinforcement/punishment. Positive = add a stimulus; negative = remove a stimulus.

  • Positive reinforcement: add a pleasant stimulus to increase behavior (give praise).
  • Negative reinforcement: remove an aversive stimulus to increase behavior (a seatbelt alarm stops when you buckle).
  • Positive punishment: add an aversive stimulus to decrease behavior (assign extra chores).
  • Negative punishment: remove a pleasant stimulus to decrease behavior (take away phone privileges).

Reinforcers and shaping

  • Primary reinforcer: innately satisfying, meeting a biological need (food, water).
  • Secondary (conditioned) reinforcer: gains reinforcing power through association with primary reinforcers (money, tokens, praise).
  • Shaping: reinforcing successive approximations, ever closer steps toward a target behavior, used to build complex behaviors gradually.

Schedules of reinforcement

When reinforcement is partial, the schedule determines the response pattern:

  • Fixed-ratio: reinforce after a set number of responses (every 5th); high rate with brief post-reinforcement pauses.
  • Variable-ratio: reinforce after an unpredictable number of responses (slot machines); the highest, most persistent responding.
  • Fixed-interval: reinforce the first response after a set time; a "scalloped" pattern that speeds up near the deadline.
  • Variable-interval: reinforce the first response after varying times; slow, steady responding.

The reason this topic generates so many questions is that it gives the exam a clean four-way labeling task plus a schedule-identification task, both of which reward precision over intuition. The reliable method is to ask two questions in order: is the behavior going up or down (reinforcement versus punishment), and is something being added or taken away (positive versus negative). Then for schedules, ask whether reinforcement depends on a number of responses (ratio) or the passage of time (interval), and whether it is predictable (fixed) or not (variable). Variable-ratio is worth memorizing as the schedule behind gambling and the hardest to extinguish. This vocabulary returns in Unit 5, where behavior therapies like token economies are built directly on operant principles.

Try this

Q1. Distinguish negative reinforcement from positive punishment with examples. [2 points]

  • Cue. Negative reinforcement removes an aversive stimulus to increase behavior (alarm stops when you buckle); positive punishment adds an aversive stimulus to decrease behavior (extra chores).

Q2. Name the schedule that produces the highest, most persistent responding. [1 point]

  • Cue. The variable-ratio schedule (as in gambling).

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2024 (style)1 marksMultiple choice. A worker is paid after every fixed number of items produced, and output rises sharply just before each payment. Which schedule of reinforcement is in use? (A) Fixed-interval (B) Variable-interval (C) Fixed-ratio (D) Variable-ratio (E) Continuous
Show worked answer →

The answer is (C) Fixed-ratio.

A fixed-ratio schedule delivers reinforcement after a set number of responses (every nth item). It produces high rates of responding with a brief pause after each reinforcement, fitting pay per fixed quantity of output.

(A) fixed-interval reinforces the first response after a set time, producing a scalloped pattern. (B) variable-interval reinforces after varying time periods. (D) variable-ratio reinforces after a varying number of responses (like a slot machine), producing the highest, steadiest rates. (E) continuous reinforces every response.

AP 2023 (style)5 marksConcept-application free-response question. A teacher uses consequences to manage a classroom. Explain how EACH of the following could be applied: positive reinforcement, negative reinforcement, positive punishment, shaping, and a variable-ratio schedule.
Show worked answer →

A 5-point concept-application FRQ; one point per term.

Positive reinforcement (1): adding a desirable stimulus (such as praise) after a behavior to increase it.
Negative reinforcement (1): removing an aversive stimulus (such as cancelling a quiz) to increase a behavior.
Positive punishment (1): adding an aversive stimulus (such as extra work) after a behavior to decrease it.
Shaping (1): reinforcing successive approximations toward a target behavior so it is gradually built up.
Variable-ratio schedule (1): reinforcing after an unpredictable number of responses, producing high, persistent responding.

Markers reward each term being correctly defined AND tied to the classroom.

Related dot points

Sources & how we know this