United StatesComputer Science PrinciplesSyllabus dot point

How does bias get into computing systems, and how can it be reduced?

Topic 5.3 Computing Bias: computing innovations can reflect existing human biases through biased data or design choices, and bias can be embedded intentionally or unintentionally.

A focused answer to AP CSP Topic 5.3, covering how bias enters computing systems through biased data and design, intentional versus unintentional bias, real effects on people, why biased data produces biased outputs, and how bias can be identified and reduced.

Generated by Claude Opus 4.89 min answerUpdated 2026-06-04

Reviewed by: AI editorial process; not yet individually human-reviewed

Have a quick question? Jump to the Q&A page

Jump to a section

What this topic is asking
What computing bias is
How bias enters: data and design
Intentional versus unintentional
Effects and mitigation
Try this

What this topic is asking

The College Board (Topic 5.3) wants you to understand computing bias: how computing systems can reflect and amplify human biases. Bias enters through biased data and through design choices, and it can be embedded intentionally or unintentionally. You need to explain why unrepresentative data produces biased outputs, give examples of the real harm bias causes, and describe how bias can be identified and reduced.

What computing bias is

How bias enters: data and design

A facial-recognition system trained mostly on one group's images works poorly for others: the data was unrepresentative, so the outputs are biased.

Intentional versus unintentional

Bias can be deliberately built in, but the CED stresses that it is most often unintentional. Developers with no intent to discriminate can still create biased systems by using data that carries historical inequalities. Because it is often invisible to the creators, bias must be actively looked for, not assumed absent.

Effects and mitigation

Biased systems make real decisions about people, in hiring, lending, policing and recognition, so bias can cause serious, scaled harm. To reduce it, developers can:

Use more representative and diverse data.
Test the system across different groups to detect unequal performance.
Review design choices and involve diverse perspectives (linking back to collaboration).

Identifying and reducing bias in a system

A company builds a tool that screens job applicants by learning from its past hiring decisions. Analyze the bias risk.

step 1 Identify the data source

The tool learns from past hiring decisions. If past hiring favored certain groups, that history is the training data.

step 2 Trace how bias is reproduced

Because the system learns patterns from biased history, it will reproduce the same favoritism, screening out qualified applicants from under-represented groups, even though no one programmed it to discriminate. This is unintentional computing bias from biased data.

step 3 Note the real harm

The biased tool makes real decisions at scale, denying opportunities unfairly and entrenching existing inequality.

step 4 Propose mitigations

Use more representative data, test the tool's outcomes across different groups to detect unequal results, remove inputs that act as proxies for protected characteristics, and review the design with diverse reviewers.

step 5 State the lesson

A system is only as fair as its data and design. Bias must be actively identified and reduced; assuming a system is neutral because it is automated is a mistake the exam targets.

Try this

Q1. How can unrepresentative training data cause computing bias? [2 points]

Cue. A system learns patterns from its data; if the data over-represents some groups, the system performs better for them and worse for others, producing unfair, biased outputs.

Q2. Suggest one way developers can reduce bias in a computing system. [1 point]

Cue. Use more representative and diverse data, test the system across different groups, or review design choices for fairness (any one).

Exam-style practice questions

Practice questions written in the style of College Board exam questions on this dot point, with worked answer explainers. The year tag is the paper they imitate, not the source.

AP 2022 (style)1 marksMultiple choice. A facial recognition system performs much worse for some groups of people than others because it was trained mostly on images of one group. This is an example of: (A) A hardware fault. (B) Computing bias caused by unrepresentative training data. (C) The digital divide. (D) A network latency problem.

Show worked answer →

The answer is (B).

The system reflects computing bias introduced by biased (unrepresentative) data: trained mostly on one group, it works poorly for others. (A) is not a hardware fault; the system works as built but unfairly. (C) the digital divide is about access, not biased outputs. (D) is unrelated to fairness.

Markers reward identifying that unrepresentative training data embeds bias into a system's outputs.

AP 2021 (style)2 marksFree response (short). Explain how bias can be unintentionally introduced into a computing system, and suggest one way developers can reduce it.

Show worked answer →

A 2-point question on the source and mitigation of bias.

Point 1 (source): Bias is often introduced unintentionally through the data used to build or train a system. If the data over-represents some groups or reflects existing human prejudices, the system learns and reproduces those patterns even though no one set out to discriminate.

Point 2 (mitigation): Developers can reduce bias by using more representative, diverse data, testing the system across different groups, and reviewing design choices for fairness. Any valid source-and-mitigation pair earns the marks.

Related dot points

Sources & how we know this

AP Computer Science Principles Course and Exam Description — College Board (2025)