1.6 — Ethics, Bias & Responsible AI

Learning Objectives

Identify sources of bias in AI training data and explain how they propagate to model outputs
Articulate the core principles of responsible AI: fairness, accountability, transparency
Recognize the key AI governance frameworks: EU AI Act, NIST AI RMF, ISO/IEC 42001

Where Bias Comes From

AI systems learn from data — and data reflects the world as it is, including its historical inequalities, underrepresentation, and measurement limitations. Bias enters AI in several ways:

Representation bias: The training data under-represents certain groups. A medical AI trained primarily on data from one demographic will perform less accurately on others.

Measurement bias: The way data is collected or labeled introduces systematic error. If loan default is used as a proxy for creditworthiness, and default rates are affected by discriminatory lending practices, the resulting model inherits those biases.

Aggregation bias: A single model is applied uniformly to subgroups that have different underlying patterns. A one-size-fits-all diabetes prediction model may perform well on average but poorly for specific populations.

Evaluation bias: The benchmarks used to assess AI performance don't reflect real-world diversity. A model that scores highly on benchmarks derived from predominantly English text may perform poorly in other languages.

⚠️Warning

Compounding harms: Bias in AI systems deployed at scale causes outsized harm because decisions are made automatically, at speed, across millions of people. A hiring algorithm that subtly deprioritizes women's resumes affects far more people than a single biased hiring manager.

Responsible AI Principles

The AI industry and government regulators have converged on a set of core principles for responsible AI development and deployment:

Principle	What It Means
Fairness	Systems should not discriminate or produce disparate outcomes across demographic groups
Accountability	Clear ownership for AI decisions; humans remain responsible for AI-driven outcomes
Transparency	Users should know when AI is making decisions that affect them; explainability for high-stakes domains
Privacy	Data minimization; consent; protection of sensitive information
Safety	Systems should not cause harm; guardrails against misuse
Human oversight	Meaningful human review for high-stakes decisions (hiring, lending, healthcare, law enforcement)

Key Governance Frameworks

EU AI Act (2024)

The world's first comprehensive AI regulation, passed by the European Union. It takes a risk-based approach:

Prohibited AI: Social scoring, real-time biometric surveillance in public spaces, manipulative AI
High-risk AI: Must meet transparency, data governance, and human oversight requirements (e.g., AI in hiring, credit, healthcare, law enforcement)
General-purpose AI models: Large models (above certain compute thresholds) face transparency and copyright compliance requirements

NIST AI Risk Management Framework (AI RMF)

Developed by the US National Institute of Standards and Technology — a voluntary framework for organizations to identify, assess, and manage AI risks. Organized around four functions: Govern, Map, Measure, Manage.

ISO/IEC 42001

The first international management system standard for AI, providing organizations with a framework to demonstrate responsible AI practices — similar to how ISO 27001 works for information security.

Case Studies in AI Failure

COMPAS recidivism scoring: A risk assessment tool used in US criminal sentencing was found to incorrectly flag Black defendants as higher risk at nearly twice the rate of white defendants with similar histories.

Amazon's hiring tool: Amazon built a resume-screening AI trained on historical hiring decisions — which had been predominantly male. The model learned to penalize resumes that included the word "women's" (as in "women's chess club") and downgraded graduates of all-women's colleges.

Medical AI and skin conditions: Several dermatology AI models trained predominantly on lighter skin tones performed significantly worse at detecting conditions on darker skin tones.

✅Tip

If you are building or evaluating AI systems, ask these questions early: What is the training data? Who is represented and who is missing? What are the failure modes? Who is harmed if the system is wrong? What is the appeals process?

Key Takeaways

Bias in AI comes from bias in data — representation, measurement, aggregation, and evaluation bias
Responsible AI is built on fairness, accountability, transparency, privacy, safety, and human oversight
The EU AI Act is the most comprehensive AI regulation globally; NIST AI RMF provides a voluntary US framework
Real-world AI failures have caused measurable harm to individuals and groups — the stakes of getting this right are high

Ethics, Bias & Responsible AI

Audio & video lessons are paid features