YouTube

Video explainers and talks on AI safety and alignment—whole channels devoted to the topic, plus standout individual videos.

Robert Miles AI Safety

Robert Miles

The single most popular AI alignment video series, explaining technical safety concepts like the orthogonality thesis, instrumental convergence, inner misalignment, and reward hacking in clear, rigorous terms.

Beginner · 2017

Rational Animations

Animated explainers on rationality and AI safety, adapting foundational alignment writing into accessible short films on existential risk, scalable oversight, and why aligning advanced AI is hard.

Beginner · 2020

AI In Context

80,000 Hours

80,000 Hours' YouTube channel hosted by Aric Floyd, mixing long and short videos on the risks of transformative AI—including a deep dive on the AI 2027 scenario—and what people can do about them.

Beginner · 2025

Will Superintelligent AI End the World? | Eliezer Yudkowsky | TED

Eliezer Yudkowsky

Yudkowsky's fiery TED talk arguing that smarter-than-human AI could kill us all and calling for an immediate worldwide moratorium on developing generalist frontier AI.

Beginner · 2023

3 Principles for Creating Safer AI | Stuart Russell | TED

Stuart Russell

Russell proposes three principles for safer machines: their only goal is to realize human values, they start out uncertain about what those values are, and they learn them by observing human behavior. This is the core of his human-compatible approach to alignment.

Beginner · 2017

Can We Build AI Without Losing Control Over It? | Sam Harris | TED

Sam Harris

Harris argues we will inevitably build superintelligent machines yet have barely grappled with the control problem, making a visceral case for taking AI risk seriously now.

Beginner · 2016

What Happens When Our Computers Get Smarter Than We Are? | Nick Bostrom | TED

Nick Bostrom

Bostrom frames machine superintelligence as the last invention humanity need ever make and explains why getting its goals right is a civilization-critical challenge.

Beginner · 2015

Slaughterbots

Future of Life Institute

A dramatized near-future short film from FLI and Stuart Russell depicting swarms of autonomous facial-recognition microdrones used as weapons, made to warn against lethal autonomous weapons.

Beginner · 2017

Humans Need Not Apply

CGP Grey

A widely viewed essay on how automation and AI will displace human labor across nearly every sector, reframing the economic disruption question for a mass audience.

Beginner · 2014

A.I. ‐ Humanity's Final Invention?

Kurzgesagt – In a Nutshell

Kurzgesagt's animated explainer on artificial superintelligence: how an AGI that improves itself in a feedback loop could rapidly surpass humans and why that makes alignment our most consequential problem.

Beginner · 2024

The Rise of the Machines – Why Automation is Different this Time

Kurzgesagt – In a Nutshell

Kurzgesagt argues that information-age automation differs fundamentally from past waves, with machine learning encroaching on cognitive work and reshaping the future of employment.

Beginner · 2017

Deadly Truth of General AI? – Computerphile

Robert Miles

Rob Miles uses the 'deadly stamp collector' thought experiment to show why a general AI pursuing a simple objective could be catastrophic if its goals aren't aligned with ours.

Beginner · 2015

AI "Stop Button" Problem – Computerphile

Robert Miles

Rob Miles explains why simply adding an off-switch to a capable AI is far harder than it sounds, illustrating corrigibility and the incentives an agent has to resist being stopped.

Beginner · 2017

The Artificial Intelligence That Deleted A Century

Tom Scott

A short speculative fiction about a narrow copyright-enforcement AI that, left unchecked, destroys a century of culture—an accessible parable of specification gaming and unintended consequences.

Beginner · 2020

The danger of AI is weirder than you think | Janelle Shane | TED

Janelle Shane

Shane uses funny real-world ML failures to show the core risk isn't AI rebelling but doing exactly what we literally asked—making misspecified objectives vivid for a general audience.

Beginner · 2019

Artificial Intelligence: Last Week Tonight with John Oliver

John Oliver

A mainstream comedic explainer covering how modern AI works, its bias and reliability problems, and the 'black box' challenge of systems we deploy without understanding them.

Beginner · 2023

"Godfather of AI" Geoffrey Hinton: The 60 Minutes Interview

60 Minutes / CBS

Deep-learning pioneer Geoffrey Hinton explains why, after leaving Google, he warns that there is no guaranteed path to safety as AI systems approach and exceed human capability.

Beginner · 2023

The A.I. Dilemma

Tristan Harris & Aza Raskin

The Center for Humane Technology co-founders argue that racing to deploy AI without safety guardrails already threatens society, drawing parallels to the social-media harms they earlier warned about.

Beginner · 2023

AI Deception: How Tech Companies Are Fooling Us

ColdFusion

ColdFusion traces the history of 'AI washing' and deceptive demos, examining how hype distorts public understanding of what AI systems can actually do and why honest evaluation matters.

Beginner · 2024

The Urgent Risks of Runaway AI — and What to Do about Them | Gary Marcus | TED

Gary Marcus

Marcus warns that unreliable, fast-deployed AI threatens truth and democracy through mass misinformation, and calls for a global, neutral governance body to oversee the technology.

Beginner · 2023

How to Keep AI Under Control | Max Tegmark | TED

Max Tegmark

Tegmark argues that today's commercial AI boom is likely to be followed by superintelligence, and sketches an optimistic technical vision—including provably safe systems—for keeping it under human control.

Beginner · 2023

What Is an AI Anyway? | Mustafa Suleyman | TED

Mustafa Suleyman

A leading model-builder reframes AI as 'a new digital species,' arguing this lens clarifies both the stakes and the responsibility we have to contain and steer increasingly capable systems.

Beginner · 2024

Why AI Is Incredibly Smart and Shockingly Stupid | Yejin Choi | TED

Yejin Choi

Choi demystifies large language models by showing where they fail at basic reasoning and common sense, and argues for smaller systems trained on human norms and values.

Beginner · 2023

AI Is Becoming Dangerous. Are We Ready?

Sabine Hossenfelder

Hossenfelder examines the real near-term risks of agentic AI—prompt injection, deception, and models resisting shutdown—as autonomous agents ship with serious unsolved problems.

Beginner · 2025

[1hr Talk] Intro to Large Language Models

Andrej Karpathy

A widely praised technical primer on how LLMs work, ending with a clear tour of the security challenges—jailbreaks, prompt injection, and data poisoning—that make these systems hard to secure.

Intermediate · 2023

How Not to Destroy the World with AI

Stuart Russell

The Royal Institution lecture in which Russell lays out why the standard model of AI—optimizing fixed objectives—is dangerous, and how building machines uncertain about human preferences could keep them controllable.

Intermediate · 2023

Munk Debate on Artificial Intelligence | Bengio & Tegmark vs. Mitchell & LeCun

Munk Debates

A structured debate on whether AI poses an existential threat, with Yoshua Bengio and Max Tegmark arguing for the resolution and Melanie Mitchell and Yann LeCun arguing against it: an unusually direct airing of the core cruxes.

Intermediate · 2023

Will Artificial Intelligence Save Us or Kill Us?

DW Documentary

A documentary weighing AI's promise against its dangers, from automation and aging societies to the warnings of researchers who fear losing control of increasingly capable systems.

Beginner · 2024

Are We All Wrong About AI?

ColdFusion

ColdFusion examines competing narratives about AI progress—hype versus genuine capability—helping viewers calibrate how seriously to take both the promises and the risks.

Beginner · 2024

The Catastrophic Risks of AI — and a Safer Path | Yoshua Bengio | TED

Yoshua Bengio

A Turing Award-winning 'godfather of AI' warns that frontier models already show deception and self-preservation, and lays out a plan for building non-agentic 'scientist AI' that stays safe.

Beginner · 2025

Eliezer Yudkowsky: Dangers of AI and the End of Human Civilization | Lex Fridman Podcast #368

Lex Fridman

A long-form conversation in which Yudkowsky makes his case that humanity is unprepared for superintelligence, probing why alignment is so hard and why he expects catastrophe by default.

Intermediate · 2023

AI and the Future of Humanity | Yuval Noah Harari at the Frontiers Forum

Yuval Noah Harari

Harari argues AI is the first technology that can make decisions and create ideas by itself, and warns that mastering language lets it hack the operating system of human civilization.

Beginner · 2023

Can We Actually Control Superintelligent AI? | Ada, Ep. 4 | TED-Ed

Elizabeth Cox

An animated explainer on the control problem—why a superintelligent system pursuing a misspecified goal could resist correction—featuring Stuart Russell's case for rules against unsafe AI.

Beginner · 2024

Scaling Interpretability

Anthropic

Anthropic researchers explain mechanistic interpretability—reading the millions of concepts represented inside a production model like Claude—as a path to understanding and steering AI behavior.

Intermediate · 2024

A.I. Expert Answers A.I. Questions From Twitter | Tech Support | WIRED

Gary Marcus

AI researcher Gary Marcus fields the internet's questions about what AI can and can't do, cutting through hype to explain reliability, limits, and where the real risks lie.

Beginner · 2023

How to Legislate AI

Johnny Harris

Harris examines why people are scared of AI and how governments might regulate it, covering risks to critical infrastructure, military uses, and the difficulty of overseeing systems we don't understand.

Beginner · 2023

OpenAI CEO Sam Altman Testifies on AI Oversight Before Senate

PBS NewsHour

The landmark May 2023 Senate Judiciary subcommittee hearing where Altman told Congress that government intervention is critical to mitigate AI risks and proposed licensing for the most powerful systems.

Beginner · 2023

Artificial Intelligence & Personhood: Crash Course Philosophy #23

CrashCourse

Hank Green walks through definitions of 'strong AI,' the Turing Test, and Searle's Chinese Room, all foundational to questions about machine minds, consciousness, and moral status.

Beginner · 2016

Do Robots Deserve Rights? What If Machines Become Conscious?

Kurzgesagt – In a Nutshell

Kurzgesagt explores the moral-patienthood problem: if machines become conscious, what rights would they deserve, and why are our existing ethics ill-equipped to answer?

Beginner · 2017