All Episodes

August 21, 2025 84 mins

We dig into how the concept of AI "safety" has been co-opted and weaponized by tech companies. Starting with examples like Mecha-Hitler Grok, we explore how real safety engineering differs from AI "alignment," the myth of the alignment tax, and why this semantic confusion matters for actual safety.

  • (00:00) - Intro
  • (00:21) - Mecha-Hitler Grok
  • (10:07) - "Safety"
  • (19:40) - Under-specification
  • (53:56) - This time isn't different
  • (01:01:46) - Alignment Tax myth
  • (01:17:37) - Actually making AI safer

Links
  • JMLR article - Underspecification Presents Challenges for Credibility in Modern Machine Learning
  • Trail of Bits paper - Towards Comprehensive Risk Assessments and Assurance of AI-Based Systems
  • SSRN paper - Uniqueness Bias: Why It Matters, How to Curb It

Additional Referenced Papers

  • NeurIPS paper - Safetywashing: Do AI Safety Benchmarks Actually Measure Safety Progress?
  • ICML paper - AI Control: Improving Safety Despite Intentional Subversion
  • ICML paper - DarkBench: Benchmarking Dark Patterns in Large Language Models
  • OSF preprint - Current Real-World Use of Large Language Models for Mental Health
  • Anthropic preprint - Training a Helpful and Harmless Assistant with Reinforcement Learning from Human Feedback

Inciting Examples

  • ars Technica article - US government agency drops Grok after MechaHitler backlash, report says
  • The Guardian article - Musk’s AI Grok bot rants about ‘white genocide’ in South Africa in unrelated chats
  • BBC article - Update that made ChatGPT 'dangerously' sycophantic pulled

Other Sources

  • London Daily article - UK AI Safety Institute Rebrands as AI Security Institute to Focus on Crime and National Security
  • Vice article - Prominent AI Philosopher and ‘Father’ of Longtermism Sent Very Racist Email to a 90s Philosophy Listserv
  • LessWrong blogpost - "notkilleveryoneism" sounds dumb (see comments)
  • EA Forum blogpost - An Overview of the AI Safety Funding Situation
  • Book by Dmitry Chernov and Didier Sornette - Man-made Catastrophes and Risk Information Concealment
  • Euronews article - OpenAI adds mental health safeguards to ChatGPT, saying chatbot has fed into users’ ‘delusions’
  • Pleias website
  • Wikipedia page on Jaywalking
Mark as Played

Advertise With Us

Popular Podcasts

Dateline NBC

Dateline NBC

Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations. Follow now to get the latest episodes of Dateline NBC completely free, or subscribe to Dateline Premium for ad-free listening and exclusive bonus content: DatelinePremium.com

Are You A Charlotte?

Are You A Charlotte?

In 1997, actress Kristin Davis’ life was forever changed when she took on the role of Charlotte York in Sex and the City. As we watched Carrie, Samantha, Miranda and Charlotte navigate relationships in NYC, the show helped push once unacceptable conversation topics out of the shadows and altered the narrative around women and sex. We all saw ourselves in them as they searched for fulfillment in life, sex and friendships. Now, Kristin Davis wants to connect with you, the fans, and share untold stories and all the behind the scenes. Together, with Kristin and special guests, what will begin with Sex and the City will evolve into talks about themes that are still so relevant today. "Are you a Charlotte?" is much more than just rewatching this beloved show, it brings the past and the present together as we talk with heart, humor and of course some optimism.

Stuff You Should Know

Stuff You Should Know

If you've ever wanted to know about champagne, satanism, the Stonewall Uprising, chaos theory, LSD, El Nino, true crime and Rosa Parks, then look no further. Josh and Chuck have you covered.

Music, radio and podcasts, all free. Listen online or download the iHeart App.

Connect

© 2025 iHeartMedia, Inc.