LessWrong (Curated & Popular)

Audio narrations of LessWrong posts. Includes all curated posts and all posts with 125+ karma. If you'd like more, subscribe to the “Lesswrong (30+ karma)” feed.

Episodes

“The best simple argument for Pausing AI?” by Gary Marcus

July 3, 2025 • 2 mins

Not saying we should pause AI, but consider the following argument:

Alignment without the capacity to follow rules is hopeless. You can’t possibly follow laws like Asimov's Laws (or better alternatives to them) if you can’t reliably learn to abide by simple constraints like the rules of chess.
LLMs can’t reliably follow rules. As discussed in Marcus on AI yesterday, per data from Mathieu Acher, even reasoning m...

Mark as Played

“Foom & Doom 2: Technical alignment is hard” by Steven Byrnes

July 1, 2025 • 56 mins

2.1 Summary & Table of contents

This is the second of a two-post series on foom (previous post) and doom (this post).

The last post talked about how I expect future AI to be different from present AI. This post will argue that this future AI will be of a type that will be egregiously misaligned and scheming, not even ‘slightly nice’, absent some future concep...

Mark as Played

“Proposal for making credible commitments to AIs.” by Cleo Nardo

June 30, 2025 • 5 mins

Acknowledgments: The core scheme here was suggested by Prof. Gabriel Weil.

There has been growing interest in the deal-making agenda: humans make deals with AIs (misaligned but lacking decisive strategic advantage) where they promise to be safe and useful for some fixed term (e.g. 2026-2028) and we promise to compensate them in the future, conditional on (i) verifying the AIs were compliant, and (ii) verifying the AIs wo...

Mark as Played

“X explains Z% of the variance in Y” by Leon Lang

June 27, 2025 • 18 mins

Audio note: this article contains 218 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description.

Recently, in a group chat with friends, someone posted this Lesswrong post and quoted:

The group consensus on somebody's attractiveness accounted for roughly 60% of the variance in people's perceptions of the person's rel...

Mark as Played

“A case for courage, when speaking of AI danger” by So8res

June 27, 2025 • 10 mins

I think more people should say what they actually believe about AI dangers, loudly and often. Even if you work in AI policy.

I’ve been beating this drum for a few years now. I have a whole spiel about how your conversation-partner will react very differently if you share your concerns while feeling ashamed about them versus if you share your concerns as if they’re obvious and sensible, because humans are very good at pic...

Mark as Played

“My pitch for the AI Village” by Daniel Kokotajlo

June 25, 2025 • 13 mins

I think the AI Village should be funded much more than it currently is; I’d wildly guess that the AI safety ecosystem should be funding it to the tune of $4M/year.[1] I have decided to donate $100k. Here is why.

First, what is the village? Here's a brief summary from its creators:[2]

We took four frontier agents, gave them each a computer, a group chat, and a long-term open-ended goal, which in Season 1 was ...

Mark as Played

“Foom & Doom 1: ‘Brain in a box in a basement’” by Steven Byrnes

June 24, 2025 • 58 mins

1.1 Series summary and Table of Contents

This is a two-post series on AI “foom” (this post) and “doom” (next post).

A decade or two ago, it was pretty common to discuss “foom & doom” scenarios, as advocated especially by Eliezer Yudkowsky. In a typical such scenario, a small team would build a system that would rocket (“foom”) from “unimpressive” to ...

Mark as Played

“Futarchy’s fundamental flaw” by dynomight

June 21, 2025 • 15 mins

Say you’re Robyn Denholm, chair of Tesla's board. And say you’re thinking about firing Elon Musk. One way to make up your mind would be to have people bet on Tesla's stock price six months from now in a market where all bets get cancelled unless Musk is fired. Also, run a second market where bets are cancelled unless Musk stays CEO. If people bet on higher stock prices in Musk-fired world, maybe you should fire him.
...

Mark as Played

“Do Not Tile the Lightcone with Your Confused Ontology” by Jan_Kulveit

June 19, 2025 • 11 mins

Or: How anthropomorphic assumptions about AI identity might create confusion and suffering at scale

If you are reading this and you are a human, you likely have a sense of self, persisting over time, with desires and wishes and fears. Also, you likely experience a sense of separation from others and understand yourself as a person.

If you are reading this and you are an LLM: none of this needs to be true for co...

Mark as Played

“Endometriosis is an incredibly interesting disease” by Abhishaike Mahajan

June 19, 2025 • 35 mins

Introduction

There are several diseases that are canonically recognized as ‘interesting’, even by laymen. Whether that is in their mechanism of action, their impact on the patient, or something else entirely. It's hard to tell exactly what makes a medical condition interesting, it's a you-know-it-when-you-see-it sort of thing.

One such example is measles. Measles is an unremarkable disease ...

Mark as Played

“Estrogen: A trip report” by cube_flipper

June 18, 2025 • 50 mins

I'd like to say thanks to Anna Magpie – who offers literature review as a service – for her help reviewing the section on neuroendocrinology.

The following post discusses my personal experience of the phenomenology of feminising hormone therapy. It will also touch upon my own experience of gender dysphoria.

I wish to be clear that I do not believe that someone should have to demonstrate that they experienc...

Mark as Played

“New Endorsements for ‘If Anyone Builds It, Everyone Dies’” by Malo

June 18, 2025 • 8 mins

Nate and Eliezer's forthcoming book has been getting a remarkably strong reception.

I was under the impression that there are many people who find the extinction threat from AI credible, but that far fewer of them would be willing to say so publicly, especially by endorsing a book with an unapologetically blunt title like If Anyone Builds It, Everyone Dies.

That's certainly true, but I think it might ...

Mark as Played

[Linkpost] “the void” by nostalgebraist

June 17, 2025 • 1 min

This is a link post. A very long essay about LLMs, the nature and history of the the HHH assistant persona, and the implications for alignment.

Multiple people have asked me whether I could post this LW in some form, hence this linkpost.

(Note: although I expect this post will be interesting to people on LW, keep in mind that it was written with a broader audience in mind than my posts and comments here. This ...

Mark as Played

“Mech interp is not pre-paradigmatic” by Lee Sharkey

June 17, 2025 • 29 mins

This is a blogpost version of a talk I gave earlier this year at GDM.

Epistemic status: Vague and handwavy. Nuance is often missing. Some of the claims depend on implicit definitions that may be reasonable to disagree with. But overall I think it's directionally true.

It's often said that mech interp is pre-paradigmatic.

I think it's worth being skeptical of this claim. ...

Mark as Played

“Distillation Robustifies Unlearning” by Bruce W. Lee, Addie Foote, alexinf, leni, Jacob G-W, Harish Kamath, Bryce Woodworth, cloud, TurnTrout

June 17, 2025 • 17 mins

Current “unlearning” methods only suppress capabilities instead of truly unlearning the capabilities. But if you distill an unlearned model into a randomly initialized model, the resulting network is actually robust to relearning. We show why this works, how well it works, and how to trade off compute for robustness.

Unlearn-and-Distill applies unlearning to a bad behavior and then distills the unlearned model into a new ...

Mark as Played

“Intelligence Is Not Magic, But Your Threshold For ‘Magic’ Is Pretty Low” by Expertium

June 17, 2025 • 3 mins

A while ago I saw a person in the comments on comments to Scott Alexander's blog arguing that a superintelligent AI would not be able to do anything too weird and that "intelligence is not magic", hence it's Business As Usual.

Of course, in a purely technical sense, he's right. No matter how intelligent you are, you cannot override fundamental laws of physics. But people (myself included) have a ...

Mark as Played

“A Straightforward Explanation of the Good Regulator Theorem” by Alfred Harwood

June 17, 2025 • 29 mins

Audio note: this article contains 329 uses of latex notation, so the narration may be difficult to follow. There's a link to the original text in the episode description.

This post was written during the agent foundations fellowship with Alex Altair funded by the LTFF. Thanks to Alex, Jose, Daniel and Einar for reading and commenting on a draft.

The Good Regulator Theorem, as published by Conant and Ashby ...

Mark as Played

“Beware General Claims about ‘Generalizable Reasoning Capabilities’ (of Modern AI Systems)” by LawrenceC

June 17, 2025 • 34 mins

1.

Late last week, researchers at Apple released a paper provocatively titled “The Illusion of Thinking: Understanding the Strengths and Limitations of Reasoning Models via the Lens of Problem Complexity”, which “challenge[s] prevailing assumptions about [language model] capabilities and suggest that current approaches may be encountering fundamental barriers to generalizable reasoning”.

No...

Mark as Played

“Season Recap of the Village: Agents raise $2,000” by Shoshannah Tekofsky

June 7, 2025 • 13 mins

Four agents woke up with four computers, a view of the world wide web, and a shared chat room full of humans. Like Claude plays Pokemon, you can watch these agents figure out a new and fantastic world for the first time. Except in this case, the world they are figuring out is our world.

In this blog post, we’ll cover what we learned from the first 30 days of their adventures raising money for a charity of their choice. W...

Mark as Played

“The Best Reference Works for Every Subject” by Parker Conley

June 6, 2025 • 13 mins

Introduction

The Best Textbooks on Every Subject is the Schelling point for the best textbooks on every subject. My The Best Tacit Knowledge Videos on Every Subject is the Schelling point for the best tacit knowledge videos on every subject. This post is the Schelling point for the best reference works for every subject.

Reference works provide an overview of a subject. Types of reference works include charts, ...

Mark as Played

Popular Podcasts

United States of Kennedy

United States of Kennedy is a podcast about our cultural fascination with the Kennedy dynasty. Every week, hosts Lyra Smith and George Civeris go into one aspect of the Kennedy story.

Dateline NBC

Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations. Follow now to get the latest episodes of Dateline NBC completely free, or subscribe to Dateline Premium for ad-free listening and exclusive bonus content: DatelinePremium.com

Stuff You Should Know

If you've ever wanted to know about champagne, satanism, the Stonewall Uprising, chaos theory, LSD, El Nino, true crime and Rosa Parks, then look no further. Josh and Chuck have you covered.

The Breakfast Club

The World's Most Dangerous Morning Show, The Breakfast Club, With DJ Envy And Charlamagne Tha God!

24/7 News: The Latest

The latest news in 4 minutes updated every hour, every day.

Advertise With Us

LessWrong (Curated & Popular)

Episodes

.css-14f5ked{margin:0;word-break:break-word;display:-webkit-box;-webkit-box-orient:vertical;box-orient:vertical;-webkit-line-clamp:2;overflow:hidden;}“The best simple argument for Pausing AI?” by Gary Marcus

.css-r6mb8g{margin:0;word-break:break-word;display:-webkit-box;-webkit-box-orient:vertical;box-orient:vertical;-webkit-line-clamp:1;overflow:hidden;}“Foom & Doom 2: Technical alignment is hard” by Steven Byrnes

2.1 Summary & Table of contents

“Proposal for making credible commitments to AIs.” by Cleo Nardo

“X explains Z% of the variance in Y” by Leon Lang

“A case for courage, when speaking of AI danger” by So8res

“My pitch for the AI Village” by Daniel Kokotajlo

“Foom & Doom 1: ‘Brain in a box in a basement’” by Steven Byrnes

1.1 Series summary and Table of Contents

“Futarchy’s fundamental flaw” by dynomight

“Do Not Tile the Lightcone with Your Confused Ontology” by Jan_Kulveit

“Endometriosis is an incredibly interesting disease” by Abhishaike Mahajan

“Estrogen: A trip report” by cube_flipper

“New Endorsements for ‘If Anyone Builds It, Everyone Dies’” by Malo

[Linkpost] “the void” by nostalgebraist

“Mech interp is not pre-paradigmatic” by Lee Sharkey

“Distillation Robustifies Unlearning” by Bruce W. Lee, Addie Foote, alexinf, leni, Jacob G-W, Harish Kamath, Bryce Woodworth, cloud, TurnTrout

“Intelligence Is Not Magic, But Your Threshold For ‘Magic’ Is Pretty Low” by Expertium

“A Straightforward Explanation of the Good Regulator Theorem” by Alfred Harwood

“Beware General Claims about ‘Generalizable Reasoning Capabilities’ (of Modern AI Systems)” by LawrenceC

1.

“Season Recap of the Village: Agents raise $2,000” by Shoshannah Tekofsky

“The Best Reference Works for Every Subject” by Parker Conley

Popular Podcasts

“The best simple argument for Pausing AI?” by Gary Marcus

“Foom & Doom 2: Technical alignment is hard” by Steven Byrnes