The chatbot is definitely trying to kill you, maybe. Send us money.
Text version: https://pivot-to-ai.com/2025/04/18/reasoning-ai-is-lying-to-you-or-maybe-its-just-hallucinating-again/
Sources:
Anthropic: Reasoning models don't always say what they think https://www.anthropic.com/research/reasoning-models-dont-say-think
paper (PDF) https://assets.anthropic.com/m/71876fabef0f0ed4/original/reasoning_models_paper.pdf
Introducing Transluce https://transluce.org/introducing-transluce
Investigating truthfulness in a pre-release o3 model https://transluce.org/investigating-o3-truthfulness
Transluce: "These behaviors are surprising." https://x.com/TransluceAI/status/1912552068717637980
(Ars Technica article, edited) Researchers concerned to find AI models misrepresenting their “reasoning” processes https://arstechnica.com/ai/2025/04/researchers-concerned-to-find-ai-models-hiding-their-true-reasoning-processes/
(Ars Technica article, original) Researchers concerned to find AI models hiding their true “reasoning” processes https://web.archive.org/web/20250410231357/https://arstechnica.com/ai/2025/04/researchers-concerned-to-find-ai-models-hiding-their-true-reasoning-processes/
Copyscape is nice for quickly comparing web pages https://copyscape.com
Previously:
Anthropic, Apollo astounded to find a chatbot will lie to you if you tell it to lie to you https://pivot-to-ai.com/2024/12/19/anthropic-and-apollo-astounded-to-find-that-a-chatbot-will-lie-to-you-if-you-tell-it-to-lie-to-you/
How Sam Altman got fired from OpenAI in 2023: not being an AI doom crank (and lying a lot) https://pivot-to-ai.com/2025/04/06/how-sam-altman-got-fired-from-openai-in-2023-not-being-an-ai-doom-crank-and-lying-a-lot/
video: https://www.youtube.com/watch?v=xlrBjeAtJUk&list=UU9rJrMVgcXTfa8xuMnbhAEA
T-shirt store now open! https://pivot-to-ai.redbubble.com
Enhance the channel: https://www.amazon.co.uk/hz/wishlist/ls/3Q8VZW46J6DM6
Please fund my vital AI safety research! The fate of humanity is at stake!
Patreon: https://www.patreon.com/davidgerard
Ko-Fi: https://ko-fi.com/A1529D5