In this solo episode, we go beyond Google's Gemini and OpenAI's ChatGPT to take a look at Anthropic, a startup that made headlines after securing a $4 billion investment from Amazon. We'll also dive into the importance of AI industry benchmarks. Learn about LMSYS's Arena Elo and MMLU (Measuring Massive Multitask Language Understanding), including how these benchmarks are constructed and used to objectively evaluate the performance of large language models. Discover how benchmarks can help you identify promising chatbots in the market. Enjoy the episode!
Anthropic's Claude
https://claude.ai
LMSYS Leaderboard
https://chat.lmsys.org/?leaderboard
For more information, check out https://www.superprompt.fm. There you can contact me or sign up for our newsletter.