All Episodes

June 6, 2025 20 mins

Reward engineering is one of the most brutal, time-consuming challenges in AI—a "black art" that forms the very foundation of how intelligent agents learn. For decades, it's been a manual process of trial, error, and intuition. But what if an AI could learn this art and perform it better than its human creators?

In this episode, we dissect EUREKA, a groundbreaking system from NVIDIA that automates reward design, achieving superhuman results. This isn't just an incremental improvement; it's a fundamental shift in how we build and teach AI. We explore how EUREKA enabled a robot hand to master dexterous pen-spinning for the first time—a skill previously thought impossible—by discovering incentive structures that are often profoundly counter-intuitive to human experts.

Prepare to level up your understanding of AI's creative potential. This is the story of how AI learned to write the rules for itself, and it will change how you think about the future of intelligent systems.

In this episode, you’ll discover:

    • (02:10) The Expert's Bottleneck: Why reward design is the frustrating, manual trial-and-error process that has slowed down AI progress for years (with 89% of human-designed rewards being sub-optimal).

    • (06:45) The EUREKA Breakthrough: An introduction to the system that uses GPT-4 to write executable reward code, essentially turning AI into its own most effective teacher.

    • (11:30) The Engine of Success: A deep dive into the three pillars of EUREKA:

        • Environment as Context: Giving the LLM the source code to see the world as it truly is.

        • Evolutionary Search: The "survival of the fittest" process for generating and refining reward code.

        • Reward Reflection: The secret sauce—a detailed feedback loop that tells the AI why a reward worked, enabling targeted, intelligent improvement.

    • (19:05) The Shocking Results: How EUREKA outperformed expert humans on 83% of tasks, delivering an average 52% performance boost and unlocking the "impossible" skill of pen-spinning.

    • (25:50) Beyond Human Intuition: Why EUREKA's best solutions are often ones humans would never think of, and what this reveals about discovering truly novel principles in AI.

    • (31:15) The New Era of Collaboration: How this technology isn't just about replacement, but about augmenting human expertise—improving our rewards and incorporating our qualitative feedback to create more aligned AI.

Mark as Played

Advertise With Us

Popular Podcasts

Stuff You Should Know
Dateline NBC

Dateline NBC

Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations. Follow now to get the latest episodes of Dateline NBC completely free, or subscribe to Dateline Premium for ad-free listening and exclusive bonus content: DatelinePremium.com

New Heights with Jason & Travis Kelce

New Heights with Jason & Travis Kelce

Football’s funniest family duo — Jason Kelce of the Philadelphia Eagles and Travis Kelce of the Kansas City Chiefs — team up to provide next-level access to life in the league as it unfolds. The two brothers and Super Bowl champions drop weekly insights about the weekly slate of games and share their INSIDE perspectives on trending NFL news and sports headlines. They also endlessly rag on each other as brothers do, chat the latest in pop culture and welcome some very popular and well-known friends to chat with them. Check out new episodes every Wednesday. Follow New Heights on the Wondery App, YouTube or wherever you get your podcasts. You can listen to new episodes early and ad-free, and get exclusive content on Wondery+. Join Wondery+ in the Wondery App, Apple Podcasts or Spotify. And join our new membership for a unique fan experience by going to the New Heights YouTube channel now!

Music, radio and podcasts, all free. Listen online or download the iHeart App.

Connect

© 2025 iHeartMedia, Inc.