The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Machine learning and artificial intelligence are dramatically changing the way businesses operate and people live. The TWIML AI Podcast brings the top minds and ideas from the world of ML and AI to a broad and influential community of ML/AI researchers, data scientists, engineers and tech-savvy business and IT leaders. Hosted by Sam Charrington, a sought after industry analyst, speaker, commentator and thought leader. Technologies covered include machine learning, artificial intelligence, deep learning, natural language processing, neural networks, analytics, computer science, data science and more.

Episodes

High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753

October 28, 2025 • 52 mins

In this episode, Hung Bui, Technology Vice President at Qualcomm, joins us to explore the latest high-efficiency techniques for running generative AI, particularly diffusion models, on-device. We dive deep into the technical challenges of deploying these models, which are powerful but computationally expensive due to their iterative sampling process. Hung details his team's work on SwiftBrush and SwiftEdit, which enable high-qualit...

Mark as Played

Vibe Coding's Uncanny Valley with Alexandre Pesant - #752

October 22, 2025 • 72 mins

Today, we're joined by Alexandre Pesant, AI lead at Lovable, who joins us to discuss the evolution and practice of vibe coding. Alex shares his take on how AI is enabling a shift in software development from typing characters to expressing intent, creating a new layer of abstraction similar to how high-level code compiles to machine code. We explore the current capabilities and limitations of coding agents, the importance of contex...

Mark as Played

Dataflow Computing for AI Inference with Kunle Olukotun - #751

October 14, 2025 • 57 mins

In this episode, we're joined by Kunle Olukotun, professor of electrical engineering and computer science at Stanford University and co-founder and chief technologist at Sambanova Systems, to discuss reconfigurable dataflow architectures for AI inference. Kunle explains the core idea of building computers that are dynamically configured to match the dataflow graph of an AI model, moving beyond the traditional instruction-fetch para...

Mark as Played

Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750

October 7, 2025 • 57 mins

Today, we're joined by Jacob Buckman, co-founder and CEO of Manifest AI to discuss achieving long context in transformers. We discuss the bottlenecks of scaling context length and recent techniques to overcome them, including windowed attention, grouped query attention, and latent space attention. We explore the idea of weight-state balance and the weight-state FLOP ratio as a way of reasoning about the optimality of compute archit...

Mark as Played

The Decentralized Future of Private AI with Illia Polosukhin - #749

September 30, 2025 • 65 mins

In this episode, Illia Polosukhin, a co-author of the seminal "Attention Is All You Need" paper and co-founder of Near AI, joins us to discuss his vision for building private, decentralized, and user-owned AI. Illia shares his unique journey from developing the Transformer architecture at Google to building the NEAR Protocol blockchain to solve global payment challenges, and now applying those decentralized principles back to AI. W...

Mark as Played

Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748

September 23, 2025 • 63 mins

Today, we’re joined by Oliver Wang, principal scientist at Google DeepMind and tech lead for Gemini 2.5 Flash Image—better known by its code name, “Nano Banana.” We dive into the development and capabilities of this newly released frontier vision-language model, beginning with the broader shift from specialized image generators to general-purpose multimodal agents that can use both visual and textual data for a variety of tasks. Ol...

Mark as Played

Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747

September 16, 2025 • 58 mins

Today, we're joined by Aditi Raghunathan, assistant professor at Carnegie Mellon University, to discuss the limitations of LLMs and how we can build more adaptable and creative models. We dig into her ICML 2025 Outstanding Paper Award winner, “Roll the dice & look before you leap: Going beyond the creative limits of next-token prediction,” which examines why LLMs struggle with generating truly novel ideas. We dig into the "Roll the...

Mark as Played

Building an Immune System for AI Generated Software with Animesh Koratana - #746

September 9, 2025 • 65 mins

Today, we're joined by Animesh Koratana, founder and CEO of PlayerZero to discuss his team’s approach to making agentic and AI-assisted coding tools production-ready at scale. Animesh explains how rapid advances in AI-assisted coding have created an “asymmetry” where the speed of code output outpaces the maturity of processes for maintenance and support. We explore PlayerZero’s debugging and code verification platform, which uses c...

Mark as Played

Autoformalization and Verifiable Superintelligence with Christian Szegedy - #745

September 2, 2025 • 71 mins

In this episode, Christian Szegedy, Chief Scientist at Morph Labs, joins us to discuss how the application of formal mathematics and reasoning enables the creation of more robust and safer AI systems. A pioneer behind concepts like the Inception architecture and adversarial examples, Christian now focuses on autoformalization—the AI-driven process of translating mathematical concepts from their human-readable form into rigorously f...

Mark as Played

Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

August 26, 2025 • 70 mins

Today, we're joined by Prince Canuma, an ML engineer and open-source developer focused on optimizing AI inference on Apple Silicon devices. Prince shares his journey to becoming one of the most prolific contributors to Apple’s MLX ecosystem, having published over 1,000 models and libraries that make open, multimodal AI accessible and performant on Apple devices. We explore his workflow for adapting new models in MLX, the trade-offs...

Mark as Played

Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743

August 19, 2025 • 61 mins

Today, we're joined by Jack Parker-Holder and Shlomi Fruchter, researchers at Google DeepMind, to discuss the recent release of Genie 3, a model capable of generating “playable” virtual worlds. We dig into the evolution of the Genie project and review the current model’s scaled-up capabilities, including creating real-time, interactive, and high-resolution environments. Jack and Shlomi share their perspectives on what defines a wor...

Mark as Played

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

August 12, 2025 • 61 mins

In this episode, we're joined by Lin Qiao, CEO and co-founder of Fireworks AI. Drawing on key lessons from her time building PyTorch, Lin shares her perspective on the modern generative AI development lifecycle. She explains why aligning training and inference systems is essential for creating a seamless, fast-moving production pipeline, preventing the friction that often stalls deployment. We explore the strategic shift from treat...

Mark as Played

Context Engineering for Productive AI Agents with Filip Kozera - #741

July 29, 2025 • 46 mins

In this episode, Filip Kozera, founder and CEO of Wordware, explains his approach to building agentic workflows where natural language serves as the new programming interface. Filip breaks down the architecture of these "background agents," explaining how they use a reflection loop and tool-calling to execute complex tasks. He discusses the current limitations of agent protocols like MCPs and how developers can extend them to handl...

Mark as Played

Infrastructure Scaling and Compound AI Systems with Jared Quincy Davis - #740

July 22, 2025 • 73 mins

In this episode, Jared Quincy Davis, founder and CEO at Foundry, introduces the concept of "compound AI systems," which allows users to create powerful, efficient applications by composing multiple, often diverse, AI models and services. We discuss how these "networks of networks" can push the Pareto frontier, delivering results that are simultaneously faster, more accurate, and even cheaper than single-model approaches. Using exam...

Mark as Played

Building Voice AI Agents That Don’t Suck with Kwindla Kramer - #739

July 15, 2025 • 73 mins

In this episode, Kwindla Kramer, co-founder and CEO of Daily and creator of the open source Pipecat framework, joins us to discuss the architecture and challenges of building real-time, production-ready conversational voice AI. Kwin breaks down the full stack for voice agents—from the models and APIs to the critical orchestration layer that manages the complexities of multi-turn conversations. We explore why many production systems...

Mark as Played

Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738

July 9, 2025 • 60 mins

Today, we're joined by Fatih Porikli, senior director of technology at Qualcomm AI Research for an in-depth look at several of Qualcomm's accepted papers and demos featured at this year’s CVPR conference. We start with “DiMA: Distilling Multi-modal Large Language Models for Autonomous Driving,” an end-to-end autonomous driving system that incorporates distilling large language models for structured scene understanding and safe plan...

Mark as Played

Building the Internet of Agents with Vijoy Pandey - #737

June 24, 2025 • 56 mins

Today, we're joined by Vijoy Pandey, SVP and general manager at Outshift by Cisco to discuss a foundational challenge for the enterprise: how do we make specialized agents from different vendors collaborate effectively? As companies like Salesforce, Workday, and Microsoft all develop their own agentic systems, integrating them creates a complex, probabilistic, and noisy environment, a stark contrast to the deterministic APIs of the...

Mark as Played

LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736

June 17, 2025 • 59 mins

Today, we're joined by Ben Wellington, deputy head of feature forecasting at Two Sigma. We dig into the team’s end-to-end approach to leveraging AI in equities feature forecasting, covering how they identify and create features, collect and quantify historical data, and build predictive models to forecast market behavior and asset prices for trading and investment. We explore the firm's platform-centric approach to managing an exte...

Mark as Played

Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735

June 10, 2025 • 56 mins

Today, we're joined by Jason Corso, co-founder of Voxel51 and professor at the University of Michigan, to explore automated labeling in computer vision. Jason introduces FiftyOne, an open-source platform for visualizing datasets, analyzing models, and improving data quality. We focus on Voxel51’s recent research report, “Zero-shot auto-labeling rivals human performance,” which demonstrates how zero-shot auto-labeling with foundatio...

Mark as Played

Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734

June 4, 2025 • 85 mins

Today, we're joined by Charles Martin, founder of Calculation Consulting, to discuss Weight Watcher, an open-source tool for analyzing and improving Deep Neural Networks (DNNs) based on principles from theoretical physics. We explore the foundations of the Heavy-Tailed Self-Regularization (HTSR) theory that underpins it, which combines random matrix theory and renormalization group ideas to uncover deep insights about model trainin...

Mark as Played

Popular Podcasts

Spooky Podcasts from iHeartRadio

Whether you’re a scaredy-cat or a brave bat, this collection of episodes from iHeartPodcasts will put you in the Halloween spirit. Binge stories, frights, and more that may keep you up at night!

Dateline NBC

Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations. Follow now to get the latest episodes of Dateline NBC completely free, or subscribe to Dateline Premium for ad-free listening and exclusive bonus content: DatelinePremium.com

Stuff You Should Know

If you've ever wanted to know about champagne, satanism, the Stonewall Uprising, chaos theory, LSD, El Nino, true crime and Rosa Parks, then look no further. Josh and Chuck have you covered.

Health Stuff

On Health Stuff, hosts Dr. Priyanka Wali and comedian Hari Kondabolu tackle all the health questions that keep you up at night with hilarity and humanity. Together they demystify the flashy trends, and keep you informed on the latest research. You can rely on Health Stuff to bring you real, uninhibited, and thoughtful health talk of the highest caliber, and a healthy dose of humor.

The Breakfast Club

The World's Most Dangerous Morning Show, The Breakfast Club, With DJ Envy, Jess Hilarious, And Charlamagne Tha God!

Advertise With Us

The TWIML AI Podcast (formerly This Week in Machine Learning & Artificial Intelligence)

Episodes

.css-14f5ked{margin:0;word-break:break-word;display:-webkit-box;-webkit-box-orient:vertical;box-orient:vertical;-webkit-line-clamp:2;overflow:hidden;}High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753

.css-r6mb8g{margin:0;word-break:break-word;display:-webkit-box;-webkit-box-orient:vertical;box-orient:vertical;-webkit-line-clamp:1;overflow:hidden;}Vibe Coding's Uncanny Valley with Alexandre Pesant - #752

Dataflow Computing for AI Inference with Kunle Olukotun - #751

Recurrence and Attention for Long-Context Transformers with Jacob Buckman - #750

The Decentralized Future of Private AI with Illia Polosukhin - #749

Inside Nano Banana 🍌 and the Future of Vision-Language Models with Oliver Wang - #748

Is It Time to Rethink LLM Pre-Training? with Aditi Raghunathan - #747

Building an Immune System for AI Generated Software with Animesh Koratana - #746

Autoformalization and Verifiable Superintelligence with Christian Szegedy - #745

Multimodal AI Models on Apple Silicon with MLX with Prince Canuma - #744

Genie 3: A New Frontier for World Models with Jack Parker-Holder and Shlomi Fruchter - #743

Closing the Loop Between AI Training and Inference with Lin Qiao - #742

Context Engineering for Productive AI Agents with Filip Kozera - #741

Infrastructure Scaling and Compound AI Systems with Jared Quincy Davis - #740

Building Voice AI Agents That Don’t Suck with Kwindla Kramer - #739

Distilling Transformers and Diffusion Models for Robust Edge Use Cases with Fatih Porikli - #738

Building the Internet of Agents with Vijoy Pandey - #737

LLMs for Equities Feature Forecasting at Two Sigma with Ben Wellington - #736

Zero-Shot Auto-Labeling: The End of Annotation for Computer Vision with Jason Corso - #735

Grokking, Generalization Collapse, and the Dynamics of Training Deep Neural Networks with Charles Martin - #734

Popular Podcasts

High-Efficiency Diffusion Models for On-Device Image Generation and Editing with Hung Bui - #753

Vibe Coding's Uncanny Valley with Alexandre Pesant - #752