Hugging Face Trending Papers

Stay ahead in AI with Hugging Face Trending Papers — your daily digest of trending ai research. Hosts break down the most talked-about papers in machine learning, LLMs, generative AI, and robotics in just few minutes. Clear, conversational insights on problems, methods, benchmarks, and real-world impact — no jargon overload. Perfect for researchers, engineers, students, and AI enthusiasts.

Episodes

Episode. 15: Real-Time AI: Video, Proactive LLMs & Text Structure

March 5, 2026 • 10 mins

This episode explores groundbreaking AI research, featuring Helios, a real-time long video generation model; Proact-VL, a proactive VideoLLM for real-time AI companions; and T2S-Bench & Structure-of-Thought, a new benchmark and prompting technique for text-to-structure reasoning.

### Featured Papers* **Helios: Real Real-Time Long Video Generation Model** * **Key Insight:** Helios is the first 14B video generation model capabl...

Listen

Watch

Mark as Played

Episode 14: Revolutionizing Deep Learning: The Rise of CUDA Agent and Agentic RL

March 5, 2026 • 3 mins

# Hugging Face Trending Papers Episode Summary
In this episode, we discuss two trending papers, "Large-Scale Agentic RL for High-Performance CUDA Kernel Generation" and "Language-Agnostic SWE Task Collection at Scale". The first paper presents CUDA Agent, a large-scale reinforcement learning system that optimizes GPUs for deep learning, and the second introduces SWE-rebench V2, a language-agnostic, automated ...

Listen

Watch

Mark as Played

Episode 13: Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

November 20, 2025 • 2 mins

Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

**Source:** huggingface_daily

**URL:** https://huggingface.co/papers/2511.14993

**Key Points:**- Problem: The research addresses the challenges in high-resolution image and video generation, particularly the scalability and computational complexity associa...- Method: The authors introduce Kandinsky 5.0, a family of foundation models comprising three core ...

Listen

Watch

Mark as Played

Episode 12: Exploring Next-Gen AI: Interactive Scaling & Video-Based Reasoning

November 19, 2025 • 3 mins

# Episode SummaryIn this episode of Hugging Face Trending Papers, we delve into the latest AI research with three top trending papers from arXiv. We explore MiroThinker's interaction scaling for open-source research agents, the new paradigm of "Thinking with Video" for multimodal reasoning, and Lumine's approach to building generalist AI agents for 3D open-world environments.

# Mentioned Papers
1. ["Mir...

Listen

Watch

Mark as Played

Episode 11: Unlocking AI Reasoning: Breakthroughs in Looped Language Models

November 1, 2025 • 5 mins

Papers discussed:

1. [Scaling Latent Reasoning via Looped Language Models](https://arxiv.org/pdf/2510.25741): This paper introduces a new kind of pre-trained looped language models, Ouro, which improves reasoning capabilities by integrating reasoning into the pre-training phase. The models have demonstrated superior performance due to enhanced knowledge manipulation capabilities.

2. [Concerto: Joint 2D-3D Self-Supervised Lear...

Listen

Watch

Mark as Played

Episode 10: AI's New Brain: LLM Reasoning, Memory, Agents

October 22, 2025 • 3 mins

**Episode Summary:**This episode dives into cutting-edge advancements for Large Language Models, covering new methods to enhance reasoning reliability and efficiency, and introducing lightweight memory systems for more effective long-term interaction.

**Featured Papers:**
* **A Theoretical Study on Bridging Internal Probability and Self-Consistency for LLM Reasoning** * *Key Insight:* Introduces RPC, a novel method that th...

Listen

Watch

Mark as Played

Episode 9: Boosting AI Problem Solving: Tiny Networks and Early Experience Learning

October 10, 2025 • 4 mins

In this episode of Hugging Face Trending Papers, we discuss three exciting AI research papers: "Less is More: Recursive Reasoning with Tiny Networks", "Agent Learning via Early Experience", and "Paper2Video: Automatic Video Generation from Scientific Papers".

## Papers Discussed
1. **[Less is More: Recursive Reasoning with Tiny Networks](https://arxiv.org/pdf/2510.04871)**: This paper introduces...

Listen

Watch

Mark as Played

Episode 8: Boosting AI Efficiency: Code Compression, Video Generation, and Experience-based Reasoning

October 3, 2025 • 4 mins

In this episode, we discuss three trending AI research papers. We delve into the challenges and solutions related to code language models, video generation, and reinforcement learning.

Key Points Discussed
#LongCodeZip: Compress Long Context for Code Language Models- LongCodeZip is a novel framework for compressing code for Large Language Models (LLMs)- It addresses the issue of high API costs and generation latency associat...

Listen

Watch

Mark as Played

Episode 7: Agents of Change: From Interactive Papers to Lifelong AI Learning

October 2, 2025 • 3 mins

In today’s episode of Hugging Face Trending Papers, we explore three cutting-edge ideas reshaping how we interact with AI and research.

First, we dive into Paper2Agent, a framework that transforms static research papers into interactive AI agents, making findings more transparent and usable. Next, we look at Scaling Agents via Continual Pre-training, which pushes the boundaries of agent reliability by teaching them through lifelong ...

Listen

Watch

Mark as Played

Episode 6: Steering the Future: Real-Time Long Video, Training-Time Search, and a Gym for Agentic LLMs

October 2, 2025 • 8 mins

In this episode, we unpack three fresh arXiv papers shaping how AI creates, reasons, and acts. First, arXiv:2509.22622 explores real-time, steerable long-form video generation you can guide on the fly (PDF: https://arxiv.org/pdf/2509.22622).

Next, arXiv:2509.25454 integrates tree search directly into reinforcement-learning training for verifiable reasoning—think math and code with checkable rewards (PDF: https://arxiv.org/pdf/2...

Listen

Watch

Mark as Played

Episode 5: Scaling Feedback, Forgetting Smartly, and Video Agents: AI’s Next Frontier

September 24, 2025 • 7 mins

1. RLAIF at Scale: Reinforcement Learning from AI Feedback for Multi-Turn Reasoning

This paper explores using AI-generated feedback instead of expensive human labels to train reasoning models. The authors show that Reinforcement Learning from AI Feedback (RLAIF) can match or even outperform models trained with limited human feedback, especially in multi-turn reasoning tasks.

2. Learning to Forget: Dynamic Memory Compression in Lo...

Listen

Watch

Mark as Played

Episode 4: Panoramas, HALA, and the T2I Exam: Three Trends You Shouldn’t Miss

September 22, 2025 • 10 mins

Today we cover three standout arXiv releases shaping vision, language, and evaluation. First, PANORAMA surveys the rise of omnidirectional, 360° perception for embodied AI—why standard pinhole vision isn’t enough, where datasets and models fall short, and how new backbones and adaptation methods are closing the gap. Read: https://arxiv.org/pdf/2509.12989 (arXiv:2509.12989).
Next, the HALA technical report details an Arabic-centr...

Listen

Watch

Mark as Played

Episode 3: Swarms, Tiny Robot Policies & HuMo

September 21, 2025 • 8 mins

In today’s 5–6 minute roundup, we cover:

(1) SAPO’s decentralized RL that shares rollouts across a swarm for cheaper, faster LM post-training (arXiv:2509.08721 PDF),

(2) VLA-Adapter’s “Bridge Attention” that makes small vision-language-action models both fast and state-of-the-art on robotics tasks (arXiv:2509.09372 PDF), and

(3) HuMo’s unified generator coordinating text, reference images, and audio for people-centric video with st...

Listen

Watch

Mark as Played

Episode 2: Boundaries Checked, Populations Evolved, Images Understood

September 19, 2025 • 7 mins

In this episode, we cover three HuggingFace trending AI papers shaping the future of alignment, training, and creativity.

How models can reason over boundaries to stick to instructions (arXiv:2509.14760)
How populations of models can evolve without labels through consensus and novelty (arXiv:2509.15194)
How autoregressive generators can understand before they generate images (arXiv:2509.15185)

Three different paths, one goal: ...

Listen

Watch

Mark as Played

Hugging Face Trending Papers (Ep. 1) — ScaleCUA, FlowRL, RynnVLA-001

September 19, 2025 • 3 mins

Smarter computer agents, better reasoning, and robot manipulation breakthroughs.

Today’s Papers

ScaleCUA: Scaling Open-Source Computer Use Agents with Cross-Platform Data
🔗 arXiv:2509.15221
➡️ Large dataset across 6 OSs and 3 task domains; closed-loop pipeline of auto-agents + human curation; big benchmark gains for GUI agents.
FlowRL: Matching Reward Distributions for LLM Reasoning
🔗 arXiv:2509.15207
➡️ Shifts RL objective from rew...

Listen

Watch

Mark as Played

Popular Podcasts

Hey Jonas!

Hey Jonas! The official Jonas Brothers podcast. Hosted by Kevin, Joe, and Nick Jonas. It’s the Jonas Brothers you know... musicians, actors, and well, yes, brothers. Now, they’re sharing another side of themselves in the playful, intimate, and irreverent way only they can. Spend time with the Jonas Brothers here and stay a little bit longer for deep conversations like never before.

Dateline NBC

Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations. Follow now to get the latest episodes of Dateline NBC completely free, or subscribe to Dateline Premium for ad-free listening and exclusive bonus content: DatelinePremium.com

Stuff You Should Know

If you've ever wanted to know about champagne, satanism, the Stonewall Uprising, chaos theory, LSD, El Nino, true crime and Rosa Parks, then look no further. Josh and Chuck have you covered.

The Clay Travis and Buck Sexton Show

The Clay Travis and Buck Sexton Show. Clay Travis and Buck Sexton tackle the biggest stories in news, politics and current events with intelligence and humor. From the border crisis, to the madness of cancel culture and far-left missteps, Clay and Buck guide listeners through the latest headlines and hot topics with fun and entertaining conversations and opinions.

The Joe Rogan Experience

The official podcast of comedian Joe Rogan.

Advertise With Us

Hugging Face Trending Papers

Episodes

.css-14f5ked{margin:0;word-break:break-word;display:-webkit-box;-webkit-box-orient:vertical;box-orient:vertical;-webkit-line-clamp:2;overflow:hidden;}Episode. 15: Real-Time AI: Video, Proactive LLMs & Text Structure

.css-r6mb8g{margin:0;word-break:break-word;display:-webkit-box;-webkit-box-orient:vertical;box-orient:vertical;-webkit-line-clamp:1;overflow:hidden;}Episode 14: Revolutionizing Deep Learning: The Rise of CUDA Agent and Agentic RL

Episode 13: Kandinsky 5.0: A Family of Foundation Models for Image and Video Generation

Episode 12: Exploring Next-Gen AI: Interactive Scaling & Video-Based Reasoning

Episode 11: Unlocking AI Reasoning: Breakthroughs in Looped Language Models

Episode 10: AI's New Brain: LLM Reasoning, Memory, Agents

Episode 9: Boosting AI Problem Solving: Tiny Networks and Early Experience Learning

Episode 8: Boosting AI Efficiency: Code Compression, Video Generation, and Experience-based Reasoning

Episode 7: Agents of Change: From Interactive Papers to Lifelong AI Learning

Episode 6: Steering the Future: Real-Time Long Video, Training-Time Search, and a Gym for Agentic LLMs

Episode 5: Scaling Feedback, Forgetting Smartly, and Video Agents: AI’s Next Frontier

Episode 4: Panoramas, HALA, and the T2I Exam: Three Trends You Shouldn’t Miss

Episode 3: Swarms, Tiny Robot Policies & HuMo

Episode 2: Boundaries Checked, Populations Evolved, Images Understood

Hugging Face Trending Papers (Ep. 1) — ScaleCUA, FlowRL, RynnVLA-001

Popular Podcasts

Episode. 15: Real-Time AI: Video, Proactive LLMs & Text Structure

Episode 14: Revolutionizing Deep Learning: The Rise of CUDA Agent and Agentic RL