All Episodes

October 26, 2025 18 mins

The October 25, 2025 Bytedance paper introduces **Open-o3 Video**, a novel framework developed by researchers from **Peking University** and **ByteDance**, aimed at advancing video reasoning by incorporating explicit spatio-temporal evidence. Unlike prior models that only generate textual rationales, Open-o3 Video explicitly highlights key **timestamps** and **bounding boxes** to ground its answers in visual observations. To achieve this, the authors curate two new datasets, **STGR-CoT-30k** and **STGR-RL-36k**, and utilize a two-stage training strategy involving supervised fine-tuning and **Group Sequence Policy Optimization (GSPO)** with specialized rewards. This approach, which includes adaptive temporal proximity and temporal gating mechanisms, significantly improves performance on the **V-STAR benchmark** and other video understanding tasks, making video reasoning more accurate and verifiable.


Source:

https://arxiv.org/pdf/2510.20579

Mark as Played

Advertise With Us

Popular Podcasts

Spooky Podcasts from iHeartRadio
Dateline NBC

Dateline NBC

Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations. Follow now to get the latest episodes of Dateline NBC completely free, or subscribe to Dateline Premium for ad-free listening and exclusive bonus content: DatelinePremium.com

Stuff You Should Know

Stuff You Should Know

If you've ever wanted to know about champagne, satanism, the Stonewall Uprising, chaos theory, LSD, El Nino, true crime and Rosa Parks, then look no further. Josh and Chuck have you covered.

Music, radio and podcasts, all free. Listen online or download the iHeart App.

Connect

© 2025 iHeartMedia, Inc.