July 2, 2024 32 mins

Today we join Maria Khalusova, Staff Developer Advocate at Unstructured.IO, to discuss how companies can unlock their unstructured data to get better results from their large language models. We talk about how unstructured data can improve the performance of RAG applications, RAG vs. fine-tuning, data chunking, multimodal models, and more.

AWS Hosts: Nolan Chen & Malini Chatterjee

Unstructured Enterprise Platform beta signup: 

https://unstructured.io/platform


Embedding models MTEB Leaderboard: 

https://huggingface.co/spaces/mteb/leaderboard


2019 Deloitte report (source of the statistic that only 18% of organizations were using unstructured data):

https://www2.deloitte.com/us/en/insights/topics/analytics/insight-driven-organization.html


Source for the claim that 80% of data is unstructured:

https://mitsloan.mit.edu/ideas-made-to-matter/tapping-power-unstructured-data


Papers showing RAG outperforming fine-tuning: 

https://arxiv.org/abs/2312.05934

https://arxiv.org/abs/2401.08406


Email Your Feedback: rethinkpodcast@amazon.com
