January 24, 2024 61 mins

On the premiere episode of the AI Inside podcast, hosts Jeff Jarvis and Jason Howell talk with Rich Skrenta of the Common Crawl Foundation about AI and copyright. News outlets are limiting access to content they publish publicly, which affects the integrity of Common Crawl's internet archive. In recent years, the archive has been used as training data for LLMs, so restricting access has a dramatic impact on the quality of the data that remains.


INTERVIEW

  • Introduction and background on AI Inside podcast
  • Discussion of the recent Senate hearing on AI oversight at which Jeff testified
  • Introduction of guest Rich Skrenta from Common Crawl Foundation
  • Overview of Common Crawl and its goals to archive the open web
  • Discussion of how Common Crawl data is used to train AI models
  • News publishers wanting content removed from Common Crawl
  • Debate around copyright, fair use, and AI’s “right to read”
  • Mechanics of how Common Crawl works and what it archives
  • Concerns about restricting AI access to data for training
  • Risk of regulatory capture and only big companies being able to use AI
  • Discussion of recent court ruling related to web scraping
  • Hopes for Common Crawl's growth and evolution


NEWS BITES

  • Interesting device announcement from CES - Rabbit R1 with Perplexity AI integration
  • Study on actual risk of AI automating jobs away in the near future


Keep connected to the show:


WEBSITE: http://aiinside.show

VIDEO: http://www.youtube.com/@YellowgoldStudios?sub_confirmation=1

PATREON: http://www.patreon.com/aiinsideshow

TWITTER: http://www.twitter.com/AIInsideShow

INSTAGRAM: http://www.instagram.com/aiinsideshow

THREADS: https://www.threads.net/@aiinsideshow

MASTODON: https://mastodon.social/@aiinsideshow



Hosted on Acast. See acast.com/privacy for more information.
