Tales at Scale

Tales at Scale

Tales at Scale cracks open the world of analytics projects. We’ll be diving into Apache Druid, a high performance, real-time analytics database, but also hearing from folks in the data ecosystem tackling everything from architecture to open source, from scaling to streaming and everything in between- brought to you by Imply.

Episodes

January 9, 2025 39 mins

Held in October 2024, Druid Summit brought Apache Druid® community contributors at companies including Netflix, Salesforce, Atlassian, Imply, Roblox and more together to discuss the latest trends, challenges, and best practices across the Druid community.

The summit explored user experience design, operations and optimization techniques, and lakehouse and streaming analytics pipelines. And on this featured...

Mark as Played

On this episode, we explore how Kong, a cloud-native API gateway, leverages Apache Druid for real-time data processing and analytics in their platform, Kong Konnect. Hiroshi Fukada, Staff Software Engineer at Kong, shares his insights on managing customer data through Kong Gateway and transitioning to Imply's managed Druid services to simplify their infrastructure. Discover the benefits of Druid, like low latency and ease of use, a...

Mark as Played

On this episode, we are joined by special co-host Hugh Evans and returning guest Will Xu as we announce Druid Summit 2024 and dive into Druid 30.0's new features and enhancements. Improvements include better ingestion for Amazon Kinesis and Apache Kafka, enhanced support for Delta Lake, and advanced integrations with Google Cloud Storage and Azure Blob Storage. 

Come for the technical upgrades like GROUP B...

Mark as Played

On this episode, we’re diving into digital ad spend and real-time data with Miguel Rodrigues, Head of Engineering at British media company Global. We’ll discuss their use of Apache Druid to enhance real-time analytics for their digital adve...

Mark as Played

On this episode, we are joined by Ross Morrow, a Software Engineer at Finix, the payment processor working to create the most accessible financial services ecosystem in history. Finix’s B2B payments platform is designed for flexibility and scalability, streamlining financial transactions for businesses and delivering a truly customer-centric experience. Faced with the need for a powerful database for real-time insights, Finix turne...

Mark as Played

On this episode, we’re going all in on cybersecurity!  Helping us with what critical aspects of security you need to focus on when building analytics applications is Carrell Jackson, CISO at Imply. We’ll discuss the importance of protecting sensitive data by implementing role-based access control and encryption and hear about best practices for securing a Druid cluster. Listen to learn more about how Imply takes a security-first ap...

Mark as Played

On this episode, we explore Apache Druid 29.0, focusing on three specific themes: performance, ecosystem, and SQL compliance. Discover new features such as EARLIEST / LATEST support for numerical columns, system fields ingestion, and enhanced array support like UNNEST and JSON_QUERY_ARRAY. In addition, get the full scoop on community-contributed extensions like Spectator Histogram and DDsketch for efficient quantile calculations an...

Mark as Played

In this special episode of Tales at Scale - this is our final episode of our first season! - Peter Marshall, Director of Developer Relations at Imply joins the show to discuss the highlights of 2023 for Apache Druid. We dive into the significant feature releases and enhancements that have transformed Druid over the past year, including the SQL standardizaion, query from deep storage, experimental window functions, and the growing D...

Mark as Played

On this episode, we dive into Apache Druid 28. This latest Druid release includes improved ANSI SQL and Apache Calcite support, the addition of window functions as an experimental feature, async queries and query from deep storage going GA, array enhancements, multi-topic Apache Kafka ingestion, and so much more! Will Xu, program manager at Imply returns to give us the full scoop.

Mark as Played

On this episode, we debunk the myth that Druid can't do joins. Druid doesn't function as a traditional relational database because it was purpose-built for lightning-fast queries on large datasets. However, this doesn't mean Druid is entirely devoid of join capabilities – it simply approaches them differently. Our myth-busting team features returning guests Sergio Ferragut and Hellmar Becker from Imply ready to clarify how Druid ha...

Mark as Played

On this episode, we explore how Atlassian leverages Apache Druid's capabilities to handle millions of daily events and empower users with intelligent data-driven features. We’re joined by Gautam Jethwani and Kasirajan Selladurai Selvakumari from the Confluence Big Data Platform Team who will talk through how they use Druid to power intelligent features, sub-second query latency, and complex ingestion tasks.

Mark as Played

When it comes to fraud detection, initial detection is key, but so is the ability to quickly dissect and address the problem to minimize losses. This means access to real-time data is paramount. The only way to combat fraud in the digital age is to fight fire with fire…automation with automation. In this episode, we’re joined by Jaylyn Stoesz, Staff Data Engineer at Ibotta, a free cashback rewards platform, who walks us through Ibo...

Mark as Played

We’re back again with another Druid release! Here we are at Apache Druid 27.0, thanks to the dedication of the Druid Community. This release was made possible by over 350 commits & 46 contributors. Will Xu, Product Manager at Imply joins the show to discuss new features like Smart Segment Loading, a new mechanism for managing data files as the database scales, improvements to schema auto-discovery, and the long-awaited feature – qu...

Mark as Played

Real-time data has many applications but one place where it’s extremely valuable is with usage tracking, billing, and generating reports. Ensuring the freshness and availability of this data is not only essential for financial success but also for establishing a more challenging aspect—trust. That's precisely why Orb chose Apache Druid and Imply as the backbone of their advanced pricing platform. This platform encompasses invoicing...

Mark as Played

Apache Kafka® is a streaming platform that can handle large-scale, real-time data streams reliably. It’s used for real-time data pipelines, event sourcing, log aggregation, stream processing, and building analytics applications. Apache® Druid is a database designed to provide fast, interactive, and scalable analytics on time-series and event-based data, empowering organizations to derive insights, monitor real-time metrics, and bui...

Mark as Played

Today's show is all about the world of big data and open source projects, and we've got a real gem to share with you—Voltron Data!  They're on a mission to revolutionize the data analytics industry through open standards. To unleash the untapped potential in data, Voltron Data uses cutting-edge tech and provides top-notch support services, with a special focus on Apache Arrow. This open-source framework lets you process data in bot...

Mark as Played

Who better to talk about the real-world usage of Apache Druid than Digital Turbine, a leading mobile growth and monetization platform? The folks at DT go way back with Druid. On this episode Lioz Nudel, Engineering Group Manager at Digital Turbine and Alon Edelman, Data Architect at Digital Turbine discuss how Druid has significantly improved their analytics infrastructure in terms of performance and scalability. We cover their jou...

Mark as Played

Whether you're a data engineer, data scientist, technology enthusiast, or just a person on the Internet, you’ve heard about ChatGPT. But did you know there are some great use cases for it that work with Apache Druid? Druid and ChatGPT are two cutting-edge technologies that are revolutionizing the world of real-time analytics and natural language processing. In this episode, we’re joined by Rick Jacobs, Senior Technical Evangelist a...

Mark as Played

Deploying and configuring Apache Druid manually in a Kubernetes environment can be complex and time-consuming. But it doesn’t have to be. Enter Druid Operator, a tool specifically designed for managing Apache Druid deployments in a Kubernetes environment. Adheip Singh, founder of DataInfra and contributor to Druid Operator, walks us through the benefits, including managing upgrades and rollbacks of Druid clusters, scaling Druid clu...

Mark as Played

Breaking news! Apache Druid 26.0 is now available! Druid 26.0 has a few key features including schema auto discovery and shuffle JOINs but that’s not all. On this episode, we’re joined by Vadim Ogievetsky, Apache Druid PMC, co-founder of Imply and one of the very first Druid users, to talk through what’s new and why it’s cool. Special thanks to the Apache Druid community:  60+ contributors and the nearly 400 commits made this possi...

Mark as Played

Popular Podcasts

    Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations. Follow now to get the latest episodes of Dateline NBC completely free, or subscribe to Dateline Premium for ad-free listening and exclusive bonus content: DatelinePremium.com

    Stuff You Should Know

    If you've ever wanted to know about champagne, satanism, the Stonewall Uprising, chaos theory, LSD, El Nino, true crime and Rosa Parks, then look no further. Josh and Chuck have you covered.

    Intentionally Disturbing

    Join me on this podcast as I navigate the murky waters of human behavior, current events, and personal anecdotes through in-depth interviews with incredible people—all served with a generous helping of sarcasm and satire. After years as a forensic and clinical psychologist, I offer a unique interview style and a low tolerance for bullshit, quickly steering conversations toward depth and darkness. I honor the seriousness while also appreciating wit. I’m your guide through the twisted labyrinth of the human psyche, armed with dark humor and biting wit.

    The Bobby Bones Show

    Listen to 'The Bobby Bones Show' by downloading the daily full replay.

    The Clay Travis and Buck Sexton Show

    The Clay Travis and Buck Sexton Show. Clay Travis and Buck Sexton tackle the biggest stories in news, politics and current events with intelligence and humor. From the border crisis, to the madness of cancel culture and far-left missteps, Clay and Buck guide listeners through the latest headlines and hot topics with fun and entertaining conversations and opinions.

Advertise With Us
Music, radio and podcasts, all free. Listen online or download the iHeart App.

Connect

© 2025 iHeartMedia, Inc.