Slight Reliability

Slight Reliability

Learning SRE, one day at a time.

Episodes

December 16, 2025 โ€ข 31 mins

Send us a text

From the day we invented computers we've been struggling to keep applications running and delivering services to the business. Is this latest wave of AI helping or hurting us?

This week I'm joined by Causely founder Shmuel Kliger to dive into...

๐ŸŒŠ The three waves of AI hype over the decades (the history of AI)
โ˜ ๏ธ The dangers of over-promising and under-delivering what AI can do
๐Ÿง  What is causal reaso...

Mark as Played

Send us a text

What is operational intelligence and how is it different from observability or BI?

This week I'm joined by SquaredUp's VP of Innovation Adam Kinniburgh to answer that question and many more including...

โ“ What is operational intelligence?
๐Ÿ™ˆ Relating observability back to customer, business, or revenue
๐Ÿ˜Ž The value of giving stakeholders confidence
๐ŸŒ‰ Who bridges the gap between tech and business o...

Mark as Played

Send us a text

How does leading platform teams differ from leading product teams?

This week I'm joined by experienced technology leader Dinesh Sukhija to answer that question and many more including...

โ“ What is a platform team?
โšฝ Coaching engineers to focus on outcomes
โ˜€๏ธ Connecting platform initiatives to business goals
โœ‹ Identifying the limiters in your team
๐ŸŽค Spreading knowledge and avoiding single points of ...

Mark as Played
November 18, 2025 โ€ข 19 mins

Send us a text

How has my first two years as a manager in tech been? What have I learned? What do I need to work on?

This week I share my experiences over the past couple of years. I cover:

๐Ÿ”ฅ My recent close call with burnout
๐Ÿซถ How I attempted to build a team culture
๐Ÿ’ช The importance of tough conversations
๐Ÿฅฑ How roles and responsibilities might be boring to think about but is critical
โ“ What's next?

...and much...

Mark as Played

Send us a text

How could AI help human beings negotiate the mountains of telemetry we collect to get simple and fast insight?

This week I'm joined by Ottermon AI CEO and founder Checo Pacheco about the lifecycle of observability coverage and tooling within organisations and how AI is helping to find signals amongst the noise and reduce cognitive load for SREs. We discuss...

๐ŸŽ‚ The need for a layer of logic on top of our telemetry...

Mark as Played

Send us a text

What is chaos engineering and how is it being used in 2025?

This week I'm joined by Gremlin CEO and founder Kolton Andrus to discuss...

๐ŸŒช๏ธ What is chaos engineering and what is its origins?
๐Ÿชด How has it evolved over the year?
๐Ÿค– The role of AI agents in SRE work
๐Ÿ’ฐ Justifying the value of chaos engineering
๐Ÿƒโ€โ™€๏ธโ€โžก๏ธ How do I get started?

...and much more.

You can find Kolton on:

LinkedIn: https://www....

Mark as Played
October 7, 2025 โ€ข 23 mins

Send us a text

What are Team Topologies? How can they be used to deliver value simpler and more effectively (and in a more humane way)?

This week I'm joined by Luke McManus to discuss...

โ›ฐ๏ธ What are the four team topologies?
๐Ÿ† Can we have too much collaboration?
โŒš Team interaction models
๐ŸŒ Cognitive load
๐Ÿƒโ€โ™€๏ธโ€โžก๏ธ Value dynamics mapping

...and much more.

You can find Luke on:

LinkedIn: https://www.linkedin.com/in/lu...

Mark as Played

Send us a text

How do you begin contributing to an open source project? What's it like? What do you get out of it?

This week I'm joined by Wendy Ha who shares her unique story of joining the Kubernetes project and becoming a contributor. We explore...

โ›ฐ๏ธ What it's like working on one of the biggest open source projects in the world
๐Ÿ† The benefits of contributing to open source
โŒš How much time and effort does it...

Mark as Played

Send us a text

As an #SRE how do you influence senior leadership to get support and priority for the things you care about?

To answer this question I'm joined by Nora Jones, founder of Jeli and now Head of Pricing, Product Strategy and Growth at PagerDuty. Our conversation touches on...

๐Ÿค How understanding needs to flow both ways (between engineers and leaders)
๐ŸŽจ Reliability is as much an art as a science
๐Ÿ“ Using napki...

Mark as Played

Send us a text

This week I do a retrospective on the Slight Reliability podcast.

๐Ÿ‘‚ How many people listen to it?
โค๏ธ How do I feel about the show?
๐ŸŽ‰ What's going well?
๐Ÿชด What could be better?
โ” What's next for the show?

If you want to check out the podcast that came before Slight Reliability, you can find Performance Time archived on YouTube here:
https://www.youtube.com/@performance-time

You can find St...

Mark as Played
August 12, 2025 โ€ข 38 mins

Send us a text

Have you burned out at work? What was your experience? How did you work through it?

This week I'm joined by the incredible Colette Alexander to discuss what burnout is, what it means, and we both share our personal experiences burning out at work. We cover...

๐Ÿ”ฅ What is burnout?
โ“ Why does it happen?
๐Ÿซ€ What are the symptoms?
๐ŸฅŠ Fight, flight, or freeze
๐Ÿง‘โ€๐Ÿš’ Advice on how to recover

...and much more...

Mark as Played

Send us a text

This week I'm joined by the wonderful Hanson Ho to discuss the unique challenges and opportunities in making our mobile apps observable! We cover...

๐Ÿ“ฑ The mobile/backend observability divide
โœ๏ธ The challenge of distributed tracing on mobile apps
๐ŸŒ The entire device runtime environment matters for your app
๐Ÿ‘ค The quest for user-centric mobile observability
โœ… Advice on how to get started with mobil...

Mark as Played

Send us a text

This week on the I'm joined once more by SRE leader Michelle Casey who gives a broad and shallow introduction to resilience engineering. We cover...

๐Ÿ‹๏ธโ€โ™€๏ธ Reliability VS Robustness VS Resilience
๐Ÿงฉ What is a complex system?
๐Ÿ”ข Safety one/safety two
๐Ÿง  Mental models
๐Ÿ˜ฉ Human error

...and so much more.

Resources from this episode:

Four concepts for resilience (paper) by Dr. David Woods https://www.rese...

Mark as Played
June 24, 2025 โ€ข 48 mins

Send us a text

This week on the 100th episode I'm joined by DevOps and Resilience Engineering legend John Allspaw to talk about learning (especially from incidents). We discuss...

๐Ÿ“’ Classroom VS situated learning
๐Ÿค The myth of the perfect handover
ITIL as a coping strategy to try and make sense of the organic, wild, and messy
๐Ÿฅ• How you cannot incentivise to avoid incidents (it doesn't work that way)
โค๏ธโ€๏ฟฝ...

Mark as Played

Send us a text

This week I'm joined by SRE leader Trent Hornibrook who shares a story about how he improved on-call early in his career, and then we explore the broader theme of focusing on the things that matter in observability, incident response, on-call, and beyond. We discuss...

๐Ÿ”Œ Empowering engineers to implement change in your org
๐Ÿง‘โ€๐Ÿผ Focusing on what matters (customer & business > technology)
๐Ÿ‘€ Not jus...

Mark as Played

Send us a text

This week I'm joined by SRE leader Andrew Hatch from Cisco ThousandEyes to talk about a dirty word in the resilience community... root cause. In this excellent conversation we explore...

๐ŸŒŒ Is the root cause of every incident the big bang?
๐Ÿฆ– How the value of root cause degrades as complexity increases
๐Ÿซฃ That if the culture is not blameless, people will hide things
๐ŸŒณ Alternative approaches to root ca...

Mark as Played

Send us a text

This week I'm joined by David Dick from 2 Steps to (finally!) discuss synthetic monitoring. We cover...

๐Ÿค– What is synthetic monitoring?
๐Ÿฆพ What are the benefits and drawbacks to using it?
โ˜ข๏ธ Non-web based synthetics (the tough stuff)
๐Ÿน Combining RUM and synthetics
๐Ÿซข Does synthetics need an OTEL-like framework?

...and much more.

You can find David on:

LinkedIn: https://www.linkedin.com/in/david-dick...

Mark as Played
April 23, 2025 โ€ข 31 mins

Send us a text

This week I'm joined by Cin7 Engineering Director Milan Brown to unpack the challenges of technology management and leadership. We discuss...

โœ–๏ธ Theory X vs Theory Y management
๐Ÿ—ฃ๏ธ Intention based leadership and communication
๐Ÿข Conditions in an org for people to thrive
๐Ÿ˜ตโ€๐Ÿ’ซ How do you learn to manage and lead?
๐Ÿซค Managing people when you're not an expert in what they do

...and much more.

Resou...

Mark as Played
March 28, 2025 โ€ข 36 mins

Send us a text

This week Leon Adato and I break down the state of applying for roles in tech. We cover...

๐Ÿ“ What a resume or CV is and is not
๐Ÿค Leveraging your connections rather than relying on applying cold
๐Ÿช„ How most job descriptions are works of fiction
๐Ÿฆพ White-fonting to game AI resume assessment
๐Ÿงช Experimental ways we could recruit

...and our pitch for Kubernetes the Rock Opera (and much more)

You can find Le...

Mark as Played

Send us a text

This week Priyam Kumar shares his story of moving from a massive organisation to a startup and the challenges and growth that came from that. We discuss...

๐Ÿช– War stories and examples of production incidents
๐Ÿฉน The "hacks" we build to keep things running (and how maybe that's just normal)
๐Ÿ˜Ž Keeping it simple... YAGNI (You Ain't Gonna Need It!)
๐Ÿงฏ The perils of getting stuck in reactive ...

Mark as Played

Popular Podcasts

    If you've ever wanted to know about champagne, satanism, the Stonewall Uprising, chaos theory, LSD, El Nino, true crime and Rosa Parks, then look no further. Josh and Chuck have you covered.

    Dateline NBC

    Current and classic episodes, featuring compelling true-crime mysteries, powerful documentaries and in-depth investigations. Follow now to get the latest episodes of Dateline NBC completely free, or subscribe to Dateline Premium for ad-free listening and exclusive bonus content: DatelinePremium.com

    The Bobby Bones Show

    Listen to 'The Bobby Bones Show' by downloading the daily full replay.

    The Joe Rogan Experience

    The official podcast of comedian Joe Rogan.

    Betrayal: Weekly

    Betrayal Weekly is back for a brand new season. Every Thursday, Betrayal Weekly shares first-hand accounts of broken trust, shocking deceptions, and the trail of destruction they leave behind. Hosted by Andrea Gunning, this weekly ongoing series digs into real-life stories of betrayal and the aftermath. From stories of double lives to dark discoveries, these are cautionary tales and accounts of resilience against all odds. From the producers of the critically acclaimed Betrayal series, Betrayal Weekly drops new episodes every Thursday. Please join our Substack for additional exclusive content, curated book recommendations and community discussions. Sign up FREE by clicking this link Beyond Betrayal Substack. Join our community dedicated to truth, resilience and healing. Your voice matters! Be a part of our Betrayal journey on Substack. And make sure to check out Seasons 1-4 of Betrayal, along with Betrayal Weekly Season 1.

Advertise With Us
Music, radio and podcasts, all free. Listen online or download the iHeart App.

Connect

ยฉ 2025 iHeartMedia, Inc.