
Ruby Rogues

The Sounds of Silence: Lessons From an API Outage with Paul Zaich - RUBY 652

Mon, 23 Sep 2024

Description

Paul Zaich from Checkr tells us about a critical outage that occurred, what caused it, and how they tracked down and fixed the issue. The conversation ranges through troubleshooting complex systems, building team culture, blameless post-mortems, and monitoring the right things to make sure your applications don't fail, or alert you when they do.

Links

Paul's Twitter
Paul's LinkedIn

Picks

Blood Pressure Monitor - Dave
eft - Luke
Ruby one-liners cookbook - Paul
Podcast Growth Summit - Chuck
Most Valuable Dev - Chuck
Most Valuable Dev Summit - Chuck
Mushroom Wars - Chuck
Gmelius - Chuck

Become a supporter of this podcast: https://www.spreaker.com/podcast/ruby-rogues--6102073/support.

Transcription

5.742 - 24.793 Luke Stutters

Hey, everybody, and welcome to another episode of the Ruby Rogues podcast. This week on our panel, we have Luke Stutters. Hello. We have Dave Kimura. Hey, everyone. I'm Charles Max Wood from devchat.tv. Quick shout out about mostvaluable.dev. Go check it out. We have a special guest this week, and that is Paul Zaich.

25.214 - 27.015 Paul Zaich

Zaich. Well done. Thank you.

27.924 - 42.288 Luke Stutters

Now, you're here from Checkr. You gave a talk at RailsConf about how you broke stuff or somebody broke stuff. Do you want to just kind of give us a quick intro to who you are and what you do? And then we'll dive in and talk about what broke and how you figured it out?

42.608 - 54.957 Paul Zaich

Sure. So I've been a software engineer for about 10 years. Recently, in the last year or so, I transitioned into an engineering management role. But I've worked at a number of different small startups.

54.977 - 68.865 Paul Zaich

I joined Checkr in 2017 when the company was at about 100 employees, 30 engineers, contributed as an engineer for a couple of years to our team, and then have recently transitioned, like I said, into an engineering management role at the company.

69.221 - 89.054 Luke Stutters

Very cool. I actually have a Checkr t-shirt in my closet that I never wear. It's Checkr for those that are listening and not reading it. Yeah. So why don't you kind of tee us up for this as far as, yeah, what happened? What broke? Yeah. Give us a preliminary timeline and explain what Checkr does and why that matters.

89.494 - 104.825 Paul Zaich

Sure. So Checkr was founded in 2014. Daniel and Jonathan, our founders, had worked in the on-demand space at another company and had discovered that it was very difficult to integrate background checks into their onboarding process.

105.046 - 119.916 Paul Zaich

Background checks tend to be a very important final safety step for a lot of these companies to make sure that their platform is going to be safe and secure for their customers. And so in 2014, they started an automated background check company.

120.356 - 146.593 Paul Zaich

And initially, the biggest selling point was that Checkr abstracted away a lot of the complexity of the background check process, collecting candidate information, and then executing that flow and exposing that via an API that was developed in a Sinatra app. And three years later, in 2017, I had just joined about four or five months before this particular incident happened.

146.793 - 165.829 Paul Zaich

Fast forward to that point, we were running, I'd say, a few million checks a year for a variety of different customers. Most of those customers use our API, like I said before, to interface with the candidate on their side in their own application.

166.21 - 179.706 Luke Stutters

Oh, that's interesting. Yeah, I think a lot of the background check portals that I've seen, they're like the fully baked portal instead of being a background service that somebody else can integrate into their own app.

180.171 - 183.315 Dave Kimura

Did you do a background check on me before this episode?

183.715 - 205.775 Paul Zaich

I did not. There are a lot of very important guidelines and stipulations governed by the Fair Credit Reporting Act that make sure that you have to have a permissible purpose for running a background check. So in this case, most of our customers are using the permissible purpose around employment as the reason for actually running that check.

205.895 - 206.915 Chuck

Well, that's no fun.

206.935 - 217.097 Luke Stutters

I know, right? I want to know everybody's dirty secrets. Interesting. So, yeah, why don't you tell us a little bit about what went down with the app, right?

217.457 - 240.881 Paul Zaich

So, like I said, in 2017, Checkr at this point was a pretty important component of a number of customers' onboarding processes. But we had started off small and things grew quickly. In a lot of ways, we were just trying to keep the lights on and scale the system along with our customers as they continued to grow. On-demand was growing a lot at this time as well.

241.141 - 265.354 Paul Zaich

So in 2017, we were doing some fairly routine changes to a data model. I wasn't directly involved with that, but we were changing something from an integer ID to a UUID and the references, and there were some backfills that needed to happen. And so an engineer executed a script on a Friday afternoon, which is always a great idea to do.
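
Purely to make the scenario concrete, here is a minimal sketch of what a batched backfill of this kind often looks like; the model and column names below are invented for illustration, not Checkr's.

```ruby
# Hypothetical one-off script: copy a new UUID reference onto existing
# child records in batches, skipping validations and callbacks.
class BackfillScreeningReportUuid
  def self.run!
    Screening.where(report_uuid: nil).in_batches(of: 1_000) do |batch|
      batch.each do |screening|
        screening.update_columns(report_uuid: screening.report&.uuid)
      end
      sleep 0.5 # throttle so the backfill doesn't saturate the primary database
    end
  end
end
```
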

266.635 - 287.186 Paul Zaich

And they actually ran the script at about 4:30 PM, probably went and grabbed something, had a little happy hour and then headed home. And about an hour later, we started to receive a few different pages to completely unrelated teams that didn't really know what was going on in terms of this backfill. And it didn't look like anything too serious.

287.306 - 301.153 Paul Zaich

It was just an elevated number of exceptions in our client application that does some of the candidate PII collection. And so that team decided to snooze that and just kind of ignore it.

301.333 - 308.437 Luke Stutters

Yeah. So for people that aren't aware, PII is an acronym for Personally Identifiable Information, and is usually protected by law.

308.878 - 334.145 Paul Zaich

Thank you. Anyway, go ahead. So come Saturday morning, this has been going on for about 12 hours now, this exception comes in again. And at that point, someone on our team actually decided to escalate that and get more stakeholders involved. We had some variety of other issues going on. We just migrated from one deployment platform to Kubernetes. And so we had some issues getting onto the cluster.

334.846 - 350.7 Paul Zaich

There are too many of us trying to get on at the same time. So we ended up all having to actually go into the office to the physical intranet to finally get in and debug the issue. So we had a couple of other confounding issues come up at the same time that made the process of response even worse.

351.08 - 370.976 Paul Zaich

So finally, this is maybe 10 or 11 in the morning, after being able to take a look at that, we finally identified what the issue was. We were only responding successfully to about 50 to 60% of requests to one of the most critical endpoints on our system, which is the request to actually create a report.

371.137 - 393.175 Paul Zaich

So after you've collected the candidate's information, you say, please execute this report so we can get that back. And that's a synchronous request that you make using our API. And when that request was failing, it was failing about 40 to 50% of the time with a 404 response, which isn't really expected. So at that point, we were finally able to pin down the issue and it came back to this script.

393.475 - 412.875 Paul Zaich

And it turned out that when you went to create this report, we would create these additional sub-objects called screenings. And due to the script, we had actually created an issue where a validation would cause the reports to fail to create in this edge case. So there were some confounding issues with the way that we had set up the data

412.915 - 427.848 Paul Zaich

modeling to begin with that we were trying to work around, and this exception happened. But when we finally fixed the issue, that's where we shifted more into what actually went wrong and what the real issues were that caused the outage.

428.271 - 447.439 Luke Stutters

Gotcha. So I'm curious, as you work through this, what did you add to your workflow to make sure that this doesn't happen again? Because I mean, some of it's going to be technical, right? It's testing or, you know, maybe you set up a staging environment or something like that. And some of it is going to be, hey, when this kind of alert comes up, do this thing, right?

447.499 - 450.46 Luke Stutters

Because it sounded like you did have some early indication that this happened.

450.931 - 473.575 Paul Zaich

Right. So I think the first most important thing that we did was that really from the beginning, we have had what we call a blameless culture. I think it's a common term now in the industry, but the idea there is to really focus on learning from issues, not trying to find who made the particular mistake and trying to look at what processes

477.443 - 500.837 Paul Zaich

as well, that would have prevented the problem from happening. So not trying to focus on the individual mistake that was made. So as part of that, we did a postmortem doc and we went through and identified things like, one, we should really have a dedicated script repository that goes through a code review process. So that's one thing we implemented. And we put some safeguards in place

501.517 - 521.313 Paul Zaich

to address this particular issue with the data models as well. But I think for everyone, the bigger issue is really the fact that we missed the outage for so long. And we did actually have some monitoring in place for this particular issue that should have paged for the downtime that we were experiencing for some report creation.

521.513 - 544.493 Paul Zaich

But it turned out that our monitors were really just not set up in the most effective way to trigger for that particular type of outage. In this case, it was a partial outage, and that requires a much more sensitive monitor in order for us to detect. Everything we designed beforehand was much more targeted towards a complete failure of our system.

544.898 - 549.401 Chuck

And so was this something that could have been caught by automated tests?

549.842 - 576.111 Paul Zaich

This particular issue most likely could not have been caught by an automated test because it was so... so outside of the norm of what we expected the data to look like. We had, of course, unit tests for everything that we were running and we had request specs as well. We did not have an end-to-end environment set up for a staging environment where you could run these tests end-to-end.

576.711 - 592.452 Paul Zaich

But again, the data in this particular case was very old and it was essentially doing a migration in our code base at this point. So I'm not sure we could have anticipated this particular issue.

592.812 - 605.456 Dave Kimura

What was the fallout? Did everyone like phone up and get really angry? Oh, from a customer perspective? Yeah, yeah. This is the best bit of outage stories is the kind of the human cost of whoever has to answer the phone the next week.

605.736 - 606.376 Luke Stutters

Code drama.

606.696 - 626.114 Paul Zaich

Right, that's always one of the concerns. Especially as your application becomes more important to customers, the impact to customers is more and more extreme. And so in this case, I think this was a Friday night. It wasn't something where a lot of our customers were actively monitoring on their end as well.

626.294 - 650.195 Paul Zaich

Fortunately, we were able to see that retries were happening and many of our customers use a retry fallback mechanism. So they were able to just allow those to run through. But this is particularly tricky in this case because there wasn't actually a record ID for many of these particular responses. Fortunately, we do keep API logs.

650.535 - 670.068 Paul Zaich

So we were able to see exactly which requests failed for each of our customers. And so we were able to then reach out to our customer success team and they were able to start to share the impact with each of those customers pretty quickly. I will say that we've done a lot of work to make our customer communication a lot more polished since then.

670.269 - 688.58 Paul Zaich

And that's something that we're really focusing on now as well, just being able to get more visibility to customers sooner. And one of the most important things there, when it comes to monitoring, is that you really want to be able to find the issue and be able to start to investigate it before... You don't want a customer to identify it first.

688.7 - 693.683 Paul Zaich

You should really understand what's happening in your system before anyone else detects that issue.

693.863 - 715.093 Dave Kimura

And I guess for this specific, not this specific product, but kind of product where your customers are consuming your API, you're also at the mercy of their implementation too. So, you know, they're making a kind of call against you. And if that call is failing, you know, you've got to hope that their system can cope with that as well.

715.613 - 725.46 Paul Zaich

Exactly. If some of these requests were happening in the browser or were not set up to automatically retry, that could be a much worse impact on the customer.

725.78 - 746.976 Dave Kimura

Can we talk about the blameless culture for a bit? This is a new idea. And when I was managing engineering teams, I used to have what I called the finger of blame. So I used to do it the other way around. I would hold up my finger in a meeting and I'd introduce the finger as the finger of blame. And then we'd work out who the finger of blame should be pointing to.

746.996 - 767.135 Dave Kimura

Now, more often than not, of course, it was me. So the finger of blame was a double-edged finger. But it was a kind of way of, you know... people take it very seriously when they mess up that kind of stuff. So you kind of have to get your team back on board. So it was a way of kind of lightening the mood after that week's disaster.

767.355 - 788.986 Dave Kimura

But a blameless culture, as you said, is the kind of more sophisticated way of doing it instead of pointing a jovial finger at the person who messed up. What does that look like? I mean, you know, do you just go around telling people it's not their fault? Or, you know, how do you implement a blameless culture in what sounds like quite a big engineering team?

789.346 - 814.707 Paul Zaich

I think it starts for us with, it really started with our CTO and co-founder, Jonathan, making that a priority from pretty much day one, basically from the beginning. Whenever we've had issues or incidents, we've done a postmortem doc. We've had a process around that. And it's always been very forward-looking, very much about what could we have done better?

815.187 - 834.574 Paul Zaich

What can we improve? What are the things we should be doing going forward? So I think having that first touch point and really having that emphasis from the beginning was really important, and it cascades down. I think as you're building out a bigger engineering team, that's critical to be able to keep that culture going.

835.135 - 853.469 Paul Zaich

And I think that's definitely a challenge to continue doing. But I think as we've grown, we've been able to do that so far. So I think that was step number one. I think a second piece of it is understanding and trying to understand when it's more of a process issue versus something that someone particularly did wrong.

853.649 - 876.028 Paul Zaich

And I think a lot of the time, I think a lot of incidents do occur because you're trying to make different prioritization decisions and you're trying to make sure that you anticipate failure moments in advance. And sometimes you just miss those. And those are particular cases where I think the management team needs to really take responsibility for it.

876.249 - 893.581 Paul Zaich

It's not an individual issue that caused that particular downtime; it wasn't necessarily that one piece of code. And this is an example, I think, actually, where we had some technical debt we were trying to clean up. And that was a good thing.

893.661 - 903.687 Paul Zaich

But I think we didn't necessarily have everything in place to be able to address that technical debt effectively. And that's not necessarily one engineer's responsibility to be out in front of.

903.707 - 921.357 Luke Stutters

Yeah, one thing I just want to add is that I like the blameless culture just from the sense of unless somebody is either malicious, which I have never, ever, ever encountered, or is chronically reckless, which I've also never encountered, right? Everybody is usually trying to pull along in the same way.

921.417 - 941.965 Luke Stutters

You know, if somebody has that issue, you identify it pretty fast and you usually are able to counter it before it becomes a real problem. But yeah, just to put that together, then, you know, yeah, the rest of it, it's, hey, look, we're on the same team. We're all trying to get the same place. So let's talk about how we can do this better so that doesn't happen again.

942.205 - 962.532 Luke Stutters

Because next time it might be me, right? That misses a critical step. And I don't want you all fingering me either. I mean, I want to learn from it, but I, you know, we don't want people... walking around in fear. Instead, if somebody screws up, we want them to come forward and say, hey, I might have messed this up before it becomes an issue next time.

963.052 - 981.944 Paul Zaich

Absolutely. And I think one other thing to highlight here is that when you don't have a blameless culture, folks are going to be very afraid to speak out when they do see an issue, whether they think it was their mistake or someone else's. They're not going to want to escalate that issue and make sure that it gets attention necessarily.

982.564 - 1003.979 Paul Zaich

And so one of the best side effects of having a blameless culture is that you get really engaged response and everyone's going to work together to try to address the issue. I think that even cascades down to customer communication as well, because when you're really engaged in trying to do that, then you're doing the best thing possible. for the customers as well.

1004.019 - 1010.369 Paul Zaich

You're trying to address these issues head on, not trying to find ways to kind of smooth them over under the surface.

1010.849 - 1027.348 Luke Stutters

Yeah, it also, and this is important, and sometimes I think people hear this and they're going to go, That sounds a little scary. But you want people to take chances sometimes, right? You want people to kind of take a shot at making things better. That opens it up to them to do that, right?

1027.469 - 1048.558 Luke Stutters

It's, oh, well, you know, I tried this tweak on the Jenkins file or I tried this tweak on the Kubernetes setup or I tried this tweak on this other thing. And a lot of times those things pay off. But if you don't give people the freedom to go for it, a lot of times you're going to miss out on a lot of those benefits. And again, as long as they're not being reckless about it, right?

1048.618 - 1067.108 Luke Stutters

So they're taking the steps, they're verifying it on their own system and things like that, then you benefit much, much more from people being willing to take a shot. So yeah, so with the blameless culture, I'm curious. So you get together and you start identifying what the issue is. So what does that look like then as far as figuring out what's going on?

1067.128 - 1071.45 Luke Stutters

Because you're not pointing fingers, but you are looking for the commit that made the problem, right?

1071.77 - 1088.08 Paul Zaich

You are. I think at the end of the day, you're going to try to find the root cause, right? You're going to look for that commit. You're going to look for the log. Maybe it was a script that was logged into your logging system, whatever it is. You're going to look for that and look for the root cause.

1088.861 - 1109.158 Paul Zaich

So honestly, a lot of times, you know, if what caused the issue was something that was specifically run by a specific person, they probably feel a little bit of guilt there, but there's no reason to lay on more. And I think everyone, like you said, feels a lot of responsibility around the work that they're doing already. So there's no reason to overemphasize that.

1109.358 - 1128.649 Paul Zaich

So what that looks like is typically the team that is impacted is really going to own that postmortem. And that's one way for you to feel like you're resolving the incident or the issue that caused the incident. This has definitely become a bit of a different process as the team is growing.

1128.729 - 1144.753 Paul Zaich

When we were at 30 engineers, I think it was a little bit easier just to know exactly who should work on those types of mitigations. Typically, it's pretty isolated to a specific team. As the team is growing and the system is growing, that's definitely become more of a challenge because sometimes,

1145.493 - 1166.524 Paul Zaich

incidents happen because of different issues that multiple teams have introduced, or maybe there's multiple teams that need to be involved in the mitigation. And in that case, we've definitely been trying to evolve our postmortem process and the action items. So we have a program manager, and one of her responsibilities is specifically around making sure that we are coordinating some of those action items. We've added

1170.68 - 1187.503 Paul Zaich

some additional rules and coordination around the process as we've started to grow. A lot of it was just on the individual teams initially, and now as we've grown, again, there's more process involved. I think that's a pretty common thing that you have to introduce as teams grow.

1188.044 - 1211.602 Dave Kimura

I will say that if you've got relatives who are in the medical profession, especially if they're pathologists, even the use of the term post-mortem makes me uncomfortable because those are no fun at all. But, yeah, it's also a word that we use. So, yeah, it just makes me – oh, it's creepy – It's all zombies. I don't know.

1211.642 - 1220.684 Dave Kimura

Yeah, the post-mortem brings me flashbacks to episodes of the X-Files in the 90s when Dana Scully was taking an alien apart.

1220.944 - 1231.427 Luke Stutters

Yeah, but it does give you a little perspective too, right? Because usually in our post-mortems, we're talking about what went wrong with the system, not that somebody actually died because of this, right?

1231.727 - 1234.968 Dave Kimura

I just got a weird brain, all right? It's what my brain thinks of.

1235.729 - 1249.879 Luke Stutters

Well, some software is life-supporting, you know, a lot of the medical equipment and stuff out there. But, you know, in this case, yeah, we all want to keep our jobs as well. So, I mean, it's not like we can just blow it off either. So, yeah.

1249.959 - 1260.75 Luke Stutters

So I want to get back to the topic at hand, though, and talk a little bit about what kind of monitoring did you have before and what kind of monitoring you have now in order to catch this kind of thing.

1261.343 - 1288.955 Paul Zaich

So we used a number of different types of monitoring. At the time, we were pretty heavily reliant on exception tracking, and we also had some application performance monitoring as well, commonly called APM. A couple of examples of that would be something like New Relic, or Datadog has a product as well now. And then we did also use a StatsD cluster that sent metrics over to Datadog.

1289.316 - 1308.207 Paul Zaich

And I think we just had started using that maybe just a few months before this particular incident occurred. So like I alluded to before, we had some monitors for this particular issue, but they were pretty simplistic. They basically just looked for a minimum threshold of the number of reports that we're creating.

1308.927 - 1331.402 Paul Zaich

And we had to set that threshold to be very low over like an hour period because traffic is variable. You never know exactly how many reports you're going to get created. There are times of day where we receive very few requests, and then there are other times where we see large spikes. So we just had very simplistic monitoring in place for some of these key metrics at that point.

1331.622 - 1354.18 Paul Zaich

At that point, we were still very heavily reliant on, like I said, exception tracking using bug trackers like Sentry that could then alert if you had certain thresholds of errors over a period of time. In this particular case, exception tracking wasn't very useful because we were responding with a 404. There was an exception in the system.

1354.38 - 1366.988 Paul Zaich

It was just an ActiveRecord record-not-found, something like that, that was then handled automatically and turned into the 404 response. So it was expected behavior, but there wasn't an exception that could have been caught.
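
For readers unfamiliar with the pattern Paul is describing, here is a rough Rails-flavored sketch (Checkr's API was a Sinatra app, so the details differed) of how a rescued record-not-found turns into a clean 404 that never reaches an exception tracker like Sentry; the controller and model names are made up.

```ruby
class ApplicationController < ActionController::Base
  # The exception is raised, but it is rescued here and converted to a 404,
  # so nothing ever surfaces in exception tracking.
  rescue_from ActiveRecord::RecordNotFound do |_error|
    render json: { error: "not_found" }, status: :not_found
  end
end

class ReportsController < ApplicationController
  def create
    # If bad data makes this lookup unsatisfiable, the request "fails" as an
    # expected-looking 404 rather than a tracked exception.
    candidate = Candidate.find(params[:candidate_id])
    render json: Report.create!(candidate: candidate), status: :created
  end
end
```
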

1367.388 - 1373.891 Luke Stutters

Yeah, that makes sense. Somebody typed this question in. It was one of the panelists. Did you get that answered? I don't know if it was Luke or Dave.

1374.051 - 1385.858 Dave Kimura

It was me. Just to be clear, was this incident a monitoring problem or an alerting problem? Because it sounds like an alert did go off at some point.

1386.442 - 1389.204 Chuck

Sounds like it was a people problem because they snoozed the alert.

1389.704 - 1421.744 Paul Zaich

I think this was more of a monitoring problem overall. As Dave mentioned, there was a component where a page was snoozed, but I think that was still a failure on our monitoring system. Because in this case, that was just a signal of what the true issue was. It was a downstream client application that had a page earlier on. And it wasn't clear at all what the issue was.

1422.145 - 1448.669 Paul Zaich

And I think when you're developing a system for alerting, you need to have clear action items. And that's where custom metrics, building application metrics as you grow, become really important: having a clear signal of what's wrong so that someone knows where to investigate. In this case, it was a client application in the browser. There's a lot of noise there.

1449.57 - 1457.494 Paul Zaich

And I can easily understand why someone would just snooze something like that. In my opinion, it wasn't really a people issue in this particular case.

1458.291 - 1482.757 Chuck

Yeah, I think we've all been there before where we get an alert from whatever monitoring that we're doing and the error looks serious, but you kind of read it and like, oh, you know what, this is probably just a one-off situation. And then turns out it is actually a big deal that needs to be addressed as soon as possible. So I know I've been there before and

1483.697 - 1508.687 Chuck

And, you know, the hard thing is to really track this. I use Sentry for my error tracking, and so I get email and text notifications with that. And one of the nice things about it is that it'll show the number of occurrences, whether they are unique or not. So I can see if, okay, this particular error is only coming from one user.

1510.328 - 1533.261 Chuck

Or I could see we're getting 100 errors that's coming from 100 different users. So there's a more widespread problem. So I think definitely getting the notifications, but then having proper analytics on your errors so you can actually see the scope of how big this is can really kind of weigh in on the importance.

1533.841 - 1554.112 Dave Kimura

Yeah, makes sense. I imagine, Dave, you've been through, like me, many different monitoring platforms. Datadog, you said New Relic... which are the good monitoring platforms? Or which ones are you like, this is the platform that works really well for this API situation?

1554.574 - 1582.235 Chuck

I think it all depends on what you're doing. So if you have a heavy JavaScript front end kind of deal, and if you also have a lot of Ruby backend code, I know Sentry can handle both of those situations. Other people will go with another solution. So I personally found Sentry to be my flavor of choice, but mileage will vary based on what other people have.

1582.771 - 1606.545 Paul Zaich

It also depends on where you are in terms of your application's use cases, what the customer profile looks like, how large the company has gotten, how many people are supporting it. When you're early on, building a new application, a new product, by definition the developers on it are going to really understand the whole system very well.

1607.125 - 1631.478 Paul Zaich

So essentially, exception tracking probably is going to be able to give you most of what you need to know in terms of being able to understand what's going on. As the system starts to grow, and especially as you have more discrete teams, I think that's where things like StatsD become more useful, because you can instrument use cases for core parts of your application.

1631.498 - 1652.407 Paul Zaich

And I would maybe say that the bar there is maybe when you start to hit the point where you have a significant number of paying customers using specific features. Maybe you need to start to hone in on one or two key processes where, if they break, it's absolutely critical that you know immediately. That's kind of the point that Checkr was at in 2017. We really needed to have

1653.187 - 1669.716 Paul Zaich

very clear intelligence and visibility into specific parts of our system. And we're trying to move in that direction when this incident happened. We've continued to invest in that area going forward. I think it's become even more important as we're getting larger because there's just...

1670.456 - 1687.723 Paul Zaich

so many different systems that are interacting together that no one really understands the whole system at this point. And the only way to really know how the different systems are working together, and to make sure everything's working properly, is to have some of these custom metrics defined for specific key processes.

1688.123 - 1695.027 Dave Kimura

Do you find that putting really large screens on the office wall helps make your application more reliable?

1695.207 - 1716.598 Paul Zaich

That's a good question. We're all remote now, so at this point we've had to experiment with that. We did have some of those in our office. I've been trying to find ways to make metrics more visible to our team as we've shifted to 100% remote due to the pandemic. There's also a challenge for our business in particular where...

1717.498 - 1738.212 Paul Zaich

Many of our processes are very asynchronous, and they can take hours or days to fully execute. And so finding ways to short-circuit and know that those things are broken can be challenging at times. So one of the things we have to do is look at the data over time as well and not just look at real-time metrics.

1738.472 - 1760.364 Paul Zaich

So one thing I've been experimenting with is trying to create more automated reports that go into sort of a Slack channel that we can look at. And so people can review that. And we've also implemented basically a bi-weekly review during our retro where we just look at our metrics and some of the longer running trends so that we can see if those look correct. Is there anything that's wrong?

1760.424 - 1770.768 Paul Zaich

We can talk about it, see if there's things that we want to actually action on based on that review. So we're trying to find some ways to do check-ins that don't require us to be all in the office together.

1770.928 - 1795.541 Dave Kimura

The Slack channel truly is the giant performance monitor of 2020. That is literally what tells me whether stuff is working at the moment. I'm thinking there are a lot of people in the same boat. So it sounds like you're saying that once you get to a certain stage... then the off-the-shelf monitoring isn't really going to cut it. So you have written custom monitoring for your application.

1795.581 - 1796.142 Dave Kimura

Is that correct?

1796.462 - 1824.667 Paul Zaich

We have implemented what I consider custom metrics. We use Datadog, so a lot of this is out of the box. You can use their implementation, but you're adding some code to specific parts of your application. Maybe it's a callback on your ActiveRecord model. When something is created, you send a message to a queue, and that then triggers a message into StatsD that goes to Datadog.

1824.927 - 1840.74 Paul Zaich

Anyways, it's a pretty lightweight implementation in terms of what you can do, but you're adding specific events that you want to track. And then you can create your own monitors and alerting around those or correlations between different events in your system.
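
A minimal sketch of the callback-to-StatsD pattern described above, assuming the dogstatsd-ruby gem; the metric name, namespace, and tag are placeholders, and the intermediate queue Paul mentions is skipped for brevity.

```ruby
# Gemfile: gem "dogstatsd-ruby"
require "datadog/statsd"

STATSD = Datadog::Statsd.new("localhost", 8125, namespace: "myapp")

class Report < ApplicationRecord
  after_create_commit :emit_created_metric

  private

  # One event per successful report creation; a Datadog monitor can then
  # alert when the hourly count drops below an expected floor.
  def emit_created_metric
    STATSD.increment("reports.created", tags: ["package:#{package}"])
  end
end
```
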

1841.161 - 1861.038 Paul Zaich

So you could potentially look at a custom metric and then look at that compared to HTTP statuses that are coming through or the latency of an endpoint. And then you could correlate those two metrics as well. So there's some more advanced things you can do there as well if you need to. But again, it's not really a lot of custom work.

1861.078 - 1875.572 Paul Zaich

It's just adding some specific points in your code base that you feel like are really important to track. And one example of this for Rails users is, I believe there's something like this already set up for Datadog for Sidekiq. So we instrument it on a lot of our

1875.692 - 1893.76 Paul Zaich

Sidekiq jobs, and we can see when the lag is growing on one of those queues, we can see what the average completion time is, and we can look at the p90 completion time for different types of jobs. So you get a lot of visibility into your Sidekiq workers and processes very easily, basically for free.
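
For the Sidekiq visibility mentioned here, a hedged sketch of emitting queue lag and depth yourself using Sidekiq's public queue API; the metric names and StatsD setup are assumptions, and Datadog's own Sidekiq integration may already cover this for you.

```ruby
require "sidekiq/api"
require "datadog/statsd"

statsd = Datadog::Statsd.new("localhost", 8125, namespace: "myapp")

# Run periodically (cron, clockwork, or a scheduled job).
Sidekiq::Queue.all.each do |queue|
  tags = ["queue:#{queue.name}"]
  # latency: seconds the oldest job in the queue has been waiting
  statsd.gauge("sidekiq.queue.latency", queue.latency, tags: tags)
  statsd.gauge("sidekiq.queue.size", queue.size, tags: tags)
end
```
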

1893.98 - 1913.844 Chuck

And if you're going to use Slack for your error notification, I'm not dissing that at all. I have a few applications that actually do that. It just triggers a Slack notification. But if you're only capturing the error message and not a stack trace along with it, then that error message is pretty much useless.

1914.104 - 1920.307 Chuck

Because it tells you you have a problem somewhere in your millions of lines of code, but we're not going to tell you where it's at.

1920.747 - 1941.077 Paul Zaich

Just to be clear, we capture all of our errors in Sentry. We do have some alerting that goes to Slack, but I would also want to emphasize that anything that truly has any chance of being a serious issue should never be either an email or a Slack alert.

1942.978 - 1966.197 Paul Zaich

You really should have some kind of escalation via either maybe it's text, maybe it's an actual incident response system like PagerDuty where you can have an escalation policy. For us, that's what we're using. It should have this synchronous alerting that really forces someone to look at it. You can't rely on something asynchronous like Slack in this case for serious response on issues.
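
As a sketch of the synchronous escalation Paul describes, here is roughly what triggering a page through PagerDuty's Events API v2 can look like from Ruby; the routing key, source, and summary are placeholders.

```ruby
require "net/http"
require "json"
require "uri"

def page_on_call!(summary)
  body = {
    routing_key:  ENV.fetch("PAGERDUTY_ROUTING_KEY"), # placeholder integration key
    event_action: "trigger",
    payload: {
      summary:  summary,
      source:   "reports-api",
      severity: "critical"
    }
  }
  Net::HTTP.post(
    URI("https://events.pagerduty.com/v2/enqueue"),
    body.to_json,
    "Content-Type" => "application/json"
  )
end

page_on_call!("Report creation returning unexpected 404s")
```
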

1966.657 - 1987.984 Chuck

There's a little off topic, but you know what issue I found with that is I use my cell phone for everything. It's where I have my email, get my text messages, phone calls, and all that stuff. And so I would like to keep it on full volume late at night when I'm sleeping. So if a critical does arrive, then I can get notified.

1988.484 - 2010.338 Chuck

But my issue is that I would never get any sleep because my phone would just go off. So I need to figure out some way that I can set up for a particular phone number or something to override any kind of sleep mode or whatever that I have on my phone right now or get a different phone for that purpose. That seems a bit overkill.

2010.858 - 2027.465 Paul Zaich

You can actually do that, I believe, at least with iOS. You can set up an override where you snooze everything else and then you can set up and you have to just put it in your personal contacts, whatever numbers you think you're going to receive critical notification from. And then that'll actually ring through.

2027.485 - 2030.086 Chuck

All right. I need to quit being lazy then and just do that.

2030.546 - 2053.416 Dave Kimura

Back in 2015, I was working in the States and due to various issues, I was still responsible effectively for a bunch of servers in the UK. And I'd gone to see a film and put my phone on silent. And of course, all the servers melted halfway through Skyfall or whatever movie it was. Tom Cruise did not alert me of the impending server disaster while he was dealing with the aliens.

2053.656 - 2080.329 Dave Kimura

So I came out and everyone was very upset. So I ended up writing custom alerting with a custom app using the Android automator that when it received a text message with the magic string in it would actually like turn up volume and then play the Beatles help at full volume. And that worked. That worked very well.

2080.609 - 2094.985 Dave Kimura

But what it didn't have, which I like on the PagerDuty system, is the acknowledgement. So you can see, you know, yeah, I've sent the message. Has that person seen that message? And, you know, tapped the "yes, I am aware the service is melting" button.

2095.376 - 2113.643 Luke Stutters

Yeah, I've got, I think it's the bedtime settings in iOS. And yeah, I've just told it if it's a number in my contacts, then ring. And if it's not, then don't. So yeah, it'll go off, but it'll only go off if it's, yeah, if it's in my contacts. So yeah, then I just add whoever or whatever to my contacts and I'm set.

2113.883 - 2116.904 Chuck

Yeah, that should work well for my use case because no one ever calls me.

2116.964 - 2120.285 Dave Kimura

Yeah, just the spammers, right? That's a tragic thing to say, Dave.

2122.426 - 2132.081 Chuck

Now, I have the Verizon call filter, which actually works pretty well. It's reduced the 15, 20 phone calls I would get a day down to like one.

2132.362 - 2137.63 Luke Stutters

Yeah, the iPhone has that feature too, where you can essentially tell it, don't ring unless the number's in my contacts.

2137.89 - 2158.247 Chuck

Yeah, I got burned by that pretty bad one time. My wife was over at the pool. She had forgotten her phone or she had lost it. And so she borrowed someone's phone there. And because that random person wasn't in my contacts, I never got her phone call. My phone just stayed silent. So I had to disable that pretty quick. That'll teach you.

2158.447 - 2176.06 Dave Kimura

Can I ask you about composite monitors? Because that is a phrase I have not heard before. I'm familiar with a rate monitor. My understanding of that is if it drops really quick, it goes off. But if it drops slowly, it doesn't go off. But what is this composite monitor?

2176.261 - 2198.212 Paul Zaich

So a composite monitor is basically a combination of several different metrics that you're measuring, tying those together with AND or OR statements. So maybe referencing what I was talking about before, you might want to have a custom metric that you're looking at and you want to look at how many of those are coming through, how many events are coming through.

2198.232 - 2219.263 Paul Zaich

And then you might also want to look at, in this case, the error rate for HTTP status, maybe how many 400 errors you're getting relative to 200s. You could basically do something where you have an AND statement between those two different measures and those Boolean evaluations. Or you could do something where you have an OR.

2219.343 - 2229.79 Paul Zaich

So you could say, these are basically signaling for the same type of issue that I want to alert on, but I'm going to look for these different conditions, all in the same monitor.

2229.95 - 2254.029 Dave Kimura

So you're looking at multiple different things at once. Is that so that you could combine those to kind of set effectively a much lower threshold and get higher signal-to-noise? So you say something like, well, we'll allow this number of 404s or allow this number of server load this number of other errors. But if you get all three at the same time, then it triggers something different.

2254.629 - 2275.783 Dave Kimura

Or does it use a lower number? What's the result of that? The advantage of using that logic instead of just saying, here is the minimum number of 404s. Here is the maximum number of 404s. Here's the maximum number of errors. How does that actually translate into a better metric?

2275.923 - 2284.31 Paul Zaich

Right. So I think it gives you the ability to tune things to potentially make something have a higher fidelity when it alerts.

2284.55 - 2301.767 Paul Zaich

So you're not getting... One, you can set the thresholds actually higher and keep things... It depends how you want to use it, but you can... In this case, you could set the thresholds higher, but you could have something where it's like, well, if there aren't any errors coming through, then... Maybe we're okay with that, even though the number's a little bit lower.

2302.447 - 2319.547 Paul Zaich

Or you can do things where you can be more, and again, you can also tune this to be more sensitive. In this particular incident, if we had had some error monitoring around 400s, in addition to the threshold that we had that was pretty low, I think we would have been alerted on that within maybe an hour.
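
To make that concrete, here is an illustrative shape for such a composite, written as plain Ruby data for readability; the metric names, thresholds, and monitor IDs are invented, not Checkr's actual monitors.

```ruby
# Two ordinary Datadog-style metric monitors.
low_report_volume = {
  type:  "metric alert",
  query: "sum(last_1h):sum:reports.created{*}.as_count() < 10"
}

elevated_404_rate = {
  type:  "metric alert",
  query: "sum(last_15m):sum:api.requests{endpoint:create_report,status:404}.as_count() / " \
         "sum:api.requests{endpoint:create_report}.as_count() > 0.2"
}

# A composite monitor references the IDs of existing monitors and only
# alerts when both are triggered, so each threshold can stay sensitive
# without flooding the team with false alarms.
composite_query = "12345 && 67890" # placeholder monitor IDs
```
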

2320.585 - 2343.75 Paul Zaich

So you can do things there that give you more sensitivity without necessarily causing a lot more false alarms. And that's something that you have to just be really careful with any kind of monitor on a team. You really need to make sure that you're not creating false alarms. I'd say it's almost as important or equally important to the sensitivity of the alarm as well.

2343.83 - 2359.494 Paul Zaich

Because if you're creating false alarms all the time, people just aren't going to give them the review that they need. So if you're doing that all the time, you're probably going to miss something inevitably when there's actually a real issue.

2359.634 - 2370.124 Luke Stutters

Makes sense. All right, we're getting close to the end of our time. Are there any other stories or examples or lessons that you want to make sure somebody listening to this gets?

2370.745 - 2391.344 Paul Zaich

I just want to emphasize that this is a growing process that I think every team should go through. It's something that is going to evolve over time. And as your product becomes more important to customers and continues to grow, you need to just be constantly re-evaluating what your approach is to this.

2391.865 - 2401.62 Paul Zaich

What's going to work for a brand new product, a brand new startup, a brand new company isn't necessarily going to be what works later on. And it's something that you need to evaluate.

2401.78 - 2417.368 Paul Zaich

And as your product starts to be something that's really a critical service for your customers or for other teams at your company, you just need to continually set the bar higher and make sure that you're continuing to grow observability across the stack.

2417.648 - 2423.692 Luke Stutters

All right. Well, one more thing before we go to picks, and that is if people want to get in contact with you, how do they find you on the internet?

2423.812 - 2431.215 Paul Zaich

You're welcome to reach out to me on Twitter or GitHub, or you can reach out to me on LinkedIn as well.

2431.535 - 2437.177 Luke Stutters

Awesome. Yeah, we'll get links to those and we'll put them in the show notes. Let's go ahead and do some picks then. Dave, do you want to start us off with the picks?

2437.277 - 2463.158 Chuck

Yeah, sure. So went to the doctor the other week and they said I had high blood pressure, which I attribute to raising kids and them stressing me out. So I got this blood pressure monitor that syncs up with my iPhone. So it keeps a historical track of it. And it's been really nice. And I guess it's accurate. I don't know. It says it's high. So I guess it's doing something.

2463.819 - 2469.185 Chuck

So it is the Withings and it's a wireless rechargeable blood pressure monitor.

2469.545 - 2477.009 Dave Kimura

Cool. Luke, how about you? I just think it's really interesting. Is this something you wear all the time, Dave?

2477.269 - 2489.976 Chuck

No, it's just like the doctor's one where they put it, roll up your sleeve, put it on your arm and, you know, it starts to squeeze your arm. It's not like a wristwatch or anything. So I do it a couple of times a day. That'll raise your blood pressure.

2490.157 - 2503.853 Dave Kimura

Just kidding. Yeah, just checking it, just obsessing about it. I suppose that's good. It's not real time. Otherwise, that'd be even more stressful because you'd be sitting there and go off and say, yeah. Blood pressure's going up. Get caught in the feedback loop.

2504.133 - 2505.973 Luke Stutters

Cool. How about you, Luke? What are your picks?

2506.473 - 2532.562 Dave Kimura

I've been fighting the code this week, Chuck. I've been building strange command line interfaces in Ruby, and I've been using a little application which is installed by default on most Ubuntu-based systems called Whiptail. This is an old-school text-style interface for when you can't put a GUI on it for various reasons. So this is kind of like, it makes it look more professional.

2532.742 - 2553.543 Dave Kimura

It makes it look like a real piece of software. And using this from Ruby has been a real pain because you need to do funny things with file descriptors to get the user data out. So it turns out a very nice man by the name of Felix C. Sturgeman has written a gem to do it all for you natively. Way to go, Felix.

2553.923 - 2575.752 Dave Kimura

So, yeah, you know, all of that work I did was totally unnecessary. And you, too, can build amazing old-school ASCII-looking interfaces using the gem. It's called eft, and it's on GitHub under obfusc. And there's loads of really interesting utilities on his GitHub.

2575.792 - 2583.176 Dave Kimura

If you dig in, there's some interesting low-level stuff for when you want to kind of Ruby yourself up on the command line. So well worth a look.
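
For the file-descriptor dance Luke alludes to, a rough sketch of driving whiptail from Ruby without a wrapper gem: whiptail draws its dialog on the terminal and writes the user's answer to stderr, so only stderr is captured here (exit status 0 means OK, non-zero means Cancel or Esc).

```ruby
reader, writer = IO.pipe

# Leave stdout attached to the terminal so the dialog can render;
# redirect only stderr, where whiptail writes the user's input.
pid = Process.spawn("whiptail", "--inputbox", "What is your name?", "8", "40",
                    err: writer)
writer.close
_, status = Process.wait2(pid)
answer = reader.read
reader.close

puts status.success? ? "Hello, #{answer}!" : "Cancelled."
```
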

2583.456 - 2601.59 Luke Stutters

Awesome. All right, I'm going to throw out a couple of picks. The first one is I'm still working on this, so keep checking in. mostvaluable.dev and summit.mostvaluable.dev. I think I've mentioned it on the show before, but I'm talking to folks out there in the community. We've talked to a number of people that you've heard of that you know well, that you're excited to hear from.

2601.79 - 2621.125 Luke Stutters

But yeah, I'm going to be interviewing them and asking them what they would do if they woke up tomorrow as a mid-level developer. and felt like they didn't quite know where to go from there. So a lot of folks, that's where they kind of end up, right? They get to junior or mid-level developer and then it's, okay, I'm proficient, now what? Yeah, there are a lot of options, a lot of ways you can go.

2621.205 - 2640.817 Luke Stutters

I'm hoping to have people come talk about blogging, podcasting, speaking at conferences and all the other stuff. And then just how to stay current, how they keep up on what's going on out there. So I'm gonna pick that. I've been playing a game on my phone just when I have a minute And, you know, I want to sink a little bit of time into it. It's called Mushroom Wars 2. It's on the iPhone.

2640.957 - 2660.49 Luke Stutters

I don't know if it's on the Android phone. Yeah, liking that. And then, yeah, I'm also putting on a podcasting summit. So if you're interested in that, you can go to podcastgrowthsummit.co and we'll have all the information up there. If you listen to the Freelancers' Show, the first interview I did was with Petra Manos. She's in Australia.

2660.55 - 2677.822 Luke Stutters

So I was talking to her in the evening here in the morning there, which is always fun with all the time zone stuff. But she talked about basically how to measure your growth and then how to use Google's tools, not just to measure your growth, but then to figure out where to double down on it and get more traffic. So, um, It was awesome.

2677.862 - 2704.15 Luke Stutters

I'm talking to a bunch of other people that I've known for years and years in the podcasting space. And I'm super excited about it too. And I should probably throw out one more pick. So I'm going to throw out Gmelius. That's G-M-E-L-I-U-S. And what it is, is it's a tool. It's a CRM, but it also has like scheduling. So like ScheduleOnce or what's the other one? Calendly. It allows you to set up

2705.13 - 2729.18 Luke Stutters

a series of emails. It'll do automatic follow-up for you and stuff like that, so it just does a whole bunch of email automation, but it runs out of your email account, your Gmail account. That's the big nice thing about it: you don't get downgraded by SendGrid or something if your emails aren't landing. So that's another thing that I'm just really digging, so I'm going to shout out about that. Paul, what are your picks?

2729.853 - 2750.632 Paul Zaich

I really enjoyed something that was in the Ruby Weekly newsletter this last week. There's a Ruby one-liners cookbook. So it has a bunch of different one-liners you can actually just shell out to and make those calls. And it explains how you can do a lot of things you would do with a shell script very easily with Ruby.
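
A few examples of the kind of one-liners that cookbook covers, using standard ruby(1) flags (-e evaluates a script, -n and -p wrap it in a read loop, -a autosplits each line into $F, -i edits files in place); the file names are placeholders.

```
# grep-style: print lines matching a pattern
ruby -ne 'print if $_.match?(/ERROR/)' production.log

# sum the third whitespace-separated column of a file
ruby -ane 'BEGIN { $sum = 0 }; $sum += $F[2].to_f; END { puts $sum }' usage.txt

# replace text across a file in place, keeping a .bak backup
ruby -i.bak -pe '$_.gsub!(/staging/, "production")' config.yml
```
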

2750.993 - 2764.139 Luke Stutters

Awesome. I'll have to check that out. Sounds like a decent episode too, whether we just go through some of those and pick our favorites or whether we get whoever compiled it on. Thanks for coming, Paul. This was really helpful. And I think some folks are probably going to either encounter this and go...

2764.679 - 2780.016 Luke Stutters

Yeah, I wish we were doing that because the last time we ran into something like this, it was painful. Or some folks hopefully will be proactive and go out there and set things up so that they're watching things and communicating about the way that they handle issues and the way that they avoid them in the first place.

2780.276 - 2781.097 Paul Zaich

It's been a pleasure.

2781.438 - 2785.842 Luke Stutters

All right, we'll go ahead and wrap this up and we will be back next week. Till next time, Max out, everybody.
