Beyond Chaos Engineering: Using Reliability Scores to Drive Real Results

Improving reliability starts with measuring it. But today, most organizations only have backwards-facing measurements of reliability like incidents and SLOs—these only show you what’s gone wrong. Teams need to measure the reliability of their services without waiting for an outage. They need a Reliability Score.

In this webinar, we’ll walk you through the new practice of Reliability Scoring and how it can drive faster, more measurable reliability improvements across your organization. You’ll walk away with strategies to help you implement Reliability Scoring and new tools you can use to automate the process at scale.


Register Now

Thank you for registering for this on-demand event. You will receive an email momentarily with a link to watch the session.

About this webinar

In order to make reliability improvements tangible, there needs to be a way to quantify and track the reliability of systems and services in a meaningful way.

In this webinar, you'll learn:

  • How to calculate a Reliability Score
  • How to use Reliability Scores to drive alignment across services and teams
  • How to automate Reliability Scores in your CI/CD pipeline using observability tools
About the speakers

Ryan Detwiller

Director of Product Marketing

Ryan works with Gremlin’s product and marketing teams to understand reliability challenges at fast-moving companies, and ensure Gremlin delivers the tools, practices, and advice needed to develop world-class reliability programs. Prior to Gremlin, Ryan held leadership roles in product, marketing, and general management with a number of B2B technology companies building complex distributed systems, including Open Mesh and Datto.

Andre Newman

Sr. Reliability Specialist

At Gremlin, Andre promotes the benefits of Chaos Engineering and reliability testing to engineering teams around the world, including at some of the largest enterprise organizations. Prior to Gremlin, he created technical content explaining Kubernetes and containerization, the shift to cloud computing, DevOps, observability, and more. His work has been featured in The New Stack, DZone, Software Engineering Daily, TechBeacon, and StatusCode Weekly.

Avoid downtime. Use Gremlin to turn failure into resilience.

Gremlin empowers you to proactively root out failure before it causes downtime. See how you can harness chaos to build resilient systems by requesting a demo of Gremlin.GET STARTED

Product Hero ImageShape