How to Baseline and Improve Reliability with Automated Scoring

Organizations running complex distributed systems face a reliability gap.

Improving reliability starts with measuring it, but most organizations can only observe what’s already gone wrong. Teams are beginning to drive reliability improvements by proactively measuring, testing, and automating reliability before incidents.

This webinar walks through the practice of automated Reliability Scoring and how it can drive improvements across your organization. You’ll learn strategies to help you implement Reliability Scoring and see tools you can use to automate the process at scale.


Register Now

Thank you for registering for this on-demand event. You will receive an email momentarily with a link to watch the session.

About this webinar

Learn a new approach to reliability management. Use testing, scoring, and automation to proactively manage and improve reliability in organizations running complex, distributed systems.

  • Baseline reliability with a reliability score
  • Use scores to identify risks and prioritize remediations
  • Automate reliability practices organization-wide
About the speakers

Ryan Detwiller

Director of Product Marketing

Ryan works with Gremlin’s product and marketing teams to understand reliability challenges at fast-moving companies, and ensure Gremlin delivers the tools, practices, and advice needed to develop world-class reliability programs. Prior to Gremlin, Ryan held leadership roles in product, marketing, and general management with a number of B2B technology companies building complex distributed systems, including Open Mesh and Datto.

Andre Newman

Sr. Reliability Specialist

At Gremlin, Andre promotes the benefits of Chaos Engineering and reliability testing to engineering teams around the world, including at some of the largest enterprise organizations. Prior to Gremlin, he created technical content explaining Kubernetes and containerization, the shift to cloud computing, DevOps, observability, and more. His work has been featured in The New Stack, DZone, Software Engineering Daily, TechBeacon, and StatusCode Weekly.

Avoid downtime. Use Gremlin to turn failure into resilience.

Gremlin empowers you to proactively root out failure before it causes downtime. See how you can harness chaos to build resilient systems by requesting a demo of Gremlin.GET STARTED

Product Hero ImageShape