September 21, 2023 - 3 min read
How a simple metric drives reliability culture at Slack
How do you track reliability in an organization with hundreds of engineers, dozens of daily production changes, and over 32 million monthly users? Even more, how do you do this in a way that's simple, presentable to executives, and…
What is Reliability Management?
October 20, 2022 - 4 min read
Measuring and improving the reliability of technical systems has always been challenging. As an industry, we've developed several practices to address reliability concerns, such as incident response, observability, and Chaos…
Four tests to measure and improve reliability: what matters and how it works
September 2, 2022 - 5 min read
Legendary race car driver Carroll Smith once said, "until we have established reliability, there is no sense at all in wasting time trying to make the thing go faster." Even though he was referring to cars, the same goes for technology: no…
How Gremlin's reliability score works
July 14, 2022 - 5 min read
To make reliability improvements tangible, there needs to be a meaningful way to quantify and track the reliability of systems and services. This "reliability score" should indicate at a glance how likely a service is to…
© 2023 Gremlin Inc. Covina, CA 91723