How to test zone redundancy using Gremlin

Office Hours
June 13, 2024 at 11 am PST / 2 pm EST

Register Now

Thank you for registering! We'll e-mail you a link to join the webinar on June 13, 2024.

Zone failures are rare, but they still happen. When an entire zone fails, many of the most common redundancy techniques fail. How do you avoid outages like these, especially if they affect an entire datacenter?

In this webinar, we’ll show you how to prepare for zone outages using Gremlin. You’ll learn how Gremlin’s built-in reliability tests and Scenarios already test your services against zone failures. You’ll also learn how to customize these tests to target different zones, how to recreate an outage in a different zone from the ones your systems are running in, and how to monitor your services throughout using Health Checks.

We'll cover:

  • How to recreate a zone failure using Gremlin Scenarios
  • Expanding zone failure tests to your dependencies
  • Customizing zone redundancy tests for AWS 
  • Monitoring your services during experiments using Health Checks
About the speakers

Andre Newman

Sr. Reliability Specialist

At Gremlin, Andre promotes the benefits of Chaos Engineering and reliability testing to engineering teams around the world, including at some of the largest enterprise organizations. Prior to Gremlin, he created technical content explaining Kubernetes and containerization, the shift to cloud computing, DevOps, observability, and more. His work has been featured in The New Stack, DZone, Software Engineering Daily, TechBeacon, and StatusCode Weekly.

Dan Muret

Sr. Solutions Architect

At Gremlin, Dan works closely with organizations to understand, implement, and design Chaos Engineering and reliability testing practices. Prior to Gremlin, he’s worked as a system administrator and solutions architect for companies like IBM, Zerto, and Veeam/Kasten. Dan’s real-world experience in system architecture, cloud migrations, disaster recovery, and resilience testing help him guide companies to make the most out of their reliability and Chaos Engineering efforts.

Avoid downtime. Use Gremlin to turn failure into resilience.

Gremlin empowers you to proactively root out failure before it causes downtime. See how you can harness chaos to build resilient systems by requesting a demo of Gremlin.GET STARTED

Product Hero ImageShape