How to run fault injection tests on AWS managed services

Office Hours

Register Now

Thank you for registering! Click here to watch the webinar on-demand.

Fully-managed SaaS services offer incredible scalability and accessibility, but at a cost: they’re also single points of failure. If your application depends on a SaaS service and the service fails, guess who your customers will blame? We need to design applications to anticipate and work around managed service failures, but how do we do that without having to wait for the service to fail?

In this Office Hours session, we’ll show you how you can recreate a failure in a managed service provider using Gremlin’s fault injection tools. You’ll learn how to run experiments that replicate SaaS outages in a safe, controlled, reversible way, while only impacting the services you want to test. We’ll also show you how you can easily choose from a pre-populated list of managed services directly in the Gremlin web app.


  • The challenges with testing managed services
  • How to track managed services as dependencies
  • How to run reliability tests on managed services
About the speakers

Andre Newman

Sr. Reliability Specialist

At Gremlin, Andre promotes the benefits of Chaos Engineering and reliability testing to engineering teams around the world, including at some of the largest enterprise organizations. Prior to Gremlin, he created technical content explaining Kubernetes and containerization, the shift to cloud computing, DevOps, observability, and more. His work has been featured in The New Stack, DZone, Software Engineering Daily, TechBeacon, and StatusCode Weekly.

Dan Muret

Sr. Solutions Architect

At Gremlin, Dan works closely with organizations to understand, implement, and design Chaos Engineering and reliability testing practices. Prior to Gremlin, he’s worked as a system administrator and solutions architect for companies like IBM, Zerto, and Veeam/Kasten. Dan’s real-world experience in system architecture, cloud migrations, disaster recovery, and resilience testing help him guide companies to make the most out of their reliability and Chaos Engineering efforts.

Avoid downtime. Use Gremlin to turn failure into resilience.

Gremlin empowers you to proactively root out failure before it causes downtime. See how you can harness chaos to build resilient systems by requesting a demo of Gremlin.GET STARTED

Product Hero ImageShape