We use Gremlin to test various failure scenarios and build confidence in the resiliency of our microservices. The ability to target containerized services with an easy-to-use UI has significantly reduced the time it takes us to do fault injection.
Doing Chaos Engineering with Gremlin has helped us break down knowledge monopolies and validate our runbooks, resulting in dramatic improvements to our incident response times and production environments.
It's about having the courage to do upfront what can keep you from losing face later. Rationally, doing Chaos Engineering is an easy call to make.