
Improve Kubernetes reliability faster with Gremlin and Dynatrace
It’s now easier than ever to start testing Kubernetes with Dynatrace and Gremlin. With a new strategic integration, Kubernetes services set up in Dynatrace are automatically discovered in Gremlin, making test setup simple and fast.
At a time when AI is driving massive expansions in infrastructure and dramatically increasing deployment speed, being able to set up and test new services quickly is more important than ever.
“Kubernetes is the foundation of modern cloud-native infrastructure, supporting a wide spectrum of organizations, from nimble startups to global enterprises. As AI-driven innovation accelerates, the reliability of Kubernetes becomes mission-critical.”
What is a health check?
The whole point of Fault Injection tests is to see how your services and applications respond under potential failure conditions. That’s where health checks come in. By integrating directly with your observability platform, health checks monitor the status of your service during experiments.
A well-designed health check should tie directly into metrics that are important to your organization and your business. Many of these metrics, such as error rates or latency, have already been planned and selected as part of instrumenting your system for observability.

When you run a test, a health check will monitor your metrics to determine whether your test passes or fails. This also helps point your team towards possible areas to address.
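As a rough illustration of the pass/fail logic described above, the sketch below evaluates metric samples collected during an experiment against thresholds. The names (`Thresholds`, `evaluate_health_check`), the sample data, and the threshold values are all hypothetical and are not part of the Gremlin or Dynatrace APIs; in practice, the observability platform supplies the metrics and the evaluation.

```python
from dataclasses import dataclass


@dataclass
class Thresholds:
    # Hypothetical limits; choose values that match your own objectives.
    max_error_rate: float = 0.01       # fraction of failed requests
    max_p95_latency_ms: float = 500.0  # 95th percentile latency in milliseconds


def evaluate_health_check(samples, thresholds):
    """Return (passed, violations) for metric samples taken during a test.

    Each sample is a dict with "error_rate" and "p95_latency_ms" keys.
    """
    violations = []
    for index, sample in enumerate(samples):
        if sample["error_rate"] > thresholds.max_error_rate:
            violations.append((index, "error_rate", sample["error_rate"]))
        if sample["p95_latency_ms"] > thresholds.max_p95_latency_ms:
            violations.append((index, "p95_latency_ms", sample["p95_latency_ms"]))
    # The test passes only if no sample breached a threshold.
    return len(violations) == 0, violations


# Illustrative samples: the second one breaches the error-rate limit.
samples = [
    {"error_rate": 0.002, "p95_latency_ms": 180.0},
    {"error_rate": 0.030, "p95_latency_ms": 420.0},
]
passed, violations = evaluate_health_check(samples, Thresholds())
print(passed, violations)
```

The list of violations records which metric breached its limit and when, which is the kind of detail that points a team towards the area to address.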
Health checks also help improve your observability. By injecting faults, you can verify that your observability data isn’t lagging behind, fine-tune alerts, check paging systems, and optimize your setup to help your team respond as quickly as possible during incidents.
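One way to check whether observability data is lagging is to compare the time a fault was injected with the time the first alert fired. The sketch below assumes you can record both timestamps yourself; the function names, timestamps, and the two-minute budget are hypothetical and only for illustration.

```python
from datetime import datetime, timedelta


def detection_lag(fault_started_at, alert_fired_at):
    """Time between injecting a fault and the first alert firing."""
    return alert_fired_at - fault_started_at


def within_budget(lag, budget):
    """True if the alert fired after the fault started and within the budget."""
    return timedelta(0) <= lag <= budget


# Illustrative timestamps, not real measurements.
fault_started_at = datetime(2025, 1, 1, 12, 0, 0)
alert_fired_at = datetime(2025, 1, 1, 12, 1, 30)

lag = detection_lag(fault_started_at, alert_fired_at)
print(lag.total_seconds(), within_budget(lag, timedelta(minutes=2)))
```

Tracking this lag across experiments gives a concrete number to improve when tuning alerts and paging.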
By integrating Gremlin with an observability platform like Dynatrace, you can tie testing and monitoring together to maximize the impact of your reliability efforts.
“Many teams have faced challenges operationalizing reliability testing across complex cloud-native architectures, often requiring multiple manual steps to identify and target the right resources. By combining advanced AI observability and topology insights with Gremlin’s fault injection and reliability capabilities, customers can more easily identify, test, optimize, and strengthen critical services at scale.”
Automatically discover Kubernetes services
Powered by Dynatrace’s AI-driven observability and topology mapping, the new integration automatically discovers Kubernetes services within Gremlin. In only a few clicks, you can select a Kubernetes object from a dropdown list pulled from Dynatrace, create a health check, and be ready for testing.

When combined with Gremlin’s standardized test suites, this integration becomes a powerful tool for organizations to scale Chaos Engineering and reliability testing across their teams. New services can be onboarded for reliability testing in minutes. Just create a Dynatrace Health Check, assign it to your services, and they’re ready to go.
“With this new integration, Gremlin and Dynatrace are simplifying how organizations introduce Fault Injection into their Kubernetes environments.”
Make sure your applications are resilient and reliable
With Chaos Engineering and standardized reliability testing, your teams will be able to prove resilience to known failure modes, optimize their response playbooks, and address issues before they cause customer-impacting outages.
And with faster onboarding from Dynatrace and Gremlin, you’ll be able to quickly and effectively scale those efforts across your organization to increase availability and uptime.
“Our partnership with Gremlin simplifies Chaos Engineering, helping teams ensure resilience and performance across complex, distributed systems.”
Gremlin's automated reliability platform empowers you to find and fix availability risks before they impact your users. Start finding hidden risks in your systems with a free 30-day trial.