It’s now easier than ever to start testing Kubernetes with Dynatrace and Gremlin. With a new strategic integration, Kubernetes services set up in Dynatrace are automatically discovered in Gremlin to make testing set up simple and fast.

At a time when AI is driving massive expansions in infrastructure and dramatically increasing deployment speed, being able to set up and test new services quickly is more important than ever.

Kubernetes is the foundation of modern cloud-native infrastructure, supporting a wide spectrum of organizations, from nimble startups to global enterprises. As AI-driven innovation accelerates, the reliability of Kubernetes becomes mission-critical.”
—Wayne Segar, Global Field CTO, Dynatrace

What is a health check?

The whole point of Fault Injection tests is to see how your services and applications respond under potential failure conditions. That’s where health checks come in. By integrating directly with your observability platform, health checks monitor the status of your service during experiments.

A well-designed health check should tie directly into metrics that are important to your organization and your business. Many of these metrics have already been planned and selected as part of instrumenting your system with observability, such as error rates or latency.

Common Dynatrace signals that can be used for health checks.

When you run a test, a health check will monitor your metrics to determine whether your test passes or fails. This also helps point your team towards possible areas to address.
Health checks also help improve your observability. By injecting faults, you can verify that your observability data isn’t lagging behind, fine-tune alerts, check paging systems, and optimize your setup to help your team respond as quickly as possible during incidents.
By integrating Gremlin with your observability platform, like Dynatrace, you can seamlessly tie your efforts together to maximize the impact of your reliability efforts.

Many teams have faced challenges operationalizing reliability testing across complex cloud-native architectures, often requiring multiple manual steps to identify and target the right resources. By combining advanced AI observability and topology insights with Gremlin’s fault injection and reliability capabilities, customers can more easily identify, test, optimize, and strengthen critical services at scale.”
—Sam Rossoff, CTO, Gremlin

Automatically discover Kubernetes services

Powered by Dynatrace’s AI-driven observability and topology mapping, the new integration automatically discovers Kubernetes services within Gremlin. In only a few clicks, you can select a Kubernetes object from a dropdown list pulled from Dynatrace, create a health check, and be ready for testing.

When combined with Gremlin’s standardized test suites, this integration becomes a powerful tool for organizations to scale Chaos Engineering and reliability testing across their teams. New services can be onboarded for reliability testing in minutes. Just create a Dynatrace Health Check, assign it to your services, and they’re ready to go.

With this new integration, Gremlin and Dynatrace are simplifying how organizations introduce Fault Injection into their Kubernetes environments.”
—Sam Rossoff, CTO, Gremlin

Make sure your applications are resilient and reliable

With Chaos Engineering and standardized reliability testing, your teams will be able to prove resilience to known failure modes, optimize their response playbooks, and address issues before they cause customer-impacting outages.

And with faster onboarding from Dynatrace and Gremlin, you’ll be able to quickly and effectively scale those efforts across your organization to increase availability and uptime.

Our partnership with Gremlin simplifies Chaos Engineering, helping teams ensure resilience and performance across complex, distributed systems.”
—Wayne Segar, Global Field CTO, Dynatrace
No items found.
Start your free trial

Gremlin's automated reliability platform empowers you to find and fix availability risks before they impact your users. Start finding hidden risks in your systems with a free 30 day trial.

sTART YOUR TRIAL

Gavin Cahill
Gavin Cahill
Sr. Content Manager