Gremlin's Chaos Engineering tools allow devs and SREs to safely, securely, and easily simulate real outages with an ever-growing library of attacks. Run game days with the only Failure-as-a-Service platform.
Listeners of the SE Daily podcast who run an attack with Gremlin get a free t-shirt and stickers.
Gremlin has a full suite of enterprise-grade failure testing modes so you can find out how resilient your production system is.
Recreate real world technical and business failures and prepare for the failure of dependencies, internal or external.
Which resource is your bottleneck? CPU, Memory, IO, or Disk? Find out for certain.
Processes die, time drifts, instances reboot. Are you ready?
You cannot rely on the network, nor on your dependencies being available. What happens when they slow down or disappear all together?
Recreate application outages to quickly resolve or prevent them in the first place.
Confidently experiment with a tiny blast radius to quickly reduce your failure surface area.
Maintain a consistent end user experience by preparing for dependency failure.
Understand how your applications behave in the face of failure in a serverless environment.
Minimize business impact and maximize learning with precise, fine-grained experiments
Experiment on a single user, device, or <attribute> to begin.
Request level granularity, from 0.01% of traffic up to 100%.
Fail or Delay any part of your application, from functions to endpoints.
Run disciplined chaos experiments to identify weak points in your system and fix them before they become a problem.
I. Plan an Experiment
Create a hypothesis for what might go wrong.
II. Run the smallest version
Execute a simple test to see how your system responds.
III. Scale or squash
Scale the experiment until you identify a bug. Then squash it.
If the unexpected happens, Gremlin's failsafes automatically halt your experiment and fall back to steady state.
Gremlin is built to not only cause failure, but to handle it as well.
Security is a first class citizen and is part of our DNA.
Gremlin runs on default Linux permissions and doesn’t require root access.
Every action taken on the platform creates an audit trail.
Multi-factor authentication, Secure Single Sign On, Role-based access control (RBAC). Gremlin is secured to allow experimentation in production.
The Gremlin client, daemon, API, and website undergo regular security auditing by an external auditor.
Get Gremlin up and running in moments with 3 lines of code.