Release Notes


May 2024
May 31, 2024
March 2024
March 29, 2024
  • NewInclude and exclude specific Detected Risks in your custom reliability test suite
  • NewReliability Management is now available for Windows-based services
  • NewNew Experiment type: Process Exhaustion for Windows
  • NewThe Gremlin agent now supports OpenShift 4.14
February 2024
February 29, 2024
January 2024
January 31, 2024
December 2023
December 31, 2023
November 2023
November 30, 2023
  • NEWThe Company Summary report can now be filtered by tags, enabling custom views and reporting of risks, services, and scores
  • Info Detected Risks are now included in the reliability score for each service Scenarios page.
  • New You can now mark a service dependency as a single point of failure to prevent tests being run against it
  • New New company setting: limit running experiments during certain time windows
  • New New company setting: disable “Target All”option from Experiment and Scenario screens
October 2023
October 31, 2023
  • New Custom Test Suites - You can now create custom RM Test Suites using Scenarios. This also lets you customize how reliability scores are calculated.
  • New Parallel Scenarios - Gremlin now supports running multiple experiments in parallel within a Scenario.
  • New CI/CD Integration - We've added CI/CD examples for running a Scenario, running an RM test, and getting an RM score using GitHub actions and Jenkins pipelines.
August 2023
August 31, 2023
  • Info Improved view for Company Summary reports (previously called the Dashboard). Plan usage has been moved to Company Settings.
  • Info Scenarios can now be deleted from the Scenarios page.
  • New Gremlin now supports delegation of Namespaces to a Team for service creation (manual and automatic)
July 2023
  • New Added Detected Risks for automatically detecting high-priority reliability concerns in a Kubernetes environment.
  • New Launched the beta release of Failure Flags, Gremlin's new framework for running Chaos Engineering experiments on AWS Lambda functions, serverless workloads, and containers.
  • New Added service annotations, which lets you automatically register your Kubernetes services in Gremlin by adding a simple annotation.
  • New Added web app support for managing multiple services simultaneously. This also includes adding a Health Check to multiple services. The Services list has been reworked to reflect this change.
June 2023
June 29, 2023
  • New AWS CloudWatch Update Improved search and automated health check creation.
  • New Label targeting support for Kubernetes in FI allows for finer grained targeting (and Service creation heuristics) based upon labels within Kubernetes in FI
  • New Configurable Session TTL and Session Renewal Options
  • New Fixed issue for some customers requesting shared access to new namespaces from the experiment targeting page
  • Fix Fixed issue for TEAM_VIEWER role missing access on some endpoints for viewing attacks
May 2023
May 31, 2023
  • New Certificate Expiry test added to the list of available Fault Injection experiments.
  • New Amazon CloudWatch added as an observability tool for Health Checks.
  • Info Renamed "attacks" to "experiments" (this does not change the Gremlin REST API /attacks endpoint).
April 2023
April 30, 2023
  • New Shared Health Checks - Teams can now share Health Checks with each other at the Team level, making it easier to collaborate and ensure system availability.
  • New Datadog Health Checks - Datadog customers can now access a searchable drop-down menu of monitors, making it easier to find and select the right monitor for their needs.
  • New Additional Providers for Network Tests - We have extended the set of network IP ranges that we are collecting, providing more comprehensive network test coverage.
March 2023
March 31, 2023
  • New Added the ability to create a service without having to add Health Checks. You will still need to add Health Checks to run reliability tests, however.
  • New The Gremlin Kubernetes Agent now supports ARM64.
  • New Attacks can now target Kubernetes resources with restricted network access.
  • New Gremlin now supports targeting DeploymentConfig objects in OpenShift.
  • New Added an overall reliability score trend line to the Company Dashboard, as well as the ability to export the Company Dashboard to PDF.
  • New The screen shown when running a reliability test now shows a visual timeline of the test, including when health checks were performed.
  • Info "Status Checks" and "Golden Signals" have been renamed to "Health Checks".
February 2023
February 28, 2023
  • New First public release of the Gremlin Reliability Management (RM) API. Click here to see the relevant API documentation.
  • New Made several improvements to the RM Services dashboard, including a trend line graph showing each service's reliability score over the past 30 days.
  • New Added the ability to review and add suggested dependencies to Gremlin RM services.
  • New Added the ability to disable API key usage at the company level.
  • New Gremlin now supports running network attacks on multiple NICs.
January 2023
January 31, 2023
  • New Added a "Run All Tests" button to the service creation process and on the service overview page.
  • New Added support for additional Datadog regions, specifically the EU Region.
  • New Added the ability to flag services as Production, which adds additional warnings before you can run tests.
  • New In Gremlin Fault Injection (FI), newly discovered agents will automatically join ongoing attacks (as long as they meet the targeting criteria).
  • Info The Getting Started page will now automatically refresh to show newly detected agents.
December 2022
December 23, 2022
  • New Added the ability to assign an owner to a service. By default, this is the user who created the service. This can be changed in the Service Settings screen.
  • Info We increased the process collection frequency for new accounts. This means new Gremlin users will no longer need to wait up to an hour before being able to create their first service.
November 2022
November 30, 2022
October 2022
October 31, 2022
  • New Process Discovery is enabled by default for new agent installations.
  • New Added the option to Autoschedule all reliability tests.
September 2022
September 1, 2022
June 2022
June 1, 2022
  • New The Gremlin agent now supports ARM64.
May 2022
May 1, 2022
  • New A Rerun Scenario button is now available for every Scenario on the GameDay Runs tab.
  • Info Attachments on the GameDays Summary tab are now limited to 50 MiB.
April 2022
April 1, 2022
  • New GameDays are now available in Gremlin - you can perform all GameDay activities from planning, running, and sharing results in Gremlin. See GameDays Overview for more information.
  • New Jira integration was added to Scenario Runs and GameDay Summaries. See Tracking results in Jira for details.
March 2022
March 1, 2022
  • New Updated the Gremlin Datadog integration to include Kubernetes objects for observability of Gremlin attacks in Datadog. Tags added include cluster, namespace, specific object (deployment, statefulset, replicaset, and so on), and pod name/ID.
February 2022
February 1, 2022
  • New Custom Azure tags are now supported for the Linux agent, version 2.22.5.
  • New You can now run multiple network attacks on the same target specifying different network devices.
  • Info The Application Level Fault Injection (ALFI) feature was deprecated.
January 2022
January 1, 2022
  • New On the Attacks page, unhealthy Kubernetes pods now have a visual warning indicating that they should not be targeted. Targets containing unhealthy pods now have a visual warning as well.
  • New Gremlin now supports network attacks for IPv6.
  • New Error codes from killed executions are now shown on the Attack Details page.
This is some text inside of a div block.
Installing the Gremlin Agent
Authenticating the Gremlin Agent
Configuring the Gremlin Agent
Managing the Gremlin Agent
User Management
Health Checks
Command Line Interface
Updating Gremlin
Reliability Management (RM) Quick Start Guide
Services and Dependencies
Detected Risks
Reliability Tests
Reliability Score
Deploying Failure Flags on AWS Lambda
Deploying Failure Flags on AWS ECS
Deploying Failure Flags on Kubernetes
Classes, methods, & attributes
API Keys
Container security
Additional Configuration for Helm
Amazon CloudWatch Health Check
AppDynamics Health Check
Blackhole Experiment
CPU Experiment
Certificate Expiry
Custom Health Check
Custom Load Generator
DNS Experiment
Datadog Health Check
Disk Experiment
Dynatrace Health Check
Grafana Cloud Health Check
Grafana Cloud K6
IO Experiment
Install Gremlin on Kubernetes manually
Install Gremlin on OpenShift 4
Installing Gremlin on AWS - Configuring your VPC
Installing Gremlin on Kubernetes with Helm
Installing Gremlin on Windows
Installing Gremlin on a virtual machine
Installing the Failure Flags SDK
Latency Experiment
Memory Experiment
Network Tags
New Relic Health Check
Packet Loss Attack
PagerDuty Health Check
Preview: Gremlin in Kubernetes Restricted Networks
Private Network Integration Agent
Process Collection
Process Killer Experiment
Prometheus Health Check
Configuring Role Based Access Control (RBAC)
Running Failure Flags experiments
Scheduling Scenarios
Shared Scenarios
Shutdown Experiment
Managing Teams
Time Travel Experiment
Troubleshooting Gremlin on OpenShift
User Authentication via SAML and Okta
Managing Users
Integration Agent for Linux
Test Suites
Restricting Testing Times
Process Exhaustion Experiment
Enabling DNS collection
Authenticating Users with Microsoft Entra ID (Azure Active Directory) via SAML
AWS Quick Start Guide