Planning and Architecting for Reliability - Part 2

Don’t wait for an incident to start focusing on the reliability of your systems. This two-part series takes a proactive approach to reliability, so you can prevent incidents from happening in the first place. In this second part, we take action to improve reliability, running tests that fortify the technologies in your stack and build resilience to common failure modes.