AI-Augmented Chaos: Intelligent Resilience Testing for Cloud Systems
Use machine learning to schedule and adapt chaos engineering experiments based on real-time risk, cost, and performance to improve cloud system resilience.
Microsoft 365 services in North America suffered a brief outage caused by a misconfigured portion of network infrastructure, which disrupted Teams and other services until traffic was rerouted.