Automated Chaos Experiments: Enhancing Continuous Testing with Controlled Failure Scenarios
Keywords:
Chaos engineering , CI/CD workflows, organizations.Abstract
Chaos engineering has become an essential field for improving the resilience of contemporary software systems by proactively detecting and mitigating vulnerabilities through controlled failure scenarios. This study examines the progression of chaotic engineering from manual techniques to sophisticated automated systems, highlighting its incorporation into continuous testing pipelines. Automated chaos engineering utilizes tools and frameworks such as ChaosEater, Gremlin, and digital twins to methodically introduce errors, facilitating real-time monitoring, resilience evaluation, and recovery analysis. Automating experiment design,
execution, and feedback loops substantially decreases operational overhead while improving scalability, security, and observability in distributed systems.
References
Basiri, A., Hochstein, L., Jones, N., & Tucker, H. (2019). "Automating chaos experiments in production." arXiv preprint arXiv:1905.04648.
arxiv.org
Simonsson, J., Zhang, L., Morin, B., Baudry, B., & Monperrus, M. (2019). "Observability and Chaos Engineering on System Calls for Containerized Applications in Docker." arXiv preprint arXiv:1907.13039. arxiv.org