Score: 1

Model Discovery and Graph Simulation: A Lightweight Alternative to Chaos Engineering

Published: June 12, 2025 | arXiv ID: 2506.11176v1

By: Anatoly A. Krasnovsky, Alexander Zorkin

Potential Business Impact:

Finds app problems before they break.

Business Areas:
Simulation Software

Microservice applications are prone to cascading failures because of dense inter-service dependencies. Ensuring resilience usually demands fault-injection experiments in production-like setups. We propose \textit{model discovery} -- an automated CI/CD step that extracts a live dependency graph from trace data -- and show that this lightweight representation is sufficient for accurate resilience prediction. Using the DeathStarBench Social Network, we build the graph, simulate failures via Monte-Carlo, and run matching chaos experiments on the real system. The graph model closely matches reality: with no replication, 16 trials yield an observed resilience of 0.186 versus a predicted 0.161; with replication, both observed and predicted values converge to 0.305 (mean absolute error \leq 0.0004). These results indicate that even a simple, automatically discovered graph can estimate microservice availability with high fidelity, offering rapid design-time insight without full-scale failure testing.

Repos / Data Links

Page Count
8 pages

Category
Computer Science:
Software Engineering