PerfCE: Performance Debugging on Databases with Chaos Engineering-Enhanced Causality Analysis
Zhenlan Ji, Pingchuan Ma, Shuai Wang

TL;DR
PERFCE introduces a novel approach combining chaos engineering and causality analysis to diagnose database performance issues effectively, using offline learning and online root cause identification.
Contribution
This work pioneers the application of chaos engineering for performance debugging in databases, integrating causal graphs and structural equation models for accurate root cause analysis.
Findings
PERFCE outperforms prior methods on synthetic datasets.
It achieves high accuracy on real-world MySQL and TiDB databases.
The framework is moderately expensive but effective.
Abstract
Debugging performance anomalies in real-world databases is challenging. Causal inference techniques enable qualitative and quantitative root cause analysis of performance downgrade. Nevertheless, causality analysis is practically challenging, particularly due to limited observability. Recently, chaos engineering has been applied to test complex real-world software systems. Chaos frameworks like Chaos Mesh mutate a set of chaos variables to inject catastrophic events (e.g., network slowdowns) to "stress" software systems. The systems under chaos stress are then tested using methods like differential testing to check if they retain their normal functionality (e.g., SQL query output is always correct under stress). Despite its ubiquity in the industry, chaos engineering is now employed mostly to aid software testing rather for performance debugging. This paper identifies novel usage of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Anomaly Detection Techniques and Applications · Bayesian Modeling and Causal Inference
