DeLag: Using Multi-Objective Optimization to Enhance the Detection of Latency Degradation Patterns in Service-based Systems
Luca Traini, Vittorio Cortellessa

TL;DR
DeLag is an automated, multi-objective search-based method that improves detection of latency degradation patterns in service-based systems, outperforming existing approaches in accuracy and efficiency.
Contribution
We introduce DeLag, a novel search-based approach that simultaneously detects multiple latency degradation patterns with optimized metrics, advancing performance debugging techniques.
Findings
DeLag outperforms three state-of-the-art methods in effectiveness.
DeLag is more efficient than most baselines on large datasets.
DeLag achieves statistically significant improvements in detection accuracy.
Abstract
Performance debugging in production is a fundamental activity in modern service-based systems. The diagnosis of performance issues is often time-consuming, since it requires thorough inspection of large volumes of traces and performance indices. In this paper we present DeLag, a novel automated search-based approach for diagnosing performance issues in service-based systems. DeLag identifies subsets of requests that show, in the combination of their Remote Procedure Call execution times, symptoms of potentially relevant performance issues. We call such symptoms Latency Degradation Patterns. DeLag simultaneously searches for multiple latency degradation patterns while optimizing precision, recall and latency dissimilarity. Experimentation on 700 datasets of requests generated from two microservice-based systems shows that our approach provides better and more stable effectiveness than…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Cloud Computing and Resource Management · Software Engineering Research
