Detecting Performance Degradation of Software-Intensive Systems in the Presence of Trends and Long-Range Dependence
Alexey Artemov, Evgeny Burnaev

TL;DR
This paper presents a holistic change-point detection methodology for large-scale software systems, effectively identifying performance degradation amidst trends and long-range dependencies using ensemble detectors and optimal trend estimation.
Contribution
It introduces a novel trend estimation method tailored for long-range dependent noise and an ensemble-based change-point detection approach for system failure detection.
Findings
Effective detection of performance degradation in real datasets
Robustness to trends and long-range dependence
Improved detection accuracy over existing methods
Abstract
As contemporary software-intensive systems reach increasingly large scale, it is imperative that failure detection schemes be developed to help prevent costly system downtimes. A promising direction towards the construction of such schemes is the exploitation of easily available measurements of system performance characteristics such as average number of processed requests and queue size per unit of time. In this work, we investigate a holistic methodology for detection of abrupt changes in time series data in the presence of quasi-seasonal trends and long-range dependence with a focus on failure detection in computer systems. We propose a trend estimation method enjoying optimality properties in the presence of long-range dependent noise to estimate what is considered "normal" system behaviour. To detect change-points and anomalies, we develop an approach based on the ensembles of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
