Adaptive Detection of Software Aging under Workload Shift
Rafael Jose Moura Silva, Maria Gizele Nascimento, Fumio Machida, Ermeson Andrade

TL;DR
This paper introduces an adaptive machine learning approach for detecting software aging in long-running systems under changing workloads, demonstrating superior accuracy over static models, especially with workload shifts.
Contribution
It applies and evaluates adaptive concept drift detection methods, specifically ADWIN, for software aging detection under dynamic workload conditions, which is a novel application.
Findings
Adaptive model with ADWIN maintains high accuracy (F1 > 0.93) across workload shifts.
Static models' performance drops significantly under unseen workload profiles.
Adaptive detection methods outperform static models in dynamic environments.
Abstract
Software aging is a phenomenon that affects long-running systems, leading to progressive performance degradation and increasing the risk of failures. To mitigate this problem, this work proposes an adaptive approach based on machine learning for software aging detection in environments subject to dynamic workload conditions. We evaluate and compare a static model with adaptive models that incorporate adaptive detectors, specifically the Drift Detection Method (DDM) and Adaptive Windowing (ADWIN), originally developed for concept drift scenarios and applied in this work to handle workload shifts. Experiments with simulated sudden, gradual, and recurring workload transitions show that static models suffer a notable performance drop when applied to unseen workload profiles, whereas the adaptive model with ADWIN maintains high accuracy, achieving an F1-Score above 0.93 in all analyzed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware Engineering Research · Distributed systems and fault tolerance · Software System Performance and Reliability
