Predicting Failures in Multi-Tier Distributed Systems
Leonardo Mariani, Mauro Pezz\`e, Oliviero Riganelli, Rui Xin

TL;DR
This paper introduces PreMiSE, a lightweight method that accurately predicts failures and locates faults in multi-tier distributed systems, enhancing system dependability with minimal false positives.
Contribution
PreMiSE combines anomaly and signature-based techniques to improve failure prediction and fault localization accuracy in cloud-based multi-tier systems.
Findings
High prediction precision with low false positives
Effective fault localization in cloud-based systems
Low overhead of the proposed approach
Abstract
Many applications are implemented as multi-tier software systems, and are executed on distributed infrastructures, like cloud infrastructures, to benefit from the cost reduction that derives from dynamically allocating resources on-demand. In these systems, failures are becoming the norm rather than the exception, and predicting their occurrence, as well as locating the responsible faults, are essential enablers of preventive and corrective actions that can mitigate the impact of failures, and significantly improve the dependability of the systems. Current failure prediction approaches suffer either from false positives or limited accuracy, and do not produce enough information to effectively locate the responsible faults. In this paper, we present PreMiSE, a lightweight and precise approach to predict failures and locate the corresponding faults in multi-tier distributed systems.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Network Security and Intrusion Detection · Anomaly Detection Techniques and Applications
