FC-ADL: Efficient Microservice Anomaly Detection and Localisation Through Functional Connectivity
Giles Winchester, George Parisis, Luc Berthouze

TL;DR
FC-ADL introduces a scalable, neuroscience-inspired method for detecting and localizing anomalies in microservice architectures by analyzing dynamic interdependencies, outperforming existing approaches in accuracy and efficiency.
Contribution
The paper presents FC-ADL, a novel scalable approach leveraging functional connectivity to improve anomaly detection and localization in large-scale microservice systems.
Findings
Achieves top detection and localization performance across various fault scenarios.
Demonstrates scalability on Alibaba's large real-world microservice deployment.
Reduces computational overhead compared to causal inference methods.
Abstract
Microservices have transformed software architecture through the creation of modular and independent services. However, they introduce operational complexities in service integration and system management that makes swift and accurate anomaly detection and localisation challenging. Despite the complex, dynamic, and interconnected nature of microservice architectures, prior works that investigate metrics for anomaly detection rarely include explicit information about time-varying interdependencies. And whilst prior works on fault localisation typically do incorporate information about dependencies between microservices, they scale poorly to real world large-scale deployments due to their reliance on computationally expensive causal inference. To address these challenges we propose FC-ADL, an end-to-end scalable approach for detecting and localising anomalous changes from microservice…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software-Defined Networks and 5G · Cloud Computing and Resource Management
