FaaSRCA: Full Lifecycle Root Cause Analysis for Serverless Applications
Jin Huang, Pengfei Chen, Guangba Yu, Yilun Wang, Haiyu Huang, Zilong, He

TL;DR
FaaSRCA is a comprehensive root cause analysis method for serverless applications that considers the entire lifecycle, using multi-modal data and graph neural networks to improve accuracy over existing methods.
Contribution
It introduces a novel full lifecycle RCA approach for serverless systems, leveraging multi-modal observability data and GAT-based graph auto-encoders for precise root cause localization.
Findings
Outperforms baseline methods with 21.25% to 81.63% top-k precision improvements.
Effectively models the entire lifecycle of serverless functions.
Demonstrates robustness on two serverless benchmarks.
Abstract
Serverless becomes popular as a novel computing paradigms for cloud native services. However, the complexity and dynamic nature of serverless applications present significant challenges to ensure system availability and performance. There are many root cause analysis (RCA) methods for microservice systems, but they are not suitable for precise modeling serverless applications. This is because: (1) Compared to microservice, serverless applications exhibit a highly dynamic nature. They have short lifecycle and only generate instantaneous pulse-like data, lacking long-term continuous information. (2) Existing methods solely focus on analyzing the running stage and overlook other stages, failing to encompass the entire lifecycle of serverless applications. To address these limitations, we propose FaaSRCA, a full lifecycle root cause analysis method for serverless applications. It integrates…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoftware System Performance and Reliability · Software Reliability and Analysis Research · Software Testing and Debugging Techniques
