Hypergraph and Latent ODE Learning for Multimodal Root Cause Localization in Microservices
Xin Liu, Yuhang He, Sichen Zhao, Kejian Tong, Xingyu Zhang

TL;DR
This paper introduces HyperODE RCA, a comprehensive framework for root cause localization in microservices, integrating hypergraph attention, latent ODEs, and multimodal fusion to improve accuracy and interpretability.
Contribution
It combines hypergraph attention, latent ODEs, and multimodal fusion in a unified model, advancing root cause analysis in complex microservice environments.
Findings
Outperforms strong baselines in ranking and classification tasks.
Enhances interpretability via learned hypergraph attention.
Demonstrates robustness with variational and causal regularizations.
Abstract
Root cause localization in cloud native microservice systems requires modeling complex service dependencies, irregular temporal dynamics, and heterogeneous observability data. We present HyperODE RCA, a unified framework that combines hypergraph attention learning, latent ordinary differential equations, and multimodal cross attention fusion for fine grained root cause analysis. The method learns higher order service interactions through differentiable hyperedge construction, captures continuous anomaly evolution from irregular observations with an ODE RNN encoder, and adaptively fuses logs, traces, metrics, entities, and events using context aware modality routing. We further improve robustness with a variational information bottleneck, temporal causal regularization, and invariant risk constraints. Experiments on the Tianchi AIOps benchmark show clear gains over strong baselines in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
