ActionNex: A Virtual Outage Manager for Cloud Computing
Zhenfeng Lin, Haoji Hu, Ming Hao, Xuchao Zhang, Ryan Zhang, Junhao Li, Ze Li, Oleg Kulygin, Chetan Bansal, Hatay Tuna, Murali Chintalapati, Sheila Jiang, Salman Zafar, Angie Anderson

TL;DR
ActionNex is a production-grade AI system designed to assist in outage management for cloud computing by providing real-time, context-aware recommendations based on multimodal operational data.
Contribution
It introduces a hierarchical memory and reasoning framework that integrates knowledge distillation, real-time signals, and human feedback for outage assistance.
Findings
Achieved 71.4% precision and 52.8-54.8% recall on real Azure outage data.
Successfully piloted in production with positive early feedback.
Supports end-to-end outage management with real-time updates and continual learning.
Abstract
Outage management in large-scale cloud operations remains heavily manual, requiring rapid triage, cross-team coordination, and experience-driven decisions under partial observability. We present \textbf{ActionNex}, a production-grade agentic system that supports end-to-end outage assistance, including real-time updates, knowledge distillation, and role- and stage-conditioned next-best action recommendations. ActionNex ingests multimodal operational signals (e.g., outage content, telemetry, and human communications) and compresses them into critical events that represent meaningful state transitions. It couples this perception layer with a hierarchical memory subsystem: long-term Key-Condition-Action (KCA) knowledge distilled from playbooks and historical executions, episodic memory of prior outages, and working memory of the live context. A reasoning agent aligns current critical events…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
