Large Language Models for Agentic NetOps and AIOps: Architectures, Evaluation, and Safety
Muhammad Bilal, Jon Crowcroft, Ruizhi Wang, Xiaolong Xu, Schahram Dustdar

TL;DR
This paper explores how large language models are transforming network and IT operations through agent-based systems, emphasizing architecture, evaluation, and safety considerations.
Contribution
It provides a comprehensive survey of agentic NetOps and AIOps architectures, evaluation methods, and safety/security issues, highlighting the importance of workflow-centered assessment.
Findings
Operational reliability depends on surrounding machinery, not just the model.
Evaluation should include trace quality, sandbox testing, and rollback mechanisms.
Security and governance risks are heightened with close-to-control agents.
Abstract
Large language models are increasingly being used to support network operations (NetOps) and artificial intelligence for IT operations (AIOps), including incident investigation, root-cause analysis, configuration synthesis, and limited self-healing. In both NetOps and AIOps, this shift is changing how tasks are managed. Agent-based operations work as workflows, from gathering evidence to taking action, following permissions, policies, and checks, and providing rollback options when necessary. This is crucial because operational decisions can have instant impacts. To make the argument concrete, we organise the relevant literature around the hierarchy of autonomy, tool scope, evidence traces, and assurance contracts. These contracts define what an agent may observe, propose, and execute. They also define the checks that must pass before any action is allowed. A consistent pattern appears…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
