Large Language Models for Agentic NetOps and AIOps: Architectures, Evaluation, and Safety

Muhammad Bilal; Jon Crowcroft; Ruizhi Wang; Xiaolong Xu; Schahram Dustdar

arXiv:2605.12729·cs.NI·May 14, 2026

Large Language Models for Agentic NetOps and AIOps: Architectures, Evaluation, and Safety

Muhammad Bilal, Jon Crowcroft, Ruizhi Wang, Xiaolong Xu, Schahram Dustdar

PDF

TL;DR

This paper explores how large language models are transforming network and IT operations through agent-based systems, emphasizing architecture, evaluation, and safety considerations.

Contribution

It provides a comprehensive survey of agentic NetOps and AIOps architectures, evaluation methods, and safety/security issues, highlighting the importance of workflow-centered assessment.

Findings

01

Operational reliability depends on surrounding machinery, not just the model.

02

Evaluation should include trace quality, sandbox testing, and rollback mechanisms.

03

Security and governance risks are heightened with close-to-control agents.

Abstract

Large language models are increasingly being used to support network operations (NetOps) and artificial intelligence for IT operations (AIOps), including incident investigation, root-cause analysis, configuration synthesis, and limited self-healing. In both NetOps and AIOps, this shift is changing how tasks are managed. Agent-based operations work as workflows, from gathering evidence to taking action, following permissions, policies, and checks, and providing rollback options when necessary. This is crucial because operational decisions can have instant impacts. To make the argument concrete, we organise the relevant literature around the hierarchy of autonomy, tool scope, evidence traces, and assurance contracts. These contracts define what an agent may observe, propose, and execute. They also define the checks that must pass before any action is allowed. A consistent pattern appears…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.