AgentOps: Enabling Observability of LLM Agents
Liming Dong, Qinghua Lu, Liming Zhu

TL;DR
This paper introduces a taxonomy for AgentOps to improve observability of LLM agents, enhancing AI safety by enabling better monitoring, logging, and analytics of agent behavior.
Contribution
It presents a comprehensive taxonomy of AgentOps artifacts and data, developed through a systematic study, to guide the design of observability infrastructure for LLM agents.
Findings
Developed a taxonomy of AgentOps artifacts and data
Mapped existing tools to the taxonomy
Provides a reference for designing monitoring systems
Abstract
Large language model (LLM) agents have demonstrated remarkable capabilities across various domains, gaining extensive attention from academia and industry. However, these agents raise significant concerns on AI safety due to their autonomous and non-deterministic behavior, as well as continuous evolving nature . From a DevOps perspective, enabling observability in agents is necessary to ensuring AI safety, as stakeholders can gain insights into the agents' inner workings, allowing them to proactively understand the agents, detect anomalies, and prevent potential failures. Therefore, in this paper, we present a comprehensive taxonomy of AgentOps, identifying the artifacts and associated data that should be traced throughout the entire lifecycle of agents to achieve effective observability. The taxonomy is developed based on a systematic mapping study of existing AgentOps tools. Our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMulti-Agent Systems and Negotiation
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Dropout · Dense Connections · Layer Normalization · Adam · Attention Dropout · Linear Layer · Weight Decay · Linear Warmup With Linear Decay
