Perspectives on a Reliability Monitoring Framework for Agentic AI Systems

Niclas Flehmig; Mary Ann Lundteigen; Shen Yin

arXiv:2511.09178·cs.AI·November 13, 2025

Perspectives on a Reliability Monitoring Framework for Agentic AI Systems

Niclas Flehmig, Mary Ann Lundteigen, Shen Yin

PDF

Open Access

TL;DR

This paper proposes a two-layered reliability monitoring framework for agentic AI systems, combining out-of-distribution detection and transparency to enhance safety and support human intervention in high-risk applications.

Contribution

It introduces a novel two-layered monitoring framework specifically designed for agentic AI systems, addressing their unique reliability challenges during operation.

Findings

01

Framework enables detection of novel inputs and internal transparency.

02

Supports human decision-making for intervention.

03

Lays foundation for future mitigation techniques.

Abstract

The implementation of agentic AI systems has the potential of providing more helpful AI systems in a variety of applications. These systems work autonomously towards a defined goal with reduced external control. Despite their potential, one of their flaws is the insufficient reliability which makes them especially unsuitable for high-risk domains such as healthcare or process industry. Unreliable systems pose a risk in terms of unexpected behavior during operation and mitigation techniques are needed. In this work, we derive the main reliability challenges of agentic AI systems during operation based on their characteristics. We draw the connection to traditional AI systems and formulate a fundamental reliability challenge during operation which is inherent to traditional and agentic AI systems. As our main contribution, we propose a two-layered reliability monitoring framework for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Risk and Safety Analysis · Safety Systems Engineering in Autonomy