Perspectives on a Reliability Monitoring Framework for Agentic AI Systems
Niclas Flehmig, Mary Ann Lundteigen, Shen Yin

TL;DR
This paper proposes a two-layered reliability monitoring framework for agentic AI systems, combining out-of-distribution detection and transparency to enhance safety and support human intervention in high-risk applications.
Contribution
It introduces a novel two-layered monitoring framework specifically designed for agentic AI systems, addressing their unique reliability challenges during operation.
Findings
Framework enables detection of novel inputs and internal transparency.
Supports human decision-making for intervention.
Lays foundation for future mitigation techniques.
Abstract
The implementation of agentic AI systems has the potential of providing more helpful AI systems in a variety of applications. These systems work autonomously towards a defined goal with reduced external control. Despite their potential, one of their flaws is the insufficient reliability which makes them especially unsuitable for high-risk domains such as healthcare or process industry. Unreliable systems pose a risk in terms of unexpected behavior during operation and mitigation techniques are needed. In this work, we derive the main reliability challenges of agentic AI systems during operation based on their characteristics. We draw the connection to traditional AI systems and formulate a fundamental reliability challenge during operation which is inherent to traditional and agentic AI systems. As our main contribution, we propose a two-layered reliability monitoring framework for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Risk and Safety Analysis · Safety Systems Engineering in Autonomy
