AgentGuardian: Learning Access Control Policies to Govern AI Agent Behavior

Nadya Abaev; Denis Klimov; Gerard Levinov; David Mimran; Yuval Elovici; Asaf Shabtai

arXiv:2601.10440·cs.CR·January 16, 2026

AgentGuardian: Learning Access Control Policies to Govern AI Agent Behavior

Nadya Abaev, Denis Klimov, Gerard Levinov, David Mimran, Yuval Elovici, Asaf Shabtai

PDF

Open Access

TL;DR

AgentGuardian is a security framework that learns and enforces context-aware access control policies to ensure AI agents operate securely, accurately, and without malicious misuse, by monitoring behaviors and input patterns during a controlled learning phase.

Contribution

It introduces a novel adaptive policy learning framework for AI agents that enhances security and robustness through context-aware governance and control-flow analysis.

Findings

01

Effectively detects malicious or misleading inputs

02

Reduces hallucination-driven errors in AI agents

03

Maintains normal agent functionality during enforcement

Abstract

Artificial intelligence (AI) agents are increasingly used in a variety of domains to automate tasks, interact with users, and make decisions based on data inputs. Ensuring that AI agents perform only authorized actions and handle inputs appropriately is essential for maintaining system integrity and preventing misuse. In this study, we introduce the AgentGuardian, a novel security framework that governs and protects AI agent operations by enforcing context-aware access-control policies. During a controlled staging phase, the framework monitors execution traces to learn legitimate agent behaviors and input patterns. From this phase, it derives adaptive policies that regulate tool calls made by the agent, guided by both real-time input context and the control flow dependencies of multi-step agent actions. Evaluation across two real-world AI agent applications demonstrates that…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAccess Control and Trust · Security and Verification in Computing · Adversarial Robustness in Machine Learning