AEGIS: No Tool Call Left Unchecked -- A Pre-Execution Firewall and Audit Layer for AI Agents
Aojie Yuan, Zhiyuan Su, Yue Zhao

TL;DR
AEGIS introduces a pre-execution firewall and audit layer for AI agents that enhances control, safety, and accountability by intercepting and validating tool calls before execution, supporting multiple frameworks and reducing risks.
Contribution
This paper presents AEGIS, a novel pre-execution interception framework for AI agents that enables risk assessment, human approval, and tamper-evident auditing across diverse programming environments.
Findings
Blocks all attack instances in the curated suite
Achieves a 1.2% false positive rate on benign calls
Adds median latency of 8.3 ms during interception
Abstract
AI agents increasingly act through external tools: they query databases, execute shell commands, read and write files, and send network requests. Yet in most current agent stacks, model-generated tool calls are handed to the execution layer with no framework-agnostic control point in between. Post-execution observability can record these actions, but it cannot stop them before side effects occur. We present AEGIS, a pre-execution firewall and audit layer for AI agents. AEGIS interposes on the tool-execution path and applies a three-stage pipeline: (i) deep string extraction from tool arguments, (ii) content-first risk scanning, and (iii) composable policy validation. High-risk calls can be held for human approval, and all decisions are recorded in a tamper-evident audit trail based on Ed25519 signatures and SHA-256 hash chaining. In the current implementation, AEGIS supports 14 agent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSecurity and Verification in Computing · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)
