SentinelAgent: Intent-Verified Delegation Chains for Securing Federal Multi-Agent AI Systems
KrishnaSaiReddy Patil

TL;DR
SentinelAgent introduces a formal framework and runtime protocol for verifiable delegation chains in multi-agent AI systems, ensuring policy compliance and intent preservation under adversarial conditions.
Contribution
The paper presents a novel formal calculus, a runtime enforcement protocol, and comprehensive verification methods for secure, intent-preserving delegation in federal multi-agent AI systems.
Findings
Achieved 100% true positive rate with zero false positives on DelegationBench v4.
Blocked all 30 adversarial attacks in black-box testing with no false positives.
Mechanically verified properties across 2.7 million states with zero violations.
Abstract
When Agent A delegates to Agent B, which invokes Tool C on behalf of User X, no existing framework can answer: whose authorization chain led to this action, and where did it violate policy? This paper introduces SentinelAgent, a formal framework for verifiable delegation chains in federal multi-agent AI systems. The Delegation Chain Calculus (DCC) defines seven properties - six deterministic (authority narrowing, policy preservation, forensic reconstructibility, cascade containment, scope-action conformance, output schema conformance) and one probabilistic (intent preservation) - with four meta-theorems and one proposition establishing the practical infeasibility of deterministic intent verification. The Intent-Preserving Delegation Protocol (IPDP) enforces all seven properties at runtime through a non-LLM Delegation Authority Service. A three-point verification lifecycle achieves 100%…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
