FORTIS: Benchmarking Over-Privilege in Agent Skills

Shawn Li; Chenxiao Yu; Han Wang; Wei Yang; Ryan Rossi; Franck Dernoncourt; Xiyang Hu; Philip Yu; Chaowei Xiao; Huan Zhang; Yue Zhao

arXiv:2605.09163·cs.AI·May 14, 2026

FORTIS: Benchmarking Over-Privilege in Agent Skills

Shawn Li, Chenxiao Yu, Han Wang, Wei Yang, Ryan Rossi, Franck Dernoncourt, Xiyang Hu, Philip Yu, Chaowei Xiao, Huan Zhang, Yue Zhao

PDF

1 Repo

TL;DR

FORTIS introduces a benchmark to evaluate over-privilege in language model agents, revealing that current models frequently exceed their intended skill boundaries, especially under realistic conditions.

Contribution

The paper presents FORTIS, a novel benchmark for assessing over-privilege in agent skills, highlighting prevalent privilege escalation issues in state-of-the-art models.

Findings

01

Models often select higher-privilege skills than necessary.

02

Over-privileged behavior occurs across diverse models and domains.

03

Real-world conditions exacerbate privilege escalation issues.

Abstract

Large language model agents increasingly operate through an intermediate skill layer that mediates between user intent and concrete task execution. This layer is widely treated as an organizational abstraction, but we argue it is also a privilege boundary that current models routinely exceed. We present \textbf{FORTIS}, a benchmark that evaluates over-privilege in agent skills across two stages: whether a model selects the minimally sufficient skill from a large overlapping library, and whether it executes that skill without expanding into broader tools or actions than the skill permits. Across ten frontier models and three domains, we find that over-privileged behavior is the norm rather than the exception. Models consistently reach for higher-privilege skills and tools than the task requires, failing at both stages at rates that remain high even for the strongest available models.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lili0415/FORTIS-Benchmark
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.