Permit: Permission-Aware Representation Intervention for Controlled Generation in Large Language Models

Pengcheng Sun; Lan Zhang; Zhaopeng Zhang; Jiewei Lai; Chen Tang

arXiv:2605.09480·cs.CR·May 12, 2026

Permit: Permission-Aware Representation Intervention for Controlled Generation in Large Language Models

Pengcheng Sun, Lan Zhang, Zhaopeng Zhang, Jiewei Lai, Chen Tang

PDF

TL;DR

Permit introduces a permission-aware framework that enforces fine-grained control over large language models' outputs by intervening in hidden states, significantly reducing information leakage.

Contribution

It proposes a novel method to control LLM outputs by manipulating hidden states based on permission conditions, with minimal additional parameters.

Findings

01

Permit outperforms state-of-the-art methods in permission control accuracy.

02

It reduces information leakage to near zero.

03

Achieves over 18% F1-score improvement with >98% fewer trainable parameters.

Abstract

Large language models (LLMs) are increasingly deployed in enterprise settings where they handle sensitive documents and user context, raising acute concerns over security and controllability. Conventional access control regulates whether information is accessible to the model, yet leaves how the model uses that information at generation time largely unconstrained: once sensitive content enters the context, outputs may still drift beyond a user's authorized scope. We present Permit, a novel permission-aware representation intervention framework that closes this gap by enforcing fine-grained control directly on the model's hidden states. Through exploratory analysis, we find that permission conditions induce hidden-state shifts that are (i) separable across permissions and (ii) concentrated in a small set of dominant directions. Permit exploits this geometry in two stages: it first…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.