No More, No Less: Least-Privilege Language Models
Paulius Rauba, Dominykas Seputis, Patrikas Vanagas, Mihaela van der Schaar

TL;DR
This paper introduces least-privilege language models, a novel framework inspired by computer security principles, to control internal model capabilities during deployment without retraining, enhancing security and flexibility.
Contribution
It formalizes the concept of privilege within language models, proposes a monitor-allocator-enforcer stack for deployment control, and introduces Nested Least-Privilege Networks as a reversible, shape-preserving intervention.
Findings
Provides a policy-usable privilege-utility frontier
Enables selective suppression of model capabilities
Maintains limited collateral degradation
Abstract
Least privilege is a core security principle: grant each request only the minimum access needed to achieve its goal. Deployed language models almost never follow it, instead being exposed through a single API endpoint that serves all users and requests. This gap exists not because least privilege would be unhelpful; deployments would benefit greatly from reducing unnecessary capability exposure. The real obstacle is definitional and mechanistic: what does "access" mean inside a language model, and how can we enforce it without retraining or deploying multiple models? We take inspiration from least privilege in computer systems and define a class of models called least-privilege language models, where privilege is reachable internal computation during the forward pass. In this view, lowering privilege literally shrinks the model's accessible function class, as opposed to denying access…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAccess Control and Trust · Security and Verification in Computing · Software System Performance and Reliability
