To Redact, or not to Redact? A Local LLM Approach to Deliberative Process Privilege Classification
Maik Larooij, David Graus

TL;DR
This paper develops a local, small-scale LLM-based method for classifying sensitive government documents under FOIA exemptions, avoiding cloud API use and improving classification performance.
Contribution
It introduces a novel approach using a local Qwen3.5 9B model with advanced prompting techniques for sensitivity classification, outperforming previous models.
Findings
Chain-of-Thought and few-shot prompting improve recall and F2 score.
The local model approaches the performance of commercial models.
Sentences classified as deliberative contain more opinion-expressing verbs and first-person phrasing.
Abstract
Government transparency laws, like the Freedom of Information (FOIA) acts in the United States and United Kingdom, and the Woo (Open Government Act) in the Netherlands, grant citizens the right to directly request documents from the government. As these documents might contain sensitive information, such as personal information or threats to national security, the laws allow governments to redact sensitive parts of the documents prior to release. We build on prior research to perform automatic sensitivity classification for the FOIA Exemption 5 deliberative process privilege using Large Language Models (LLMs). However, processing documents not yet cleared for review via third-party cloud APIs is often legally or politically untenable. Therefore, in this work, we perform sensitivity classification with a small, local model, deployable on consumer-grade hardware (Qwen3.5 9B). We compare…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
