SnapAudit: Active Auditing of Differentially Private In-Context Learning via Snapshot-Based Simulation
Yuyang Xia, Ruixuan Liu, Li Xiong

TL;DR
SnapAudit is an efficient framework for actively auditing differential privacy in in-context learning, using snapshot-based simulation to verify privacy guarantees and uncover flaws in existing mechanisms.
Contribution
It introduces a novel snapshot-based simulation method for active privacy auditing that is faster and more accurate than prior approaches.
Findings
Achieves 80-200x speedup over passive auditing methods.
Uncovers underestimation of leakage in Gaussian noise calibrations.
Identifies privacy violations due to incorrect sensitivity analysis.
Abstract
In-context learning (ICL) allows LLMs to adapt to new tasks via a few demonstrations, but those demonstrations may contain sensitive data. Differentially private (DP) ICL mechanisms mitigate this risk by injecting noise into the aggregation step, but verifying that an implementation actually meets its claimed privacy bound currently requires repeated end-to-end membership-inference attacks (MIAs) against the pipeline as a black box, incurring prohibitive LLM cost and yielding unstable empirical privacy estimates. We propose SnapAudit, an active auditing framework that decomposes a DP-ICL pipeline into a deterministic clean-inference stage and a stochastic DP-noise stage, and audits the full pipeline by combining a small snapshot of the former with bootstrap simulation of the latter. Because clean LLM outputs are near-deterministic at temperature zero, a few thousand clean LLM calls…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
