Adversarial Sample-Based Approach for Tighter Privacy Auditing in Final   Model-Only Scenarios

Sangyeon Yoon; Wonje Jeung; Albert No

arXiv:2412.01756·cs.CR·February 25, 2025

Adversarial Sample-Based Approach for Tighter Privacy Auditing in Final Model-Only Scenarios

Sangyeon Yoon, Wonje Jeung, Albert No

PDF

Open Access

TL;DR

This paper presents a new adversarial auditing method that provides tighter empirical privacy bounds for differentially private models in final model-only scenarios, improving over traditional heuristics.

Contribution

A novel loss-based input-space auditing technique that achieves more accurate empirical privacy bounds without extra assumptions.

Findings

01

Achieves empirical lower bounds closer to theoretical privacy guarantees.

02

Outperforms canary-based heuristics in final model-only scenarios.

03

Demonstrates effectiveness on MNIST with a privacy budget of 10.0.

Abstract

Auditing Differentially Private Stochastic Gradient Descent (DP-SGD) in the final model setting is challenging and often results in empirical lower bounds that are significantly looser than theoretical privacy guarantees. We introduce a novel auditing method that achieves tighter empirical lower bounds without additional assumptions by crafting worst-case adversarial samples through loss-based input-space auditing. Our approach surpasses traditional canary-based heuristics and is effective in final model-only scenarios. Specifically, with a theoretical privacy budget of $ε = 10.0$ , our method achieves empirical lower bounds of $4.914$ , compared to the baseline of $4.385$ for MNIST. Our work offers a practical framework for reliable and accurate privacy auditing in differentially private machine learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning