Tighter Privacy Auditing of DP-SGD in the Hidden State Threat Model

Tudor Cebere; Aur\'elien Bellet; Nicolas Papernot

arXiv:2405.14457·cs.LG·May 30, 2025·1 cites

Tighter Privacy Auditing of DP-SGD in the Hidden State Threat Model

Tudor Cebere, Aur\'elien Bellet, Nicolas Papernot

PDF

Open Access 1 Video 3 Reviews

TL;DR

This paper introduces a novel privacy auditing method for DP-SGD under a hidden state threat model, revealing that existing privacy bounds can be tightened by adversarial gradient crafting, especially when intermediate updates are concealed.

Contribution

It proposes an adversarial gradient crafting approach to more accurately audit privacy loss in DP-SGD under the hidden state threat model, challenging existing bounds.

Findings

01

Adversarial gradient sequences outperform previous auditing methods.

02

Concealing intermediate updates does not always improve privacy guarantees.

03

Existing privacy bounds can be tightened in certain regimes.

Abstract

Machine learning models can be trained with formal privacy guarantees via differentially private optimizers such as DP-SGD. In this work, we focus on a threat model where the adversary has access only to the final model, with no visibility into intermediate updates. In the literature, this hidden state threat model exhibits a significant gap between the lower bound from empirical privacy auditing and the theoretical upper bound provided by privacy accounting. To challenge this gap, we propose to audit this threat model with adversaries that craft a gradient sequence designed to maximize the privacy loss of the final model without relying on intermediate updates. Our experiments show that this approach consistently outperforms previous attempts at auditing the hidden state model. Furthermore, our results advance the understanding of achievable privacy guarantees within this threat model.…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 6Confidence 2

Strengths

* The problem setting and results are really interesting and open up new avenues for future work. Identifying different regimes where the gap (between the new auditing lower bounds and the theoretical upper bounds) vanishes and where the gap remains is a nice contribution. * The paper flows nicely and is and enjoyable to read. * Simplicity is a virtue and I think the design of the gradient-crafting adversaries for privacy auditing is clever yet also intuitive.

Weaknesses

* The paper uncovers some interesting empirical results but doesn’t offer up much explanation for them. E.g. for regimes where there is a gap, we don’t really get an explanation as to why this gap exists. I do feel that this work falls short of its goal to “enhance our understanding of privacy leakage” (line 537) by reporting the results without interpretation. I think that including more discussions like the “high-level explanation” starting at line 373 would help round out the paper and provid

Reviewer 02Rating 6Confidence 3

Strengths

This paper applies GC models to audit various regimes, including both small and over-parameterized models. Additionally, different deep learning architectures (CNN, ResNet, FCNN) are evaluated in the experiments. Overall, the empirical results are convincing.

Weaknesses

The main weakness of this paper is that it does not introduce a new privacy auditing method; rather, it extends the existing gradient-crafting method to the hidden state regime. The primary technical contribution appears to be constructing a sequence of gradients without requiring knowledge of the intermediate gradients at each iteration.

Reviewer 03Rating 8Confidence 4

Strengths

Auditing DP algorithms is an interesting and important line of work. Previous work, both in auditing and privacy accounting, have suggested that hiding the intermediate iterations of DP-SGD can be beneficial for the privacy guarantees. In this work, authors improve the privacy auditing of non-convex loss functions, by carefully selecting gradient canaries that get inserted for to the DP-SGD gradient sum. The proposed method significantly improves the existing methods, suggesting that the previou

Weaknesses

In general the paper is very well written and the arguments are easy to follow. However, I get a bit confused on the discussion regarding accounting in the section 5.1. Since the threat model studied does not benefit from the subsampling amplification, I don't see any reason of using PRV accounting. When there is no subsampling, the privacy analysis could be performed tightly with Gaussian DP or Analytical Gaussian accounting (Balle et al 2018). The proposed method assumes that the crafted grad

Videos

Tighter Privacy Auditing of DP-SGD in the Hidden State Threat Model· slideslive

Taxonomy

TopicsBiometric Identification and Security

MethodsFocus