Unleashing the Power of Randomization in Auditing Differentially Private   ML

Krishna Pillutla; Galen Andrew; Peter Kairouz; H. Brendan McMahan,; Alina Oprea; Sewoong Oh

arXiv:2305.18447·cs.LG·May 31, 2023·5 cites

Unleashing the Power of Randomization in Auditing Differentially Private ML

Krishna Pillutla, Galen Andrew, Peter Kairouz, H. Brendan McMahan,, Alina Oprea, Sewoong Oh

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel auditing methodology for differentially private machine learning that leverages randomized canaries, lifted differential privacy, and advanced statistical tests to improve audit accuracy and efficiency.

Contribution

It proposes Lifted Differential Privacy (LiDP) for handling randomized datasets and develops a new auditing framework using multiple canaries and correlation-aware confidence intervals.

Findings

01

Significant improvements in sample complexity demonstrated empirically.

02

Framework effectively distinguishes models with different canary counts.

03

Method adaptable to stronger canary designs and real-world data.

Abstract

We present a rigorous methodology for auditing differentially private machine learning algorithms by adding multiple carefully designed examples called canaries. We take a first principles approach based on three key components. First, we introduce Lifted Differential Privacy (LiDP) that expands the definition of differential privacy to handle randomized datasets. This gives us the freedom to design randomized canaries. Second, we audit LiDP by trying to distinguish between the model trained with $K$ canaries versus $K - 1$ canaries in the dataset, leaving one canary out. By drawing the canaries i.i.d., LiDP can leverage the symmetry in the design and reuse each privately trained model to run multiple statistical tests, one for each canary. Third, we introduce novel confidence intervals that take advantage of the multiple test statistics by adapting to the empirical higher-order…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Unleashing the Power of Randomization in Auditing Differentially Private ML· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Cryptography and Data Security