Privacy Accounting and Quality Control in the Sage Differentially   Private ML Platform

Mathias Lecuyer; Riley Spahn; Kiran Vodrahalli; Roxana Geambasu,; Daniel Hsu

arXiv:1909.01502·stat.ML·September 10, 2019

Privacy Accounting and Quality Control in the Sage Differentially Private ML Platform

Mathias Lecuyer, Riley Spahn, Kiran Vodrahalli, Roxana Geambasu,, Daniel Hsu

PDF

TL;DR

This paper introduces Sage, a DP ML platform that manages cumulative data leakage and balances privacy with utility through innovative accounting and adaptive training methods, enabling continuous learning on sensitive data streams.

Contribution

The paper presents practical solutions for privacy budget management and utility optimization in DP ML systems, including block composition and privacy-adaptive training.

Findings

01

Block composition enables endless training on sensitive data streams.

02

Privacy-adaptive training balances privacy and utility effectively.

03

Sage demonstrates practical DP ML deployment with high privacy guarantees.

Abstract

Companies increasingly expose machine learning (ML) models trained over sensitive user data to untrusted domains, such as end-user devices and wide-access model stores. We present Sage, a differentially private (DP) ML platform that bounds the cumulative leakage of training data through models. Sage builds upon the rich literature on DP ML algorithms and contributes pragmatic solutions to two of the most pressing systems challenges of global DP: running out of privacy budget and the privacy-utility tradeoff. To address the former, we develop block composition, a new privacy loss accounting method that leverages the growing database regime of ML workloads to keep training models endlessly on a sensitive data stream while enforcing a global DP guarantee for the stream. To address the latter, we develop privacy-adaptive training, a process that trains a model on growing amounts of data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.