DP-UTIL: Comprehensive Utility Analysis of Differential Privacy in   Machine Learning

Ismat Jarin; Birhanu Eshete

arXiv:2112.12998·cs.CR·December 28, 2021

DP-UTIL: Comprehensive Utility Analysis of Differential Privacy in Machine Learning

Ismat Jarin, Birhanu Eshete

PDF

Open Access 1 Repo

TL;DR

DP-UTIL provides a comprehensive framework for analyzing the impact of various differential privacy perturbation techniques on machine learning model utility and privacy leakage across different datasets and models.

Contribution

This paper introduces DP-UTIL, a holistic utility analysis framework for differential privacy in machine learning, covering five perturbation spots and enabling comparative analysis.

Findings

01

Prediction perturbation yields lowest utility loss across models and datasets.

02

Objective perturbation results in lowest privacy leakage for logistic regression.

03

Gradient perturbation results in lowest privacy leakage for deep neural networks.

Abstract

Differential Privacy (DP) has emerged as a rigorous formalism to reason about quantifiable privacy leakage. In machine learning (ML), DP has been employed to limit inference/disclosure of training examples. Prior work leveraged DP across the ML pipeline, albeit in isolation, often focusing on mechanisms such as gradient perturbation. In this paper, we present, DP-UTIL, a holistic utility analysis framework of DP across the ML pipeline with focus on input perturbation, objective perturbation, gradient perturbation, output perturbation, and prediction perturbation. Given an ML task on privacy-sensitive data, DP-UTIL enables a ML privacy practitioner perform holistic comparative analysis on the impact of DP in these five perturbation spots, measured in terms of model utility loss, privacy leakage, and the number of truly revealed training samples. We evaluate DP-UTIL over classification…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

um-dsp/DP-UTIL
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning

MethodsLogistic Regression