DP-UTIL: Comprehensive Utility Analysis of Differential Privacy in Machine Learning
Ismat Jarin, Birhanu Eshete

TL;DR
DP-UTIL provides a comprehensive framework for analyzing the impact of various differential privacy perturbation techniques on machine learning model utility and privacy leakage across different datasets and models.
Contribution
This paper introduces DP-UTIL, a holistic utility analysis framework for differential privacy in machine learning, covering five perturbation spots and enabling comparative analysis.
Findings
Prediction perturbation yields lowest utility loss across models and datasets.
Objective perturbation results in lowest privacy leakage for logistic regression.
Gradient perturbation results in lowest privacy leakage for deep neural networks.
Abstract
Differential Privacy (DP) has emerged as a rigorous formalism to reason about quantifiable privacy leakage. In machine learning (ML), DP has been employed to limit inference/disclosure of training examples. Prior work leveraged DP across the ML pipeline, albeit in isolation, often focusing on mechanisms such as gradient perturbation. In this paper, we present, DP-UTIL, a holistic utility analysis framework of DP across the ML pipeline with focus on input perturbation, objective perturbation, gradient perturbation, output perturbation, and prediction perturbation. Given an ML task on privacy-sensitive data, DP-UTIL enables a ML privacy practitioner perform holistic comparative analysis on the impact of DP in these five perturbation spots, measured in terms of model utility loss, privacy leakage, and the number of truly revealed training samples. We evaluate DP-UTIL over classification…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning
MethodsLogistic Regression
