Regression with Label Differential Privacy

Badih Ghazi; Pritish Kamath; Ravi Kumar; Ethan Leeman; Pasin; Manurangsi; Avinash V Varadarajan; Chiyuan Zhang

arXiv:2212.06074·cs.LG·October 6, 2023

Regression with Label Differential Privacy

Badih Ghazi, Pritish Kamath, Ravi Kumar, Ethan Leeman, Pasin, Manurangsi, Avinash V Varadarajan, Chiyuan Zhang

PDF

Open Access 1 Video

TL;DR

This paper introduces an optimal label differential privacy mechanism for regression models, utilizing a randomized response on bins approach, with an efficient algorithm and strong experimental validation.

Contribution

It proposes a novel optimal label DP mechanism for regression, based on a global prior and a randomized response on bins, with an efficient bin optimization algorithm.

Findings

01

The proposed mechanism is optimal under certain regression loss functions.

02

Experimental results show the method's effectiveness across multiple datasets.

03

The approach outperforms existing privacy-preserving regression techniques.

Abstract

We study the task of training regression models with the guarantee of label differential privacy (DP). Based on a global prior distribution on label values, which could be obtained privately, we derive a label DP randomization mechanism that is optimal under a given regression loss function. We prove that the optimal mechanism takes the form of a "randomized response on bins", and propose an efficient algorithm for finding the optimal bin values. We carry out a thorough experimental evaluation on several datasets demonstrating the efficacy of our algorithm.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Regression with Label Differential Privacy· slideslive

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques