DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning

Wenxuan Bao; Francesco Pittaluga; Vijay Kumar B G; Vincent Bindschaedler

arXiv:2311.01295 · cs.LG · November 3, 2023 · 1 citation

TL;DR

This paper introduces DP-Mix, a pair of data augmentation methods tailored to differentially private learning. DP-Mix_Self applies mixup to self-augmented data, and DP-Mix_Diff additionally incorporates synthetic data from diffusion models; both improve model performance over existing private training approaches.

Contribution

The paper proposes two new mixup-based data augmentation techniques designed specifically for differentially private learning, addressing the incompatibility of traditional multi-sample augmentation methods with the per-example contribution bound that private training assumes.

Findings

1. DP-Mix_Self achieves state-of-the-art results across datasets.

2. DP-Mix_Diff further enhances performance using synthetic data.

3. Both methods outperform existing private learning approaches.

Abstract

Data augmentation techniques, such as simple image transformations and combinations, are highly effective at improving the generalization of computer vision models, especially when training data is limited. However, such techniques are fundamentally incompatible with differentially private learning approaches, due to the latter's built-in assumption that each training image's contribution to the learned model is bounded. In this paper, we investigate why naive applications of multi-sample data augmentation techniques, such as mixup, fail to achieve good performance and propose two novel data augmentation techniques specifically designed for the constraints of differentially private learning. Our first technique, DP-Mix_Self, achieves SoTA classification performance across a range of datasets and settings by performing mixup on self-augmented data. Our second technique, DP-Mix_Diff,…
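
To make the idea of "mixup on self-augmented data" concrete, the following is a minimal NumPy sketch, not the authors' implementation. It assumes that every mixed sample is built only from augmented views of one training image, so the mixed batch still belongs to a single example and a per-example gradient clip can still bound that example's contribution. The function names (`self_augment`, `mixup_self`, `clip_to_norm`), the flip-and-shift augmentation, and the parameters `k_base`, `k_mix`, `alpha`, and `pad` are all illustrative assumptions rather than values taken from the paper.

```python
import numpy as np


def self_augment(image, rng, pad=2):
    """Hypothetical augmentation: random horizontal flip plus a random shift.

    `image` is an (H, W, C) float array. The shift is done by cropping an
    H x W window out of a reflect-padded copy of the image.
    """
    h, w = image.shape[:2]
    out = image[:, ::-1] if rng.random() < 0.5 else image
    padded = np.pad(out, ((pad, pad), (pad, pad), (0, 0)), mode="reflect")
    top, left = rng.integers(0, 2 * pad + 1, size=2)
    return padded[top:top + h, left:left + w]


def mixup_self(image, rng, k_base=4, k_mix=4, alpha=1.0):
    """Build k_base augmented views of ONE image plus k_mix mixups of those views.

    Because every output is derived from the same training example, all of
    them share that example's label and no other example is involved.
    """
    views = [self_augment(image, rng) for _ in range(k_base)]
    mixed = []
    for _ in range(k_mix):
        i, j = rng.choice(k_base, size=2, replace=False)
        lam = rng.beta(alpha, alpha)  # mixing weight in (0, 1)
        mixed.append(lam * views[i] + (1.0 - lam) * views[j])
    return np.stack(views + mixed)


def clip_to_norm(grad, max_norm):
    """Scale a per-example gradient so its L2 norm is at most max_norm."""
    norm = np.linalg.norm(grad)
    return grad * min(1.0, max_norm / (norm + 1e-12))


rng = np.random.default_rng(0)
image = rng.random((8, 8, 3))    # stand-in for one training image
batch = mixup_self(image, rng)   # shape (k_base + k_mix, 8, 8, 3)
```

In a DP-SGD style training loop, one would presumably average the gradients computed over `batch` into a single per-example gradient, pass it through `clip_to_norm`, and add noise after summing across examples. Mixing views of two different training images would instead let one image influence several per-example gradients, which is the kind of incompatibility the abstract describes.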

Videos

DP-Mix: Mixup-based Data Augmentation for Differentially Private Learning (SlidesLive)

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Machine Learning and ELM · Machine Learning and Algorithms