ARMOR: Shielding Unlearnable Examples against Data Augmentation
Xueluan Gong, Yuji Wang, Yanjiao Chen, Haocheng Dong, Yiming Li,, Mengyuan Sun, Shuaike Li, Qian Wang, and Chen Chen

TL;DR
This paper reveals that data augmentation can compromise the privacy of unlearnable examples and proposes ARMOR, a defense framework that effectively preserves data privacy against augmentation-induced privacy breaches.
Contribution
We introduce ARMOR, a novel defense framework that protects unlearnable data from privacy breaches caused by data augmentation, using surrogate models and adaptive augmentation strategies.
Findings
Data augmentation significantly increases model accuracy on unlearnable examples.
ARMOR effectively reduces the model accuracy on protected data by up to 60%.
ARMOR outperforms six state-of-the-art defense methods across multiple datasets.
Abstract
Private data, when published online, may be collected by unauthorized parties to train deep neural networks (DNNs). To protect privacy, defensive noises can be added to original samples to degrade their learnability by DNNs. Recently, unlearnable examples are proposed to minimize the training loss such that the model learns almost nothing. However, raw data are often pre-processed before being used for training, which may restore the private information of protected data. In this paper, we reveal the data privacy violation induced by data augmentation, a commonly used data pre-processing technique to improve model generalization capability, which is the first of its kind as far as we are concerned. We demonstrate that data augmentation can significantly raise the accuracy of the model trained on unlearnable examples from 21.3% to 66.1%. To address this issue, we propose a defense…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Imaging Techniques and Applications · Parallel Computing and Optimization Techniques · Advanced Data Storage Technologies
