Nonlinear Transformations Against Unlearnable Datasets

Thushari Hapuarachchi; Jing Lin; Kaiqi Xiong; Mohamed Rahouti; Gitte; Ost

arXiv:2406.02883·cs.LG·June 6, 2024·1 cites

Nonlinear Transformations Against Unlearnable Datasets

Thushari Hapuarachchi, Jing Lin, Kaiqi Xiong, Mohamed Rahouti, Gitte, Ost

PDF

Open Access

TL;DR

This paper introduces a nonlinear transformation framework that significantly improves the ability of deep neural networks to learn from datasets previously considered unlearnable, challenging existing data protection methods.

Contribution

The study proposes a novel nonlinear transformation approach that outperforms linear techniques in breaking unlearnable datasets created by various data protection strategies.

Findings

01

Improved learning accuracy on unlearnable CIFAR10 datasets by up to 249.59%.

02

Achieved over 100% improvement for Autoregressive and REM approaches.

03

Demonstrated that current unlearnable data methods are insufficient for data protection.

Abstract

Automated scraping stands out as a common method for collecting data in deep learning models without the authorization of data owners. Recent studies have begun to tackle the privacy concerns associated with this data collection method. Notable approaches include Deepconfuse, error-minimizing, error-maximizing (also known as adversarial poisoning), Neural Tangent Generalization Attack, synthetic, autoregressive, One-Pixel Shortcut, Self-Ensemble Protection, Entangled Features, Robust Error-Minimizing, Hypocritical, and TensorClog. The data generated by those approaches, called "unlearnable" examples, are prevented "learning" by deep learning models. In this research, we investigate and devise an effective nonlinear transformation framework and conduct extensive experiments to demonstrate that a deep neural network can effectively learn from the data/examples traditionally considered…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications

MethodsDense Connections · Convolution · Q-Learning · Deep Q-Network · Random Ensemble Mixture