Nonlinear Transformations Against Unlearnable Datasets
Thushari Hapuarachchi, Jing Lin, Kaiqi Xiong, Mohamed Rahouti, Gitte, Ost

TL;DR
This paper introduces a nonlinear transformation framework that significantly improves the ability of deep neural networks to learn from datasets previously considered unlearnable, challenging existing data protection methods.
Contribution
The study proposes a novel nonlinear transformation approach that outperforms linear techniques in breaking unlearnable datasets created by various data protection strategies.
Findings
Improved learning accuracy on unlearnable CIFAR10 datasets by up to 249.59%.
Achieved over 100% improvement for Autoregressive and REM approaches.
Demonstrated that current unlearnable data methods are insufficient for data protection.
Abstract
Automated scraping stands out as a common method for collecting data in deep learning models without the authorization of data owners. Recent studies have begun to tackle the privacy concerns associated with this data collection method. Notable approaches include Deepconfuse, error-minimizing, error-maximizing (also known as adversarial poisoning), Neural Tangent Generalization Attack, synthetic, autoregressive, One-Pixel Shortcut, Self-Ensemble Protection, Entangled Features, Robust Error-Minimizing, Hypocritical, and TensorClog. The data generated by those approaches, called "unlearnable" examples, are prevented "learning" by deep learning models. In this research, we investigate and devise an effective nonlinear transformation framework and conduct extensive experiments to demonstrate that a deep neural network can effectively learn from the data/examples traditionally considered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications
MethodsDense Connections · Convolution · Q-Learning · Deep Q-Network · Random Ensemble Mixture
