Scale-up Unlearnable Examples Learning with High-Performance Computing
Yanfan Zhu, Issac Lyngaas, Murali Gopalakrishnan Meena, Mary Ellen I., Koran, Bradley Malin, Daniel Moyer, Shunxing Bao, Anuj Kapadia, Xiao Wang,, Bennett Landman, Yuankai Huo

TL;DR
This paper explores scaling up Unlearnable Examples (UEs) using high-performance computing to improve data privacy in AI models, revealing how batch size impacts unlearnability across various datasets.
Contribution
It demonstrates the application of distributed data parallel training on supercomputers to enhance UE performance and investigates the effect of batch size on data unlearnability.
Findings
Optimal batch sizes vary across datasets for maximum unlearnability.
Large batch sizes can improve or destabilize UE performance depending on the dataset.
High-performance computing enables extensive experiments to refine data protection strategies.
Abstract
Recent advancements in AI models are structured to retain user interactions, which could inadvertently include sensitive healthcare data. In the healthcare field, particularly when radiologists use AI-driven diagnostic tools hosted on online platforms, there is a risk that medical imaging data may be repurposed for future AI training without explicit consent, spotlighting critical privacy and intellectual property concerns around healthcare data usage. Addressing these privacy challenges, a novel approach known as Unlearnable Examples (UEs) has been introduced, aiming to make data unlearnable to deep learning models. A prominent method within this area, called Unlearnable Clustering (UC), has shown improved UE performance with larger batch sizes but was previously limited by computational resources. To push the boundaries of UE performance with theoretically unlimited resources, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOnline Learning and Analytics · Intelligent Tutoring Systems and Adaptive Learning
