PureGen: Universal Data Purification for Train-Time Poison Defense via Generative Model Dynamics
Sunay Bhat, Jeffrey Jiang, Omead Pooladzandi, Alexander Branch,, Gregory Pottie

TL;DR
PureGen introduces universal data purification techniques using generative models like EBMs and DDPMs to defend against train-time data poisoning attacks, maintaining high classification accuracy without attack-specific tuning.
Contribution
The paper proposes a novel universal data purification method employing stochastic transforms via generative models, offering robust defense against various poisoning attacks without significant performance loss.
Findings
State-of-the-art defense against multiple poisoning attacks
Effective on CIFAR-10, Tiny-ImageNet, CINIC-10 datasets
Minimal impact on classifier generalization
Abstract
Train-time data poisoning attacks threaten machine learning models by introducing adversarial examples during training, leading to misclassification. Current defense methods often reduce generalization performance, are attack-specific, and impose significant training overhead. To address this, we introduce a set of universal data purification methods using a stochastic transform, , realized via iterative Langevin dynamics of Energy-Based Models (EBMs), Denoising Diffusion Probabilistic Models (DDPMs), or both. These approaches purify poisoned data with minimal impact on classifier generalization. Our specially trained EBMs and DDPMs provide state-of-the-art defense against various attacks (including Narcissus, Bullseye Polytope, Gradient Matching) on CIFAR-10, Tiny-ImageNet, and CINIC-10, without needing attack or classifier-specific information. We discuss performance…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsInsect Pheromone Research and Control
MethodsSparse Evolutionary Training · Diffusion
