PureEBM: Universal Poison Purification via Mid-Run Dynamics of Energy-Based Models
Omead Pooladzandi, Jeffrey Jiang, Sunay Bhat, Gregory Pottie

TL;DR
PureEBM introduces a universal purification method using Langevin sampling of Energy-Based Models to defend classifiers against various data poisoning attacks with minimal impact on model generalization.
Contribution
The paper presents a novel universal data purification technique using EBMs and Langevin sampling, improving robustness against diverse poisoning attacks.
Findings
Achieves state-of-the-art defense performance against multiple poison types.
Maintains classifier generalization with minimal feature impact.
Effective even with poisoned EBM training data.
Abstract
Data poisoning attacks pose a significant threat to the integrity of machine learning models by leading to misclassification of target distribution data by injecting adversarial examples during training. Existing state-of-the-art (SoTA) defense methods suffer from limitations, such as significantly reduced generalization performance and significant overhead during training, making them impractical or limited for real-world applications. In response to this challenge, we introduce a universal data purification method that defends naturally trained classifiers from malicious white-, gray-, and black-box image poisons by applying a universal stochastic preprocessing step , realized by iterative Langevin sampling of a convergent Energy Based Model (EBM) initialized with an image Mid-run dynamics of purify poison information with minimal impact on features…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPlant-based Medicinal Research
MethodsFocus · energy-based model
