PurSAMERE: Reliable Adversarial Purification via Sharpness-Aware Minimization of Expected Reconstruction Error
Vinh Hoang, Sebastian Krumscheid, Holger Rauhut, Ra\'ul Tempone

TL;DR
PurSAMERE introduces a deterministic purification technique that enhances adversarial robustness by guiding inputs toward data distribution modes using sharpness-aware minimization of expected reconstruction error, outperforming existing methods.
Contribution
The paper presents a novel deterministic purification method employing sharpness-aware minimization to improve adversarial robustness, addressing limitations of stochastic approaches.
Findings
Significant robustness gains against strong white-box attacks.
Effective purification guided by expected reconstruction error minimization.
Theoretical proof of recovery of local density maximizers in small-noise limit.
Abstract
We propose a novel deterministic purification method to improve adversarial robustness by mapping a potentially adversarial sample toward a nearby sample that lies close to a mode of the data distribution, where classifiers are more reliable. We design the method to be deterministic to ensure reliable test accuracy and to prevent the degradation of effective robustness observed in stochastic purification approaches when the adversary has full knowledge of the system and its randomness. We employ a score model trained by minimizing the expected reconstruction error of noise-corrupted data, thereby learning the structural characteristics of the input data distribution. Given a potentially adversarial input, the method searches within its local neighborhood for a purified sample that minimizes the expected reconstruction error under noise corruption and then feeds this purified sample to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Physical Unclonable Functions (PUFs) and Hardware Security · Smart Grid Security and Resilience
