TL;DR
FlowPure introduces a novel continuous normalizing flow-based method for adversarial purification, leveraging attack knowledge and stochastic training to outperform existing defenses while preserving accuracy.
Contribution
It proposes FlowPure, a continuous normalizing flow approach trained with Conditional Flow Matching, improving adversarial purification and detection over prior diffusion-based methods.
Findings
Outperforms state-of-the-art purification defenses on CIFAR datasets.
Preserves benign accuracy while effectively removing adversarial perturbations.
Achieves near-perfect detection of PGD adversarial samples.
Abstract
Despite significant advances in the area, adversarial robustness remains a critical challenge in systems employing machine learning models. The removal of adversarial perturbations at inference time, known as adversarial purification, has emerged as a promising defense strategy. To achieve this, state-of-the-art methods leverage diffusion models that inject Gaussian noise during a forward process to dilute adversarial perturbations, followed by a denoising step to restore clean samples before classification. In this work, we propose FlowPure, a novel purification method based on Continuous Normalizing Flows (CNFs) trained with Conditional Flow Matching (CFM) to learn mappings from adversarial examples to their clean counterparts. Unlike prior diffusion-based approaches that rely on fixed noise processes, FlowPure can leverage specific attack knowledge to improve robustness under known…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
