Adversarial Training on Purification (AToP): Advancing Both Robustness   and Generalization

Guang Lin; Chao Li; Jianhai Zhang; Toshihisa Tanaka; Qibin Zhao

arXiv:2401.16352·cs.CV·August 26, 2024·2 cites

Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization

Guang Lin, Chao Li, Jianhai Zhang, Toshihisa Tanaka, Qibin Zhao

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces AToP, a novel adversarial training pipeline combining random transforms and fine-tuning to improve both robustness and generalization of neural networks against unseen adversarial attacks.

Contribution

The paper proposes a new pipeline, AToP, that integrates perturbation destruction and adversarial fine-tuning to enhance robustness and generalization simultaneously.

Findings

01

Achieves optimal robustness on CIFAR datasets.

02

Demonstrates strong generalization to unseen attacks.

03

Outperforms existing defense methods in experiments.

Abstract

The deep neural networks are known to be vulnerable to well-designed adversarial attacks. The most successful defense technique based on adversarial training (AT) can achieve optimal robustness against particular attacks but cannot generalize well to unseen attacks. Another effective defense technique based on adversarial purification (AP) can enhance generalization but cannot achieve optimal robustness. Meanwhile, both methods share one common limitation on the degraded standard accuracy. To mitigate these issues, we propose a novel pipeline to acquire the robust purifier model, named Adversarial Training on Purification (AToP), which comprises two components: perturbation destruction by random transforms (RT) and purifier model fine-tuned (FT) by adversarial loss. RT is essential to avoid overlearning to known attacks, resulting in the robustness generalization to unseen attacks, and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

glin2022/atop
pytorchOfficial

Videos

Adversarial Training on Purification (AToP): Advancing Both Robustness and Generalization· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications