NPAT Null-Space Projected Adversarial Training Towards Zero Deterioration
Hanyi Hu, Qiao Han, Kui Chen, Yao Yang

TL;DR
This paper introduces NPAT, a novel adversarial training method using null-space projection to enhance robustness against attacks with minimal impact on normal accuracy.
Contribution
It pioneers the use of null-space projection in adversarial training, proposing two algorithms (NPDA and NPGD) to improve robustness while preserving generalization.
Findings
Achieves comparable robustness to existing methods
Maintains high generalization performance
Effective on CIFAR10 and SVHN datasets
Abstract
To mitigate the susceptibility of neural networks to adversarial attacks, adversarial training has emerged as a prevalent and effective defense strategy. Intrinsically, this countermeasure incurs a trade-off, as it sacrifices the model's accuracy in processing normal samples. To reconcile the trade-off, we pioneer the incorporation of null-space projection into adversarial training and propose two innovative Null-space Projection based Adversarial Training(NPAT) algorithms tackling sample generation and gradient optimization, named Null-space Projected Data Augmentation (NPDA) and Null-space Projected Gradient Descent (NPGD), to search for an overarching optimal solutions, which enhance robustness with almost zero deterioration in generalization performance. Adversarial samples and perturbations are constrained within the null-space of the decision boundary utilizing a closed-form…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRadiation Detection and Scintillator Technologies · Nuclear Physics and Applications · CCD and CMOS Imaging Sensors
