Improving Clean Accuracy via a Tangent-Space Perspective on Adversarial Training
Bongsoo Yi, Rongjie Lai, Yao Li

TL;DR
This paper introduces TART, a novel adversarial training method that improves clean accuracy by leveraging tangent space geometry to better balance robustness and accuracy.
Contribution
TART is the first framework to explicitly incorporate tangent space concepts into adversarial training, enhancing clean accuracy without sacrificing robustness.
Findings
TART improves clean accuracy on benchmark datasets.
TART maintains robustness against adversarial attacks.
TART effectively exploits data manifold geometry.
Abstract
Adversarial training has proven effective in improving the robustness of deep neural networks against adversarial attacks. However, this enhanced robustness often comes at the cost of a substantial drop in accuracy on clean data. In this paper, we address this limitation by introducing Tangent Direction Guided Adversarial Training (TART), a novel method that enhances clean accuracy by exploiting the geometry of the data manifold. We argue that adversarial examples with large components in the normal direction can overly distort the decision boundary and degrade clean accuracy. TART addresses this issue by estimating the tangent direction of adversarial examples and adaptively modulating the perturbation bound based on the norm of their tangential component. To the best of our knowledge, TART is the first adversarial defense framework that explicitly incorporates the concept of tangent…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
