Reliably fast adversarial training via latent adversarial perturbation
Geon Yeong Park, Sang Wan Lee

TL;DR
This paper introduces SLAT, a single-step latent adversarial training method that improves reliability and efficiency in adversarial training by perturbing latent representations, outperforming existing methods.
Contribution
SLAT is a novel single-step adversarial training approach that perturbs latent features using gradient information, enhancing reliability without extra computational cost.
Findings
SLAT outperforms state-of-the-art accelerated adversarial training methods.
SLAT maintains similar computational cost to fast gradient sign method.
SLAT improves local linearity and robustness in adversarial training.
Abstract
While multi-step adversarial training is widely popular as an effective defense method against strong adversarial attacks, its computational cost is notoriously expensive, compared to standard training. Several single-step adversarial training methods have been proposed to mitigate the above-mentioned overhead cost; however, their performance is not sufficiently reliable depending on the optimization setting. To overcome such limitations, we deviate from the existing input-space-based adversarial training regime and propose a single-step latent adversarial training method (SLAT), which leverages the gradients of latent representation as the latent adversarial perturbation. We demonstrate that the L1 norm of feature gradients is implicitly regularized through the adopted latent perturbation, thereby recovering local linearity and ensuring reliable performance, compared to the existing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · High-Velocity Impact and Material Behavior
