LADDER: Latent Boundary-guided Adversarial Training
Xiaowei Zhou, Ivor W. Tsang, Jie Yin

TL;DR
LADDER introduces a novel adversarial training method that generates high-quality adversarial examples in the latent space guided by decision boundary normals, improving robustness without sacrificing standard accuracy.
Contribution
The paper proposes LADDER, a new adversarial training framework that creates boundary-guided adversarial examples in the latent space, enhancing robustness and accuracy trade-offs.
Findings
LADDER outperforms baseline models on multiple datasets.
It achieves a better balance between accuracy and robustness.
Generated adversarial examples are high-quality and boundary-aware.
Abstract
Deep Neural Networks (DNNs) have recently achieved great success in many classification tasks. Unfortunately, they are vulnerable to adversarial attacks that generate adversarial examples with a small perturbation to fool DNN models, especially in model sharing scenarios. Adversarial training is proved to be the most effective strategy that injects adversarial examples into model training to improve the robustness of DNN models against adversarial attacks. However, adversarial training based on the existing adversarial examples fails to generalize well to standard, unperturbed test data. To achieve a better trade-off between standard accuracy and adversarial robustness, we propose a novel adversarial training framework called LAtent bounDary-guided aDvErsarial tRaining (LADDER) that adversarially trains DNN models on latent boundary-guided adversarial examples. As opposed to most of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
MethodsTest · Support Vector Machine
