Directional Adversarial Training for Cost Sensitive Deep Learning Classification Applications
Matteo Terzi, Gian Antonio Susto, Pratik Chaudhari

TL;DR
This paper introduces WPGD, a novel adversarial training algorithm that enables cost-sensitive robustness and better manages the trade-off between accuracy and robustness in deep learning models.
Contribution
The paper proposes WPGD, an efficient method for directional adversarial training that controls robustness-accuracy trade-offs and addresses unbalanced datasets.
Findings
WPGD effectively balances robustness and accuracy in image recognition.
WPGD discovers directions requiring robustness, improving model resilience.
Adversarial training performance is affected by dataset imbalance.
Abstract
In many real-world applications of Machine Learning it is of paramount importance not only to provide accurate predictions, but also to ensure certain levels of robustness. Adversarial Training is a training procedure aiming at providing models that are robust to worst-case perturbations around predefined points. Unfortunately, one of the main issues in adversarial training is that robustness w.r.t. gradient-based attackers is always achieved at the cost of prediction accuracy. In this paper, a new algorithm, called Wasserstein Projected Gradient Descent (WPGD), for adversarial training is proposed. WPGD provides a simple way to obtain cost-sensitive robustness, resulting in a finer control of the robustness-accuracy trade-off. Moreover, WPGD solves an optimal transport problem on the output space of the network and it can efficiently discover directions where robustness is required,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
