Scaling Adversarial Training via Data Selection

Youran Ye; Dejin Wang; Ajinkya Bhandare

arXiv:2512.22069·cs.LG·December 29, 2025

Scaling Adversarial Training via Data Selection

Youran Ye, Dejin Wang, Ajinkya Bhandare

PDF

Open Access

TL;DR

This paper introduces Selective Adversarial Training, which reduces computational costs by focusing on critical samples during adversarial training, maintaining robustness while halving the computation.

Contribution

It proposes two principled sample selection criteria for efficient adversarial training, significantly reducing computation without sacrificing robustness.

Findings

01

Achieves comparable or better robustness than full PGD training.

02

Reduces adversarial computation by up to 50%.

03

Effective on MNIST and CIFAR-10 datasets.

Abstract

Projected Gradient Descent (PGD) is a strong and widely used first-order adversarial attack, yet its computational cost scales poorly, as all training samples undergo identical iterative inner-loop optimization despite contributing unequally to robustness. Motivated by this inefficiency, we propose \emph{Selective Adversarial Training}, which perturbs only a subset of critical samples in each minibatch. Specifically, we introduce two principled selection criteria: (1) margin-based sampling, which prioritizes samples near the decision boundary, and (2) gradient-matching sampling, which selects samples whose gradients align with the dominant batch optimization direction. Adversarial examples are generated only for the selected subset, while the remaining samples are trained cleanly using a mixed objective. Experiments on MNIST and CIFAR-10 show that the proposed methods achieve robustness…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Generative Adversarial Networks and Image Synthesis