Provable Robustness of Adversarial Training for Learning Halfspaces with Noise
Difan Zou, Spencer Frei, Quanquan Gu

TL;DR
This paper provides theoretical guarantees for adversarial training methods in learning robust halfspaces under label noise, showing they achieve provable robustness bounds for different perturbation norms.
Contribution
It offers the first provable analysis of adversarial training for robust halfspaces with noise, establishing bounds for both convex and nonconvex loss functions.
Findings
Adversarial training achieves robust error bounds proportional to the optimal robust error.
Results apply to distributions with anti-concentration properties like log-concave distributions.
Nonconvex sigmoidal loss improves robustness bounds compared to standard convex loss.
Abstract
We analyze the properties of adversarial training for learning adversarially robust halfspaces in the presence of agnostic label noise. Denoting as the best robust classification error achieved by a halfspace that is robust to perturbations of balls of radius , we show that adversarial training on the standard binary cross-entropy loss yields adversarially robust halfspaces up to (robust) classification error for , and when . Our results hold for distributions satisfying anti-concentration properties enjoyed by log-concave isotropic distributions among others. We additionally show that if one instead uses a nonconvex sigmoidal loss, adversarial training yields halfspaces with an improved robust classification error…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Machine Learning and Algorithms · Integrated Circuits and Semiconductor Failure Analysis
