Adversarial Training with Generated Data in High-Dimensional Regression:   An Asymptotic Study

Yue Xing

arXiv:2306.12582·stat.ML·June 23, 2023

Adversarial Training with Generated Data in High-Dimensional Regression: An Asymptotic Study

Yue Xing

PDF

Open Access

TL;DR

This paper provides a theoretical analysis of adversarial training with generated data in high-dimensional linear regression, revealing improved performance with regularization and offering a practical cross-validation shortcut.

Contribution

It offers the first asymptotic analysis of two-stage adversarial training with generated data in high-dimensional settings, highlighting the benefits of regularization.

Findings

01

Double-descent phenomenon observed in ridgeless training.

02

Regularization improves adversarial training performance.

03

Derived a shortcut cross-validation formula for the method.

Abstract

In recent years, studies such as \cite{carmon2019unlabeled,gowal2021improving,xing2022artificial} have demonstrated that incorporating additional real or generated data with pseudo-labels can enhance adversarial training through a two-stage training approach. In this paper, we perform a theoretical analysis of the asymptotic behavior of this method in high-dimensional linear regression. While a double-descent phenomenon can be observed in ridgeless training, with an appropriate $L_{2}$ regularization, the two-stage adversarial training achieves a better performance. Finally, we derive a shortcut cross-validation formula specifically tailored for the two-stage training method.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAnomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning · Spectroscopy Techniques in Biomedical and Chemical Research