Practical Convex Formulation of Robust One-hidden-layer Neural Network   Training

Yatong Bai; Tanmay Gautam; Yu Gai; Somayeh Sojoudi

arXiv:2105.12237·cs.LG·May 27, 2021·1 cites

Practical Convex Formulation of Robust One-hidden-layer Neural Network Training

Yatong Bai, Tanmay Gautam, Yu Gai, Somayeh Sojoudi

PDF

Open Access

TL;DR

This paper introduces a convex optimization approach for training robust one-hidden-layer neural networks efficiently, providing improved adversarial robustness and performance over existing methods like FGSM and PGD.

Contribution

It develops a scalable convex formulation for robust neural network training and proposes a stochastic approximation method that significantly reduces computational complexity.

Findings

01

The method achieves better adversarial robustness than existing techniques.

02

It provides an efficient convex optimization framework for training neural networks.

03

Experimental results show improved performance in binary classification and regression tasks.

Abstract

Recent work has shown that the training of a one-hidden-layer, scalar-output fully-connected ReLU neural network can be reformulated as a finite-dimensional convex program. Unfortunately, the scale of such a convex program grows exponentially in data size. In this work, we prove that a stochastic procedure with a linear complexity well approximates the exact formulation. Moreover, we derive a convex optimization approach to efficiently solve the "adversarial training" problem, which trains neural networks that are robust to adversarial input perturbations. Our method can be applied to binary classification and regression, and provides an alternative to the current adversarial training methods, such as Fast Gradient Sign Method (FGSM) and Projected Gradient Descent (PGD). We demonstrate in experiments that the proposed method achieves a noticeably better adversarial robustness and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Stochastic Gradient Optimization Techniques