Recovery Guarantees for One-hidden-layer Neural Networks
Kai Zhong, Zhao Song, Prateek Jain, Peter L. Bartlett, Inderjit S., Dhillon

TL;DR
This paper establishes theoretical guarantees for recovering the parameters of one-hidden-layer neural networks using tensor initialization and gradient descent, with guarantees on sample and computational complexity for various activation functions.
Contribution
It provides the first recovery guarantees for 1-hidden-layer neural networks with sample complexity and computational complexity linear in input dimension and logarithmic in precision.
Findings
Tensor methods initialize parameters within the strong convexity region.
Gradient descent converges locally for smooth activations.
Sample complexity is proportional to input dimension and log of inverse precision.
Abstract
In this paper, we consider regression problems with one-hidden-layer neural networks (1NNs). We distill some properties of activation functions that lead to in the neighborhood of the ground-truth parameters for the 1NN squared-loss objective. Most popular nonlinear activation functions satisfy the distilled properties, including rectified linear units (ReLUs), leaky ReLUs, squared ReLUs and sigmoids. For activation functions that are also smooth, we show guarantees of gradient descent under a resampling rule. For homogeneous activations, we show tensor methods are able to initialize the parameters to fall into the local strong convexity region. As a result, tensor initialization followed by gradient descent is guaranteed to recover the ground truth with sample complexity $ d \cdot \log(1/\epsilon) \cdot…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning and ELM · Stochastic Gradient Optimization Techniques
