More is Less: Inducing Sparsity via Overparameterization
Hung-Hsu Chou, Johannes Maly, Holger Rauhut

TL;DR
This paper demonstrates that overparameterized neural networks trained with gradient descent tend to find sparse solutions, improving understanding of implicit bias and sample complexity in compressed sensing.
Contribution
It introduces a novel analysis showing gradient flow converges to minimal -norm solutions in sparse recovery, enhancing theoretical understanding of overparameterization.
Findings
Gradient flow converges to -minimal solutions when reconstructing sparse vectors.
Overparameterization improves sample complexity in compressed sensing.
Theoretical predictions match numerical experiments.
Abstract
In deep learning it is common to overparameterize neural networks, that is, to use more parameters than training samples. Quite surprisingly training the neural network via (stochastic) gradient descent leads to models that generalize very well, while classical statistics would suggest overfitting. In order to gain understanding of this implicit bias phenomenon we study the special case of sparse recovery (compressed sensing) which is of interest on its own. More precisely, in order to reconstruct a vector from underdetermined linear measurements, we introduce a corresponding overparameterized square loss functional, where the vector to be reconstructed is deeply factorized into several vectors. We show that, if there exists an exact solution, vanilla gradient flow for the overparameterized loss functional converges to a good approximation of the solution of minimal -norm. The…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques · Medical Imaging Techniques and Applications
