Geometry and Local Recovery of Global Minima of Two-layer Neural Networks at Overparameterization

Leyang Zhang; Yaoyu Zhang; Tao Luo

arXiv:2309.00508 · cs.LG · April 11, 2025

Open Access

TL;DR

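This paper studies the geometry of the loss landscape of two-layer neural networks near global minima. It shows that global minima with zero generalization error become geometrically separated from other global minima as the sample size grows, and that gradient flow converges locally at a quantifiable rate.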

Contribution

It introduces novel techniques to analyze the geometry of global minima and demonstrates local recoverability of two-layer neural networks in overparameterized regimes.

Findings

01. Global minima with zero generalization error become geometrically separated from other global minima as the sample size grows
02. Gradient flow converges locally, with a quantifiable rate
03. Two-layer neural networks can be locally recovered in the overparameterized regime

Abstract

Under mild assumptions, we investigate the geometry of the loss landscape for two-layer neural networks in the vicinity of global minima. Utilizing novel techniques, we demonstrate: (i) how global minima with zero generalization error become geometrically separated from other global minima as the sample size grows; and (ii) the local convergence properties and rate of gradient flow dynamics. Our results indicate that two-layer neural networks can be locally recovered in the regime of overparameterization.
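The local-recovery claim can be illustrated with a small teacher-student experiment. The sketch below is not the paper's construction. It assumes a tanh activation, a one-neuron teacher, a wider student initialized near a global minimum, and plain gradient descent as a discretization of gradient flow. All names (`forward`, `loss_and_grads`, `run`) and hyperparameters are hypothetical choices made for illustration.

```python
import numpy as np

def forward(a, W, X):
    # Two-layer network: f(x) = sum_k a_k * tanh(w_k . x)
    return np.tanh(X @ W.T) @ a

def loss_and_grads(a, W, X, y):
    # Mean squared loss (with factor 1/2) and its gradients in a and W.
    n = X.shape[0]
    H = np.tanh(X @ W.T)                 # hidden activations, shape (n, m)
    r = H @ a - y                        # residuals, shape (n,)
    loss = 0.5 * np.mean(r ** 2)
    grad_a = H.T @ r / n                 # shape (m,)
    # d tanh(z)/dz = 1 - tanh(z)^2, chained through the output weight a_k
    G = r[:, None] * (1.0 - H ** 2) * a[None, :]
    grad_W = G.T @ X / n                 # shape (m, d)
    return loss, grad_a, grad_W

def run(seed=0, n=200, d=3, m=4, lr=0.05, steps=2000, eps=0.05):
    rng = np.random.default_rng(seed)
    X = rng.normal(size=(n, d))
    w_star = rng.normal(size=d)
    y = np.tanh(X @ w_star)              # assumed teacher: one neuron, output weight 1

    # One global minimum of the wider student: m copies of the teacher
    # neuron whose output weights sum to 1. Start from a small perturbation.
    W = np.tile(w_star, (m, 1)) + eps * rng.normal(size=(m, d))
    a = np.full(m, 1.0 / m) + eps * rng.normal(size=m)

    losses = []
    for _ in range(steps):
        loss, grad_a, grad_W = loss_and_grads(a, W, X, y)
        losses.append(loss)
        a -= lr * grad_a                 # explicit Euler step of gradient flow
        W -= lr * grad_W
    losses.append(loss_and_grads(a, W, X, y)[0])
    return np.array(losses), a, W

if __name__ == "__main__":
    losses, a, W = run()
    print("initial loss:", losses[0])
    print("final loss:  ", losses[-1])
```

Running the script prints the training loss before and after optimization. Plotting `losses` on a log scale gives a rough empirical view of the local convergence rate in this toy setting. It says nothing about the rates proved in the paper.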

Taxonomy

Topics: Neural Networks and Applications · Machine Learning and ELM · Model Reduction and Neural Networks