Quotient Geometry, Effective Curvature, and Implicit Bias in Simple Shallow Neural Networks
Hang-Cheng Dong, Pengcheng Cheng

TL;DR
This paper introduces a differential-geometric framework for analyzing shallow neural networks by considering the quotient space of parameters modulo symmetries, leading to intrinsic geometric insights and a better understanding of implicit bias.
Contribution
It develops a quotient-space approach to analyze the intrinsic geometry of shallow networks, removing representation artifacts and clarifying the role of symmetries in training dynamics and implicit bias.
Findings
Ambient flatness depends on parameter representation
Quotient-level curvature better explains local dynamics
Implicit bias is naturally described in quotient coordinates
Abstract
Overparameterized shallow neural networks admit substantial parameter redundancy: distinct parameter vectors may represent the same predictor due to hidden-unit permutations, rescalings, and related symmetries. As a result, geometric quantities computed directly in the ambient Euclidean parameter space can reflect artifacts of representation rather than intrinsic properties of the predictor. In this paper, we develop a differential-geometric framework for analyzing simple shallow networks through the quotient space obtained by modding out parameter symmetries on a regular set. We first characterize the symmetry and quotient structure of regular shallow-network parameters and show that the finite-sample realization map induces a natural metric on the quotient manifold. This leads to an effective notion of curvature that removes degeneracy along symmetry orbits and yields a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Advanced Graph Neural Networks · Neural Networks and Reservoir Computing
