Disentangle Sample Size and Initialization Effect on Perfect Generalization for Single-Neuron Target
Jiajie Zhao, Zhiwei Bai, Yaoyu Zhang

TL;DR
This paper investigates how initialization scale and sample size affect perfect generalization in overparameterized neural networks, using a single-neuron target as a simplified model to reveal critical thresholds and dynamics.
Contribution
It introduces the concept of the initial imbalance ratio and identifies sample size thresholds that govern target recovery, supported by theoretical proofs and empirical validation.
Findings
Smaller initialization improves generalization.
Identifies optimistic and separation sample size thresholds.
Recovery of the target function depends on sample size and initialization.
Abstract
Overparameterized models like deep neural networks have the intriguing ability to recover target functions with fewer sampled data points than parameters (see arXiv:2307.08921). To gain insights into this phenomenon, we concentrate on a single-neuron target recovery scenario, offering a systematic examination of how initialization and sample size influence the performance of two-layer neural networks. Our experiments reveal that a smaller initialization scale is associated with improved generalization, and we identify a critical quantity called the "initial imbalance ratio" that governs training dynamics and generalization under small initialization, supported by theoretical proofs. Additionally, we empirically delineate two critical thresholds in sample size--termed the "optimistic sample size" and the "separation sample size"--that align with the theoretical frameworks established by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Force Microscopy Techniques and Applications · Image Processing Techniques and Applications
MethodsSparse Evolutionary Training · ALIGN
