Generalization Bounds for Neural Networks via Approximate Description Length
Amit Daniely, Elad Granot

TL;DR
This paper establishes near-optimal sample complexity bounds for neural networks with bounded weights, introducing a new technique based on approximate descriptions to analyze generalization.
Contribution
It develops a novel method using approximate descriptions to derive tight sample complexity bounds for neural networks with bounded weights.
Findings
Sample complexity is O(d R^2 / ^2) for networks with bounded spectral and Frobenius norms.
The bounds are robust when considering deviations from reference matrices, leading to sub-linear parameter dependence.
Introduces a new technique based on approximate descriptions for analyzing the sample complexity of neural network classes.
Abstract
We investigate the sample complexity of networks with bounds on the magnitude of its weights. In particular, we consider the class \[ H=\left\{W_t\circ\rho\circ \ldots\circ\rho\circ W_{1} :W_1,\ldots,W_{t-1}\in M_{d, d}, W_t\in M_{1,d}\right\} \] where the spectral norm of each is bounded by , the Frobenius norm is bounded by , and is the sigmoid function or the smoothened ReLU function . We show that for any depth , if the inputs are in , the sample complexity of is . This bound is optimal up to log-factors, and substantially improves over the previous state of the art of . We furthermore show that this bound remains valid if instead of considering the magnitude of the 's, we consider the magnitude of $W_i -…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Machine Learning in Materials Science · Advanced Memory and Neural Computing
Methods*Communicated@Fast*How Do I Communicate to Expedia?
