Generalization Bounds for Neural Networks via Approximate Description Length

Amit Daniely; Elad Granot

arXiv:1910.05697·cs.LG·October 15, 2019·5 cites

TL;DR

This paper establishes near-optimal sample complexity bounds for neural networks with bounded weights, introducing a new technique based on approximate descriptions to analyze generalization.

Contribution

It develops a novel method using approximate descriptions to derive tight sample complexity bounds for neural networks with bounded weights.

Findings

01

Sample complexity is $\tilde{O}(d R^2 / ϵ^2)$ for networks whose weight matrices have spectral norm $O(1)$ and Frobenius norm at most $R$.

02

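The bound remains valid when the norm constraints are placed on the deviation of each weight matrix from a reference matrix rather than on the weight matrix itself, which yields sample complexity that is sub-linear in the number of parameters.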

03

Introduces a new technique based on approximate descriptions for analyzing the sample complexity of neural network classes.

Abstract

We investigate the sample complexity of networks with bounds on the magnitude of their weights. In particular, we consider the class \[ H=\left\{W_t\circ\rho\circ \ldots\circ\rho\circ W_{1} :W_1,\ldots,W_{t-1}\in M_{d, d}, W_t\in M_{1,d}\right\} \] where the spectral norm of each $W_{i}$ is bounded by $O(1)$, the Frobenius norm is bounded by $R$, and $ρ$ is the sigmoid function $\frac{e^{x}}{1 + e^{x}}$ or the smoothened ReLU function $\ln(1 + e^{x})$. We show that for any depth $t$, if the inputs are in $[-1, 1]^{d}$, the sample complexity of $H$ is $\tilde{O}\left(\frac{d R^{2}}{ϵ^{2}}\right)$. This bound is optimal up to log-factors, and substantially improves over the previous state of the art of $\tilde{O}\left(\frac{d^{2} R^{2}}{ϵ^{2}}\right)$. We furthermore show that this bound remains valid if instead of considering the magnitude of the $W_{i}$'s, we consider the magnitude of $W_i -…
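As a rough illustration of the class $H$ and of the two rates quoted in the abstract, here is a minimal NumPy sketch. It is not code from the paper. The helper names (`random_member`, `in_class`, `rate_new`, `rate_prev`) are hypothetical, the `spectral_cap` parameter stands in for the unspecified $O(1)$ constant, and the rate functions drop all constants and log factors, so only their ratio is meaningful.

```python
import numpy as np

def softplus(x):
    # Smoothened ReLU ln(1 + e^x), computed in a numerically stable way.
    return np.logaddexp(0.0, x)

def sigmoid(x):
    # e^x / (1 + e^x), written as exp(-ln(1 + e^{-x})) for stability.
    return np.exp(-np.logaddexp(0.0, -x))

def forward(weights, x, rho=softplus):
    # Evaluates W_t ∘ ρ ∘ ... ∘ ρ ∘ W_1 at x; ρ is not applied after W_t.
    h = x
    for W in weights[:-1]:
        h = rho(W @ h)
    return weights[-1] @ h

def random_member(d, t, R, rng, spectral_cap=1.0):
    # Hypothetical helper: draws Gaussian matrices and shrinks each one so that
    # its spectral norm is <= spectral_cap and its Frobenius norm is <= R.
    shapes = [(d, d)] * (t - 1) + [(1, d)]
    weights = []
    for shape in shapes:
        W = rng.standard_normal(shape)
        scale = max(np.linalg.norm(W, 2) / spectral_cap,
                    np.linalg.norm(W, "fro") / R,
                    1.0)
        weights.append(W / scale)
    return weights

def in_class(weights, R, spectral_cap=1.0, tol=1e-9):
    # Checks the two norm constraints that define H (with an assumed O(1) cap).
    return all(np.linalg.norm(W, 2) <= spectral_cap + tol
               and np.linalg.norm(W, "fro") <= R + tol
               for W in weights)

def rate_new(d, R, eps):
    # d R^2 / eps^2, ignoring constants and log factors.
    return d * R**2 / eps**2

def rate_prev(d, R, eps):
    # d^2 R^2 / eps^2, ignoring constants and log factors.
    return d**2 * R**2 / eps**2

rng = np.random.default_rng(0)
d, t, R, eps = 8, 3, 2.0, 0.1
net = random_member(d, t, R, rng)
x = rng.uniform(-1.0, 1.0, size=d)   # inputs lie in [-1, 1]^d
y = forward(net, x)                  # array of shape (1,)
improvement = rate_prev(d, R, eps) / rate_new(d, R, eps)  # equals d
```

Under these simplifications the ratio of the previous rate to the new one is exactly the width $d$, which is the improvement the abstract describes.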

Taxonomy

Topics: Neural Networks and Applications · Machine Learning in Materials Science · Advanced Memory and Neural Computing
