A Generalization Bound for Nearly-Linear Networks

Eugene Golikov

arXiv:2407.06765·cs.LG·July 10, 2024

A Generalization Bound for Nearly-Linear Networks

Eugene Golikov

PDF

Open Access

TL;DR

This paper introduces a priori generalization bounds for nearly-linear neural networks, which are non-vacuous and do not require training data, advancing theoretical understanding of neural network generalization.

Contribution

It presents the first non-vacuous a-priori generalization bounds for neural networks close to linear, based on their perturbation from linearity.

Findings

01

Bounds are non-vacuous for nearly-linear networks

02

Bounds do not require actual training data for evaluation

03

First such bounds for this class of neural networks

Abstract

We consider nonlinear networks as perturbations of linear ones. Based on this approach, we present novel generalization bounds that become non-vacuous for networks that are close to being linear. The main advantage over the previous works which propose non-vacuous generalization bounds is that our bounds are a-priori: performing the actual training is not required for evaluating the bounds. To the best of our knowledge, they are the first non-vacuous generalization bounds for neural nets possessing this property.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGraph theory and applications · graph theory and CDMA systems