Interpolation, extrapolation, and local generalization in common neural   networks

Laurent Bonnasse-Gahot

arXiv:2207.08648·cs.LG·July 19, 2022·1 cites

Interpolation, extrapolation, and local generalization in common neural networks

Laurent Bonnasse-Gahot

PDF

Open Access

TL;DR

This paper investigates neural network generalization, revealing that in an intrinsic low-dimensional space, test samples often lie within the convex hull of training data, indicating neural networks operate mainly in an interpolation regime rather than extrapolation.

Contribution

The study uncovers that neural network activities are low-dimensional and that test samples are often within the convex hull of training data in this intrinsic space, challenging the assumption of extrapolative behavior.

Findings

01

Neural activities form a low-dimensional intrinsic space.

02

Test samples mostly lie within the convex hull of training data in this space.

03

Proximity measures other than convex hull membership better predict performance.

Abstract

There has been a long history of works showing that neural networks have hard time extrapolating beyond the training set. A recent study by Balestriero et al. (2021) challenges this view: defining interpolation as the state of belonging to the convex hull of the training set, they show that the test set, either in input or neural space, cannot lie for the most part in this convex hull, due to the high dimensionality of the data, invoking the well known curse of dimensionality. Neural networks are then assumed to necessarily work in extrapolative mode. We here study the neural activities of the last hidden layer of typical neural networks. Using an autoencoder to uncover the intrinsic space underlying the neural activities, we show that this space is actually low-dimensional, and that the better the model, the lower the dimensionality of this intrinsic space. In this space, most samples…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Neural Networks and Applications

MethodsTest