Intrinsic Dimension, Persistent Homology and Generalization in Neural   Networks

Tolga Birdal; Aaron Lou; Leonidas Guibas; Umut \c{S}im\c{s}ekli

arXiv:2111.13171·cs.LG·November 29, 2021

Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks

Tolga Birdal, Aaron Lou, Leonidas Guibas, Umut \c{S}im\c{s}ekli

PDF

Open Access 2 Repos 1 Video

TL;DR

This paper introduces a topological data analysis approach to estimate the intrinsic dimension of neural network training trajectories, providing insights into generalization that outperform existing methods in efficiency and applicability.

Contribution

It develops a novel, mathematically grounded TDA-based method to compute the persistent homology dimension, linking it to generalization error without extra assumptions.

Findings

01

Efficient estimation of intrinsic dimension in deep networks.

02

Persistent homology dimension correlates with generalization error.

03

Method outperforms existing approaches in various settings.

Abstract

Disobeying the classical wisdom of statistical learning theory, modern deep neural networks generalize well even though they typically contain millions of parameters. Recently, it has been shown that the trajectories of iterative optimization algorithms can possess fractal structures, and their generalization error can be formally linked to the complexity of such fractals. This complexity is measured by the fractal's intrinsic dimension, a quantity usually much smaller than the number of parameters in the network. Even though this perspective provides an explanation for why overparametrized networks would not overfit, computing the intrinsic dimension (e.g., for monitoring generalization during training) is a notoriously difficult task, where existing methods typically fail even in moderate ambient dimensions. In this study, we consider this problem from the lens of topological data…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks· slideslive

Taxonomy

TopicsTopological and Geometric Data Analysis · Advanced Graph Neural Networks · Morphological variations and asymmetry