Intrinsic Dimension, Persistent Homology and Generalization in Neural Networks
Tolga Birdal, Aaron Lou, Leonidas Guibas, Umut \c{S}im\c{s}ekli

TL;DR
This paper introduces a topological data analysis approach to estimate the intrinsic dimension of neural network training trajectories, providing insights into generalization that outperform existing methods in efficiency and applicability.
Contribution
It develops a novel, mathematically grounded TDA-based method to compute the persistent homology dimension, linking it to generalization error without extra assumptions.
Findings
Efficient estimation of intrinsic dimension in deep networks.
Persistent homology dimension correlates with generalization error.
Method outperforms existing approaches in various settings.
Abstract
Disobeying the classical wisdom of statistical learning theory, modern deep neural networks generalize well even though they typically contain millions of parameters. Recently, it has been shown that the trajectories of iterative optimization algorithms can possess fractal structures, and their generalization error can be formally linked to the complexity of such fractals. This complexity is measured by the fractal's intrinsic dimension, a quantity usually much smaller than the number of parameters in the network. Even though this perspective provides an explanation for why overparametrized networks would not overfit, computing the intrinsic dimension (e.g., for monitoring generalization during training) is a notoriously difficult task, where existing methods typically fail even in moderate ambient dimensions. In this study, we consider this problem from the lens of topological data…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsTopological and Geometric Data Analysis · Advanced Graph Neural Networks · Morphological variations and asymmetry
