Topological Generalization Bounds for Discrete-Time Stochastic Optimization Algorithms
Rayna Andreeva, Benjamin Dupuis, Rik Sarkar, Tolga Birdal, Umut, \c{S}im\c{s}ekli

TL;DR
This paper introduces new topology-based complexity measures for discrete-time training trajectories of deep neural networks, providing reliable bounds on generalization error that outperform existing methods across various architectures and datasets.
Contribution
It develops computationally efficient, topology-based complexity measures tailored for discrete-time training data, offering provable bounds on generalization error without restrictive assumptions.
Findings
Complexity measures strongly correlate with generalization error.
Outperforms existing topological bounds on multiple datasets and architectures.
Applicable to diverse domains, tasks, and neural network architectures.
Abstract
We present a novel set of rigorous and computationally efficient topology-based complexity notions that exhibit a strong correlation with the generalization gap in modern deep neural networks (DNNs). DNNs show remarkable generalization properties, yet the source of these capabilities remains elusive, defying the established statistical learning theory. Recent studies have revealed that properties of training trajectories can be indicative of generalization. Building on this insight, state-of-the-art methods have leveraged the topology of these trajectories, particularly their fractal dimension, to quantify generalization. Most existing works compute this quantity by assuming continuous- or infinite-time training dynamics, complicating the development of practical estimators capable of accurately predicting generalization without access to test data. In this paper, we respect the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsTopological and Geometric Data Analysis · Image Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques
MethodsSparse Evolutionary Training
