A Geometric Modeling of Occam's Razor in Deep Learning
Ke Sun, Frank Nielsen

TL;DR
This paper introduces a geometric information-theoretic approach to explain why deep neural networks perform well despite their high parameter count, linking simplicity to better generalization through manifold complexity analysis.
Contribution
It proposes a novel geometric framework using singular semi-Riemannian geometry to analyze the complexity of DNNs via the Fisher information matrix.
Findings
Derives complexity measures based on singularity analysis of the Fisher information matrix.
Provides bounds on model complexity that explain DNNs' generalization capabilities.
Links the simplicity of models to their effective description length.
Abstract
Why do deep neural networks (DNNs) benefit from very high dimensional parameter spaces? Their huge parameter complexities vs stunning performance in practice is all the more intriguing and not explainable using the standard theory of model selection for regular models. In this work, we propose a geometrically flavored information-theoretic approach to study this phenomenon. With the belief that simplicity is linked to better generalization, as grounded in the theory of minimum description length, the objective of our analysis is to examine and bound the complexity of DNNs. We introduce the locally varying dimensionality of the parameter space of neural network models by considering the number of significant dimensions of the Fisher information matrix, and model the parameter space as a manifold using the framework of singular semi-Riemannian geometry. We derive model complexity measures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopological and Geometric Data Analysis · Morphological variations and asymmetry · Neural Networks and Applications
