Deep Learning is Singular, and That's Good

Daniel Murfet; Susan Wei; Mingming Gong; Hui Li; Jesse Gell-Redman,; Thomas Quella

arXiv:2010.11560·cs.LG·December 5, 2023

Deep Learning is Singular, and That's Good

Daniel Murfet, Susan Wei, Mingming Gong, Hui Li, Jesse Gell-Redman,, Thomas Quella

PDF

Open Access 1 Repo

TL;DR

This paper discusses the significance of singular models in deep learning, highlighting how their unique properties challenge traditional inference methods and proposing singular learning theory as a promising framework for understanding neural networks.

Contribution

It introduces the relevance of singular learning theory to deep learning, combining theoretical insights and experiments to motivate its application in practice.

Findings

01

Neural networks are singular models with complex parameter spaces.

02

Classical inference methods are unsuitable for singular models.

03

Singular learning theory offers a promising framework for deep learning understanding.

Abstract

In singular models, the optimal set of parameters forms an analytic set with singularities and classical statistical inference cannot be applied to such models. This is significant for deep learning as neural networks are singular and thus "dividing" by the determinant of the Hessian or employing the Laplace approximation are not appropriate. Despite its potential for addressing fundamental issues in deep learning, singular learning theory appears to have made little inroads into the developing canon of deep learning theory. Via a mix of theory and experiment, we present an invitation to singular learning theory as a vehicle for understanding deep learning and suggest important future work to make singular learning theory directly applicable to how deep learning is performed in practice.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

susanwe/RLCT
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Mechanics and Entropy · Model Reduction and Neural Networks · Stochastic Gradient Optimization Techniques