The Local Learning Coefficient: A Singularity-Aware Complexity Measure

Edmund Lau; Zach Furman; George Wang; Daniel Murfet; Susan Wei

arXiv:2308.12108·stat.ML·October 2, 2024

The Local Learning Coefficient: A Singularity-Aware Complexity Measure

Edmund Lau, Zach Furman, George Wang, Daniel Murfet, Susan Wei

PDF

Open Access 2 Repos

TL;DR

The paper introduces the Local Learning Coefficient (LLC), a new complexity measure for deep neural networks that accounts for singularities in the loss landscape, providing insights into model complexity and training heuristics.

Contribution

It defines and explores the LLC based on Singular Learning Theory, proposes a scalable estimator, and demonstrates its application across various neural network architectures.

Findings

01

LLC offers insights into DNN complexity and training effects.

02

The estimator scales to large models like ResNets and transformers.

03

LLC helps reconcile deep learning complexity with parsimony principles.

Abstract

The Local Learning Coefficient (LLC) is introduced as a novel complexity measure for deep neural networks (DNNs). Recognizing the limitations of traditional complexity measures, the LLC leverages Singular Learning Theory (SLT), which has long recognized the significance of singularities in the loss landscape geometry. This paper provides an extensive exploration of the LLC's theoretical underpinnings, offering both a clear definition and intuitive insights into its application. Moreover, we propose a new scalable estimator for the LLC, which is then effectively applied across diverse architectures including deep linear networks up to 100M parameters, ResNet image models, and transformer language models. Empirical evidence suggests that the LLC provides valuable insights into how training heuristics might influence the effective complexity of DNNs. Ultimately, the LLC emerges as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Statistical Mechanics and Entropy · Model Reduction and Neural Networks