Spring-block theory of feature learning in deep neural networks

Cheng Shi; Liming Pan; Ivan Dokmani\'c

arXiv:2407.19353·cond-mat.dis-nn·June 30, 2025

Spring-block theory of feature learning in deep neural networks

Cheng Shi, Liming Pan, Ivan Dokmani\'c

PDF

Open Access 1 Repo

TL;DR

This paper introduces a phase diagram and a mechanical theory to explain how deep neural networks learn features, linking layer dynamics to generalization performance.

Contribution

It presents a novel noise-nonlinearity phase diagram and a macroscopic mechanical theory that elucidates feature learning across layers in deep networks.

Findings

01

Identifies regimes where shallow or deep layers learn more effectively.

02

Links feature learning to generalization through a mechanical theory.

03

Reproduces the phase diagram with the proposed theory.

Abstract

Feature-learning deep nets progressively collapse data to a regular low-dimensional geometry. How this emerges from the collective action of nonlinearity, noise, learning rate, and other factors, has eluded first-principles theories built from microscopic neuronal dynamics. We exhibit a noise-nonlinearity phase diagram that identifies regimes where shallow or deep layers learn more effectively and propose a macroscopic mechanical theory that reproduces the diagram and links feature learning across layers to generalization.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

DaDaCheng/DNN_Spring
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications