Spatially heterogeneous learning by a deep student machine

Hajime Yoshino

arXiv:2302.07419·cond-mat.dis-nn·August 1, 2023

Spatially heterogeneous learning by a deep student machine

Hajime Yoshino

PDF

Open Access

TL;DR

This paper uses a statistical mechanics approach to analyze deep neural networks, revealing heterogeneous learning patterns and showing that generalization ability persists even in heavily over-parameterized, deep regimes.

Contribution

It provides an exact solution in the dense limit for deep neural networks and explores the effects of network depth, over-parameterization, and data effective dimension on learning and generalization.

Findings

01

Learning is heterogeneous across network layers.

02

Generalization ability persists in deep, over-parameterized networks.

03

Reducing data effective dimension improves generalization.

Abstract

Deep neural networks (DNN) with a huge number of adjustable parameters remain largely black boxes. To shed light on the hidden layers of DNN, we study supervised learning by a DNN of width $N$ and depth $L$ consisting of $N L$ perceptrons with $c$ inputs by a statistical mechanics approach called the teacher-student setting. We consider an ensemble of student machines that exactly reproduce $M$ sets of $N$ dimensional input/output relations provided by a teacher machine. We show that the problem becomes exactly solvable in what we call as 'dense limit': $N ≫ c ≫ 1$ and $M ≫ 1$ with fixed $α = M / c$ using the replica method developed in (H. Yoshino, (2020)). We also study the model numerically performing simple greedy MC simulations. Simulations reveal that learning by the DNN is quite heterogeneous in the network space: configurations of the teacher and the student machines are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeographic Information Systems Studies · 3D Modeling in Geospatial Applications · Augmented Reality Applications