Spatially heterogeneous learning by a deep student machine
Hajime Yoshino

TL;DR
This paper uses a statistical mechanics approach to analyze deep neural networks, revealing heterogeneous learning patterns and showing that generalization ability persists even in heavily over-parameterized, deep regimes.
Contribution
It provides an exact solution in the dense limit for deep neural networks and explores the effects of network depth, over-parameterization, and data effective dimension on learning and generalization.
Findings
Learning is heterogeneous across network layers.
Generalization ability persists in deep, over-parameterized networks.
Reducing data effective dimension improves generalization.
Abstract
Deep neural networks (DNN) with a huge number of adjustable parameters remain largely black boxes. To shed light on the hidden layers of DNN, we study supervised learning by a DNN of width and depth consisting of perceptrons with inputs by a statistical mechanics approach called the teacher-student setting. We consider an ensemble of student machines that exactly reproduce sets of dimensional input/output relations provided by a teacher machine. We show that the problem becomes exactly solvable in what we call as 'dense limit': and with fixed using the replica method developed in (H. Yoshino, (2020)). We also study the model numerically performing simple greedy MC simulations. Simulations reveal that learning by the DNN is quite heterogeneous in the network space: configurations of the teacher and the student machines are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeographic Information Systems Studies · 3D Modeling in Geospatial Applications · Augmented Reality Applications
