Neural Scaling Laws From Large-N Field Theory: Solvable Model Beyond the   Ridgeless Limit

Zhengkang Zhang

arXiv:2405.19398·hep-th·May 31, 2024

Neural Scaling Laws From Large-N Field Theory: Solvable Model Beyond the Ridgeless Limit

Zhengkang Zhang

PDF

Open Access 1 Datasets

TL;DR

This paper uses large-N field theory to analytically derive and extend neural scaling laws in a simplified model, revealing a duality that explains the symmetry between model and data set sizes.

Contribution

It extends previous work by solving a model with nonzero ridge parameter, providing more precise scaling laws and uncovering a duality transformation at the diagram level.

Findings

01

Derived new scaling laws for neural networks with regularization.

02

Uncovered a duality explaining symmetry between model and data sizes.

03

Extended theoretical understanding of neural scaling beyond the ridgeless limit.

Abstract

Many machine learning models based on neural networks exhibit scaling laws: their performance scales as power laws with respect to the sizes of the model and training data set. We use large-N field theory methods to solve a model recently proposed by Maloney, Roberts and Sully which provides a simplified setting to study neural scaling laws. Our solution extends the result in this latter paper to general nonzero values of the ridge parameter, which are essential to regularize the behavior of the model. In addition to obtaining new and more precise scaling laws, we also uncover a duality transformation at the diagrams level which explains the symmetry between model and training data set sizes. The same duality underlies recent efforts to design neural networks to simulate quantum field theories.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Kylan12/Synthetic-AI-ML-Dataset
dataset· 42 dl
42 dl

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Thermodynamics and Statistical Mechanics

MethodsSparse Evolutionary Training