TASI Lectures on Physics for Machine Learning

Jim Halverson

arXiv:2408.00082 · hep-th · August 2, 2024 · 1 citation

Open Access

TL;DR

These lecture notes survey neural network theory from a physics perspective, covering classic results such as the universal approximation theorem and the neural network / Gaussian process correspondence, recent results such as the neural tangent kernel, and connections to field theory.

Contribution

The notes organize neural network theory around expressivity, statistics, and dynamics, and present both classic and recent results from a field-theoretic perspective familiar to theoretical physicists.

Findings

01 Neural tangent kernel characterizes neural network training dynamics.

02 Maximal update parameterization enhances feature learning.

03 Connections between neural networks and field theory are elucidated.
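The first finding can be made concrete with a small numerical sketch. The empirical neural tangent kernel of a network f is the Gram matrix of its parameter gradients, Θ(x, x') = Σ_θ ∂f(x)/∂θ · ∂f(x')/∂θ. The example below computes it in closed form for a one-hidden-layer tanh network with scalar input. The architecture, the 1/sqrt(N) output scaling, and the function names (`init_params`, `param_gradient`, `empirical_ntk`) are illustrative assumptions and are not taken from the lectures.

```python
import numpy as np


def init_params(width, seed=0):
    """Draw standard normal parameters (w, b, a) for a one-hidden-layer network.

    Assumed toy model: f(x) = (1/sqrt(N)) * sum_i a_i * tanh(w_i * x + b_i).
    """
    rng = np.random.default_rng(seed)
    w = rng.normal(size=width)  # input-to-hidden weights (scalar input)
    b = rng.normal(size=width)  # hidden biases
    a = rng.normal(size=width)  # hidden-to-output weights
    return w, b, a


def param_gradient(x, params):
    """Gradient of f(x) with respect to all parameters, flattened to one vector."""
    w, b, a = params
    n = w.shape[0]
    act = np.tanh(w * x + b)
    dact = 1.0 - act**2  # derivative of tanh
    df_dw = a * dact * x / np.sqrt(n)
    df_db = a * dact / np.sqrt(n)
    df_da = act / np.sqrt(n)
    return np.concatenate([df_dw, df_db, df_da])


def empirical_ntk(xs, params):
    """Empirical NTK matrix Theta[i, j] = grad f(x_i) . grad f(x_j)."""
    jac = np.stack([param_gradient(x, params) for x in xs])  # (n_points, n_params)
    return jac @ jac.T


if __name__ == "__main__":
    xs = np.linspace(-1.0, 1.0, 5)
    theta = empirical_ntk(xs, init_params(width=512))
    print(theta.shape)
```

Because the kernel is a Gram matrix of gradients, it is symmetric and positive semidefinite by construction. Under gradient flow on a squared loss, this matrix governs how the network outputs on the training points evolve.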

Abstract

These notes are based on lectures I gave at TASI 2024 on Physics for Machine Learning. The focus is on neural network theory, organized according to network expressivity, statistics, and dynamics. I present classic results such as the universal approximation theorem and neural network / Gaussian process correspondence, and also more recent results such as the neural tangent kernel, feature learning with the maximal update parameterization, and Kolmogorov-Arnold networks. The exposition on neural network theory emphasizes a field theoretic perspective familiar to theoretical physicists. I elaborate on connections between the two, including a neural network approach to field theory.
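The neural network / Gaussian process correspondence mentioned in the abstract can be illustrated numerically. The sketch below draws many randomly initialized one-hidden-layer tanh networks, evaluates each at a fixed input, and measures the excess kurtosis of the resulting output distribution, which is zero for a Gaussian. The architecture, the weight variances, and the names `sample_outputs` and `excess_kurtosis` are assumptions chosen for illustration, not the conventions of the lectures.

```python
import numpy as np


def sample_outputs(width, x, n_nets=5000, seed=0):
    """Outputs f(x) of n_nets independent random networks at a single input x.

    Assumed toy model: f(x) = sum_i a_i * tanh(W_i . x), with
    W_ij ~ N(0, 1/d) and a_i ~ N(0, 1/width).
    """
    rng = np.random.default_rng(seed)
    d = x.shape[0]
    W = rng.normal(0.0, 1.0 / np.sqrt(d), size=(n_nets, width, d))
    a = rng.normal(0.0, 1.0 / np.sqrt(width), size=(n_nets, width))
    hidden = np.tanh(W @ x)  # shape (n_nets, width)
    return np.sum(a * hidden, axis=1)


def excess_kurtosis(samples):
    """Sample excess kurtosis; approximately zero for Gaussian data."""
    z = (samples - samples.mean()) / samples.std()
    return np.mean(z**4) - 3.0


if __name__ == "__main__":
    x = np.ones(3)
    for width in (1, 4, 16, 256):
        k = excess_kurtosis(sample_outputs(width, x))
        print(f"width={width:4d}  excess kurtosis={k:+.3f}")
```

In this toy setup the excess kurtosis is clearly positive at width 1 and falls toward zero as the width grows, consistent with the output distribution approaching a Gaussian in the wide limit. The residual non-Gaussianity at finite width is the kind of effect a field-theoretic treatment would describe as an interaction.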

Taxonomy

Topics: Computational Physics and Python Applications

Methods: Focus · Gaussian Process