Finite-Width Neural Tangent Kernels from Feynman Diagrams

Max Guillen; Philipp Misof; Jan E. Gerken

arXiv:2508.11522·cs.LG·February 16, 2026

Finite-Width Neural Tangent Kernels from Feynman Diagrams

Max Guillen, Philipp Misof, Jan E. Gerken

PDF

TL;DR

This paper introduces a Feynman diagram-based method to compute finite-width corrections to neural tangent kernels, enabling better understanding of neural network training dynamics beyond the infinite-width approximation.

Contribution

The authors develop a novel Feynman diagram framework for calculating finite-width corrections to NTK statistics, extending analysis to layer-wise recursion relations and higher-order tensors.

Findings

01

Feynman diagrams simplify finite-width correction calculations.

02

Finite-width effects are negligible for scale-invariant nonlinearities like ReLU.

03

Numerical results match sampled neural network statistics for widths greater than 20.

Abstract

Neural tangent kernels (NTKs) are a powerful tool for analyzing deep, non-linear neural networks. In the infinite-width limit, NTKs can easily be computed for most common architectures, yielding full analytic control over the training dynamics. However, at infinite width, important properties of training such as NTK evolution or feature learning are absent. Nevertheless, finite width effects can be included by computing corrections to the Gaussian statistics at infinite width. We introduce Feynman diagrams for computing finite-width corrections to NTK statistics. These dramatically simplify the necessary algebraic manipulations and enable the computation of layer-wise recursion relations for arbitrary statistics involving preactivations, NTKs and certain higher-derivative tensors (dNTK and ddNTK) required to predict the training dynamics at leading order. We demonstrate the feasibility…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.