A Neural Network Perturbation Theory Based on the Born Series
Bastian Kaspschak, Ulf-G. Mei{\ss}ner

TL;DR
This paper develops a graph-theoretical neural network perturbation theory inspired by the Born series, enabling systematic access to higher-order derivatives for applications in theoretical physics.
Contribution
It introduces a novel framework using propagators and vertices to perform higher-order Taylor expansions of neural networks, inspired by Feynman diagrams in quantum field theory.
Findings
Successfully models first- and second-order Born approximations
Neural networks adapt mainly to the leading order of target functions
Iterative approach improves higher-order derivative learning
Abstract
Deep Learning using the eponymous deep neural networks (DNNs) has become an attractive approach towards various data-based problems of theoretical physics in the past decade. There has been a clear trend to deeper architectures containing increasingly more powerful and involved layers. Contrarily, Taylor coefficients of DNNs still appear mainly in the light of interpretability studies, where they are computed at most to first order. However, especially in theoretical physics numerous problems benefit from accessing higher orders, as well. This gap motivates a general formulation of neural network (NN) Taylor expansions. Restricting our analysis to multilayer perceptrons (MLPs) and introducing quantities we refer to as propagators and vertices, both depending on the MLP's weights and biases, we establish a graph-theoretical approach. Similarly to Feynman rules in quantum field theories,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
