A Neural Network Perturbation Theory Based on the Born Series

Bastian Kaspschak; Ulf-G. Meißner

arXiv:2009.03192 · cs.LG · June 30, 2021

TL;DR

This paper develops a graph-theoretical neural network perturbation theory inspired by the Born series, enabling systematic access to higher-order derivatives for applications in theoretical physics.

Contribution

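It introduces a framework of propagators and vertices, both depending on the weights and biases of a multilayer perceptron (MLP), for computing higher-order Taylor expansions of the network. The resulting graph-theoretical rules are analogous to Feynman rules in quantum field theory.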

Findings

1. Successfully models first- and second-order Born approximations
2. Neural networks adapt mainly to the leading order of target functions
3. Iterative approach improves higher-order derivative learning

Abstract

Deep Learning using the eponymous deep neural networks (DNNs) has become an attractive approach towards various data-based problems of theoretical physics in the past decade. There has been a clear trend to deeper architectures containing increasingly more powerful and involved layers. Contrarily, Taylor coefficients of DNNs still appear mainly in the light of interpretability studies, where they are computed at most to first order. However, especially in theoretical physics numerous problems benefit from accessing higher orders, as well. This gap motivates a general formulation of neural network (NN) Taylor expansions. Restricting our analysis to multilayer perceptrons (MLPs) and introducing quantities we refer to as propagators and vertices, both depending on the MLP's weights and biases, we establish a graph-theoretical approach. Similarly to Feynman rules in quantum field theories,…
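
As a rough illustration of what a second-order Taylor expansion of an MLP means, the sketch below differentiates a tiny one-input tanh network directly with the chain rule. It does not implement the propagator and vertex formalism of the paper. The parameter values (`W`, `B`, `V`, `C`) and the function names are hypothetical, chosen only for illustration.

```python
import math

# Hypothetical toy parameters: 1 input, 3 tanh hidden units, 1 output.
W = [0.5, -1.2, 0.8]   # input-to-hidden weights
B = [0.1, 0.3, -0.2]   # hidden biases
V = [1.0, -0.7, 0.4]   # hidden-to-output weights
C = 0.05               # output bias


def mlp(x):
    """Evaluate f(x) = C + sum_j V_j * tanh(W_j * x + B_j)."""
    return C + sum(v * math.tanh(w * x + b) for w, b, v in zip(W, B, V))


def taylor_coefficients(x0):
    """Return (f(x0), f'(x0), f''(x0) / 2) obtained from the chain rule."""
    c0, c1, c2 = C, 0.0, 0.0
    for w, b, v in zip(W, B, V):
        t = math.tanh(w * x0 + b)
        s = 1.0 - t * t                    # tanh'(z) = 1 - tanh(z)^2
        c0 += v * t
        c1 += v * w * s
        c2 += v * w * w * (-2.0 * t * s) / 2.0   # tanh''(z) = -2 tanh(z) tanh'(z)
    return c0, c1, c2


def taylor(x, x0, order):
    """Evaluate the Taylor polynomial of the MLP around x0 up to `order` (0, 1 or 2)."""
    coeffs = taylor_coefficients(x0)
    dx = x - x0
    return sum(c * dx ** k for k, c in enumerate(coeffs[: order + 1]))


x0, x = 0.0, 0.2
err_first = abs(mlp(x) - taylor(x, x0, 1))    # error of the first-order expansion
err_second = abs(mlp(x) - taylor(x, x0, 2))   # error of the second-order expansion
print(err_first, err_second)
```

For this toy network the second-order expansion is closer to the network output near `x0` than the first-order one, which is the kind of higher-order information that first-order interpretability methods do not access.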

Code & Models

Datasets

Kylan12/Synthetic-AI-ML-Dataset (dataset, 42 downloads)