Generalization capabilities of neural networks in lattice applications

Srinath Bulusu; Matteo Favoni; Andreas Ipp; David I. Müller; Daniel Schuh

arXiv:2112.12474 · hep-lat · December 24, 2021

TL;DR

This paper demonstrates that translationally equivariant neural networks outperform non-equivariant ones in lattice field theory tasks, showing better generalization across physical parameters and lattice sizes.

Contribution

It systematically compares equivariant and non-equivariant neural networks, highlighting the advantages of incorporating symmetries in lattice applications.

Findings

1. Equivariant networks outperform non-equivariant ones in regression and classification tasks.
2. Equivariant architectures generalize better to unseen physical parameters.
3. Performance gains extend across different lattice sizes.

Abstract

In recent years, the use of machine learning has become increasingly popular in the context of lattice field theories. An essential element of such theories is represented by symmetries, whose inclusion in the neural network properties can lead to high reward in terms of performance and generalizability. A fundamental symmetry that usually characterizes physical systems on a lattice with periodic boundary conditions is equivariance under spacetime translations. Here we investigate the advantages of adopting translationally equivariant neural networks in favor of non-equivariant ones. The system we consider is a complex scalar field with quartic interaction on a two-dimensional lattice in the flux representation, on which the networks carry out various regression and classification tasks. Promising equivariant and non-equivariant architectures are identified with a systematic search. We…
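The translational equivariance described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' architecture: the helper names `circular_conv2d` and `translate`, the 8×8 lattice, the 3×3 kernel, and the random data are all assumptions chosen for illustration. The sketch checks that a convolution with periodic wrapping commutes with lattice translations, that a global sum over the output is translation invariant, and that a layer with site-dependent weights breaks the symmetry.

```python
import numpy as np


def circular_conv2d(field, kernel):
    """Cross-correlate a 2D lattice field with a kernel, wrapping periodically."""
    out = np.zeros_like(field, dtype=float)
    kh, kw = kernel.shape
    for i in range(kh):
        for j in range(kw):
            # A negative roll moves site (x + i, y + j) onto (x, y),
            # wrapping around the lattice edges (periodic boundary conditions).
            out += kernel[i, j] * np.roll(field, shift=(-i, -j), axis=(0, 1))
    return out


def translate(field, dx, dy):
    """Shift the field by (dx, dy) sites on the periodic lattice."""
    return np.roll(field, shift=(dx, dy), axis=(0, 1))


rng = np.random.default_rng(0)
field = rng.normal(size=(8, 8))    # hypothetical lattice configuration
kernel = rng.normal(size=(3, 3))   # hypothetical convolution weights
mask = rng.normal(size=(8, 8))     # hypothetical site-dependent weights

# Equivariance: translating then convolving equals convolving then translating.
lhs = circular_conv2d(translate(field, 2, 5), kernel)
rhs = translate(circular_conv2d(field, kernel), 2, 5)
equivariant = np.allclose(lhs, rhs)

# Invariance: a global sum over the equivariant output ignores the shift.
invariant = np.isclose(lhs.sum(), circular_conv2d(field, kernel).sum())

# Counter-example: site-dependent weights do not commute with translations.
masked_equivariant = np.allclose(
    translate(field, 2, 5) * mask, translate(field * mask, 2, 5)
)

print(equivariant, invariant, masked_equivariant)
```

Because the kernel weights are shared across all sites and the boundaries wrap, the same weights can in principle be applied to a lattice of any size, which is one plausible reason such layers transfer across lattice sizes.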

Taxonomy

Topics: Nuclear Physics and Applications · Machine Learning in Materials Science · Model Reduction and Neural Networks