The autoregressive neural network architecture of the Boltzmann distribution of pairwise interacting spins systems
Indaco Biazzo

TL;DR
This paper introduces an exact autoregressive neural network architecture derived from the Boltzmann distribution of binary spin systems, linking physical models with neural network design to improve approximation accuracy.
Contribution
It provides a novel, physically interpretable ARNN architecture for spin systems, enabling systematic derivation of models from physical principles and demonstrating superior performance.
Findings
Derived ARNN architectures for Curie-Weiss and Sherrington-Kirkpatrick models
Achieved better approximation of Boltzmann distributions than existing architectures
Established a physical-interpretation framework for neural network design
Abstract
Generative Autoregressive Neural Networks (ARNNs) have recently demonstrated exceptional results in image and language generation tasks, contributing to the growing popularity of generative models in both scientific and commercial applications. This work presents an exact mapping of the Boltzmann distribution of binary pairwise interacting systems into autoregressive form. The resulting ARNN architecture has weights and biases of its first layer corresponding to the Hamiltonian's couplings and external fields, featuring widely used structures such as the residual connections and a recurrent architecture with clear physical meanings. Moreover, its architecture's explicit formulation enables the use of statistical physics techniques to derive new ARNNs for specific systems. As examples, new effective ARNN architectures are derived from two well-known mean-field systems, the Curie-Weiss…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Neural Networks and Applications · Machine Learning in Materials Science
