Deep Learning the Ising Model Near Criticality
Alan Morningstar, Roger G. Melko

TL;DR
This study compares shallow and deep neural networks in modeling the 2D Ising system near criticality, finding shallow networks more efficient for representing physical probability distributions in this context.
Contribution
It demonstrates that shallow neural networks outperform deep ones in representing Ising model data near phase transitions, challenging assumptions about the advantages of depth.
Findings
Shallow networks are more accurate near criticality.
Model accuracy depends mainly on the first hidden layer size.
Deep networks do not show significant advantage in this setting.
Abstract
It is well established that neural networks with deep architectures perform better than shallow networks for many tasks in machine learning. In statistical physics, while there has been recent interest in representing physical data with generative modelling, the focus has been on shallow neural networks. A natural question to ask is whether deep neural networks hold any advantage over shallow networks in representing such data. We investigate this question by using unsupervised, generative graphical models to learn the probability distribution of a two-dimensional Ising system. Deep Boltzmann machines, deep belief networks, and deep restricted Boltzmann networks are trained on thermal spin configurations from this system, and compared to the shallow architecture of the restricted Boltzmann machine. We benchmark the models, focussing on the accuracy of generating energetic observables…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Model Reduction and Neural Networks · Computational Physics and Python Applications
