Fine-tuning Vision Transformers for the Prediction of State Variables in   Ising Models

Onur Kara; Arijit Sehanobish; Hector H Corzo

arXiv:2109.13925·cs.CV·December 1, 2021

Fine-tuning Vision Transformers for the Prediction of State Variables in Ising Models

Onur Kara, Arijit Sehanobish, Hector H Corzo

PDF

TL;DR

This paper demonstrates that Vision Transformers can effectively predict state variables in 2D Ising model simulations, outperforming CNNs with limited data, and suggests potential for broader applications in physics simulations.

Contribution

Introduces the application of Vision Transformers to predict Ising model states, showing superior performance over CNNs with small datasets.

Findings

01

ViT outperforms CNNs in Ising model state prediction

02

ViT requires fewer microstate images for accurate predictions

03

Potential for applying ViT to other physical simulations

Abstract

Transformers are state-of-the-art deep learning models that are composed of stacked attention and point-wise, fully connected layers designed for handling sequential data. Transformers are not only ubiquitous throughout Natural Language Processing (NLP), but, recently, they have inspired a new wave of Computer Vision (CV) applications research. In this work, a Vision Transformer (ViT) is applied to predict the state variables of 2-dimensional Ising model simulations. Our experiments show that ViT outperform state-of-the-art Convolutional Neural Networks (CNN) when using a small number of microstate images from the Ising model corresponding to various boundary conditions and temperatures. This work opens the possibility of applying ViT to other simulations, and raises interesting research directions on how attention maps can learn about the underlying physics governing different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Dropout · Layer Normalization · Position-Wise Feed-Forward Layer · Adam · Dense Connections · Byte Pair Encoding · Label Smoothing