TNT: Vision Transformer for Turbulence Simulations
Yuchen Dang, Zheyuan Hu, Miles Cranmer, Michael Eickenberg, Shirley Ho

TL;DR
This paper introduces TNT, a transformer-based neural network that efficiently simulates turbulence on coarser grids, outperforming traditional methods and demonstrating robustness and potential for broader application.
Contribution
The paper presents a novel transformer architecture, TNT, tailored for turbulence simulation, extending positional embeddings and introducing TMSA for improved spatiotemporal modeling.
Findings
TNT outperforms state-of-the-art U-net in turbulence prediction metrics.
TNT generates stable, long-range turbulence predictions.
Model robustness is confirmed across different initial conditions.
Abstract
Turbulence is notoriously difficult to model due to its multi-scale nature and sensitivity to small perturbations. Classical solvers of turbulence simulation generally operate on finer grids and are computationally inefficient. In this paper, we propose the Turbulence Neural Transformer (TNT), which is a learned simulator based on the transformer architecture, to predict turbulent dynamics on coarsened grids. TNT extends the positional embeddings of vanilla transformers to a spatiotemporal setting to learn the representation in the 3D time-series domain, and applies Temporal Mutual Self-Attention (TMSA), which captures adjacent dependencies, to extract deep and dynamic features. TNT is capable of generating comparatively long-range predictions stably and accurately, and we show that TNT outperforms the state-of-the-art U-net simulator on several metrics. We also test the model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Vision and Imaging · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis
