Scalable Transformer for PDE Surrogate Modeling

Zijie Li; Dule Shu; Amir Barati Farimani

arXiv:2305.17560·cs.LG·November 6, 2023·20 cites

Scalable Transformer for PDE Surrogate Modeling

Zijie Li, Dule Shu, Amir Barati Farimani

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces FactFormer, a scalable Transformer model with axial factorized kernels for efficient and accurate PDE surrogate modeling on high-dimensional grids, outperforming traditional methods in stability and computational cost.

Contribution

The paper proposes a novel Factorized Transformer with axial kernel decomposition, enabling efficient PDE surrogate modeling on large multi-dimensional grids.

Findings

01

Successfully simulates 2D Kolmogorov flow on 256x256 grid

02

Accurately models 3D smoke buoyancy on 64x64x64 grid

03

Offers a computationally efficient low-rank surrogate for full attention

Abstract

Transformer has shown state-of-the-art performance on various applications and has recently emerged as a promising tool for surrogate modeling of partial differential equations (PDEs). Despite the introduction of linear-complexity attention, applying Transformer to problems with a large number of grid points can be numerically unstable and computationally expensive. In this work, we propose Factorized Transformer (FactFormer), which is based on an axial factorized kernel integral. Concretely, we introduce a learnable projection operator that decomposes the input function into multiple sub-functions with one-dimensional domain. These sub-functions are then evaluated and used to compute the instance-based kernel with an axial factorized scheme. We showcase that the proposed model is able to simulate 2D Kolmogorov flow on a $256 \times 256$ grid and 3D smoke buoyancy on a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

BaratiLab/FactFormer
pytorchOfficial

Videos

Scalable Transformer for PDE Surrogate Modeling· slideslive

Taxonomy

TopicsModel Reduction and Neural Networks · Computational Physics and Python Applications · Computer Graphics and Visualization Techniques

MethodsMulti-Head Attention · Attention Is All You Need · Dense Connections · Dropout · Byte Pair Encoding · Softmax · Layer Normalization · Position-Wise Feed-Forward Layer · Linear Layer · Absolute Position Encodings