Transformer for Partial Differential Equations' Operator Learning
Zijie Li, Kazem Meidani, Amir Barati Farimani

TL;DR
This paper introduces Operator Transformer (OFormer), an attention-based neural network framework for learning solution operators of partial differential equations that is flexible with input sampling patterns.
Contribution
The paper presents a novel attention-based framework, OFormer, for data-driven PDE operator learning, which makes minimal assumptions on input sampling and demonstrates competitive performance.
Findings
OFormer is competitive on standard PDE benchmark problems.
The framework can adapt to randomly sampled input data.
Attention mechanisms provide flexible modeling of PDE operators.
Abstract
Data-driven learning of partial differential equations' solution operators has recently emerged as a promising paradigm for approximating the underlying solutions. The solution operators are usually parameterized by deep learning models that are built upon problem-specific inductive biases. An example is a convolutional or a graph neural network that exploits the local grid structure where functions' values are sampled. The attention mechanism, on the other hand, provides a flexible way to implicitly exploit the patterns within inputs, and furthermore, relationship between arbitrary query locations and inputs. In this work, we present an attention-based framework for data-driven operator learning, which we term Operator Transformer (OFormer). Our framework is built upon self-attention, cross-attention, and a set of point-wise multilayer perceptrons (MLPs), and thus it makes few…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsModel Reduction and Neural Networks · Electromagnetic Simulation and Numerical Methods · Numerical methods for differential equations
MethodsMulti-Head Attention · Attention Is All You Need · Graph Neural Network · Linear Layer · Layer Normalization · Softmax · Dense Connections · Absolute Position Encodings · Dropout · Byte Pair Encoding
