Transformer for Partial Differential Equations' Operator Learning

Zijie Li; Kazem Meidani; Amir Barati Farimani

arXiv:2205.13671 · cs.LG · May 1, 2023 · 46 cites

TL;DR

This paper introduces Operator Transformer (OFormer), an attention-based neural network framework for learning solution operators of partial differential equations that is flexible with input sampling patterns.

Contribution

The paper presents a novel attention-based framework, OFormer, for data-driven PDE operator learning, which makes minimal assumptions on input sampling and demonstrates competitive performance.

Findings

1. OFormer is competitive on standard PDE benchmark problems.
2. The framework can adapt to randomly sampled input data.
3. Attention mechanisms provide flexible modeling of PDE operators.

Abstract

Data-driven learning of partial differential equations' solution operators has recently emerged as a promising paradigm for approximating the underlying solutions. The solution operators are usually parameterized by deep learning models that are built upon problem-specific inductive biases. An example is a convolutional or a graph neural network that exploits the local grid structure where functions' values are sampled. The attention mechanism, on the other hand, provides a flexible way to implicitly exploit the patterns within inputs and, furthermore, the relationship between arbitrary query locations and inputs. In this work, we present an attention-based framework for data-driven operator learning, which we term Operator Transformer (OFormer). Our framework is built upon self-attention, cross-attention, and a set of point-wise multilayer perceptrons (MLPs), and thus it makes few…
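The cross-attention idea mentioned in the abstract, relating arbitrary query locations to inputs sampled without a grid, can be illustrated with a minimal NumPy sketch. This is not the authors' implementation: it uses standard scaled dot-product (softmax) attention, random untrained weights, and a toy 1D input, and the function name `cross_attention`, the feature size `d`, and the linear embeddings are all assumptions made for illustration. The attention variant and encodings in the official repository may differ.

```python
import numpy as np

def cross_attention(query_feats, input_feats, w_q, w_k, w_v):
    """Standard scaled dot-product cross-attention (illustrative only).

    Each query point attends to every input point, so neither set
    needs to lie on a regular grid.
    """
    q = query_feats @ w_q                      # (n_query, d)
    k = input_feats @ w_k                      # (n_input, d)
    v = input_feats @ w_v                      # (n_input, d)
    scores = q @ k.T / np.sqrt(q.shape[-1])    # (n_query, n_input)
    scores -= scores.max(axis=-1, keepdims=True)  # for numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True)  # rows sum to 1
    return weights @ v, weights

rng = np.random.default_rng(0)
d = 8  # hypothetical feature dimension

# Input function sampled at 50 random (non-grid) locations in [0, 1].
x_in = rng.uniform(0.0, 1.0, size=(50, 1))
u_in = np.sin(2.0 * np.pi * x_in)
# Toy point-wise linear embedding of (location, value) pairs.
input_feats = np.concatenate([x_in, u_in], axis=1) @ rng.normal(size=(2, d))

# 20 arbitrary query locations, independent of the input locations.
x_query = rng.uniform(0.0, 1.0, size=(20, 1))
query_feats = x_query @ rng.normal(size=(1, d))

# Random, untrained projection matrices (assumed for the sketch).
w_q, w_k, w_v = (rng.normal(size=(d, d)) / np.sqrt(d) for _ in range(3))

out, weights = cross_attention(query_feats, input_feats, w_q, w_k, w_v)
print(out.shape, weights.shape)
```

Because the attention weights are computed between every query and every input point, the number and placement of input samples can change between calls without changing the model's parameters, which is the flexibility the abstract refers to.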

Code & Models

Repositories

BaratiLab/OFormer (PyTorch, official)

Taxonomy

Topics: Model Reduction and Neural Networks · Electromagnetic Simulation and Numerical Methods · Numerical methods for differential equations

Methods: Multi-Head Attention · Attention Is All You Need · Graph Neural Network · Linear Layer · Layer Normalization · Softmax · Dense Connections · Absolute Position Encodings · Dropout · Byte Pair Encoding