Transformers as Neural Operators for Solutions of Differential Equations   with Finite Regularity

Benjamin Shih; Ahmad Peyvan; Zhongqiang Zhang; George Em Karniadakis

arXiv:2405.19166·cs.LG·May 30, 2024

Transformers as Neural Operators for Solutions of Differential Equations with Finite Regularity

Benjamin Shih, Ahmad Peyvan, Zhongqiang Zhang, George Em Karniadakis

PDF

Open Access

TL;DR

This paper demonstrates that transformer models can serve as universal neural operators for solving PDEs with finite regularity, outperforming DeepONet in accuracy across diverse dynamical systems.

Contribution

It establishes the universal approximation property of transformers as neural operators and applies them to PDE solutions with low regularity, a novel application in this context.

Findings

01

Transformers outperform DeepONet in accuracy for PDE solutions.

02

Transformers are capable of modeling solutions with finite regularity.

03

Transformers are more computationally expensive than DeepONet.

Abstract

Neural operator learning models have emerged as very effective surrogates in data-driven methods for partial differential equations (PDEs) across different applications from computational science and engineering. Such operator learning models not only predict particular instances of a physical or biological system in real-time but also forecast classes of solutions corresponding to a distribution of initial and boundary conditions or forcing terms. % DeepONet is the first neural operator model and has been tested extensively for a broad class of solutions, including Riemann problems. Transformers have not been used in that capacity, and specifically, they have not been tested for solutions of PDEs with low regularity. % In this work, we first establish the theoretical groundwork that transformers possess the universal approximation property as operator learning models. We then apply…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications