Positional Knowledge is All You Need: Position-induced Transformer (PiT)   for Operator Learning

Junfeng Chen; Kailiang Wu

arXiv:2405.09285·cs.LG·May 17, 2024·2 cites

Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning

Junfeng Chen, Kailiang Wu

PDF

Open Access 1 Repo

TL;DR

The paper introduces PiT, a novel position-attention transformer for operator learning in PDEs, which improves efficiency and performance over traditional self-attention mechanisms by focusing solely on spatial position relations.

Contribution

It proposes the position-induced attention mechanism, inspired by numerical PDE methods, offering a more efficient and interpretable alternative to self-attention in neural operators.

Findings

01

PiT outperforms state-of-the-art neural operators on PDE benchmarks.

02

PiT has better discretization convergence than Fourier neural operator.

03

Position-attention reduces computational costs significantly.

Abstract

Operator learning for Partial Differential Equations (PDEs) is rapidly emerging as a promising approach for surrogate modeling of intricate systems. Transformers with the self-attention mechanism $\unicode x 2013$ a powerful tool originally designed for natural language processing $\unicode x 2013$ have recently been adapted for operator learning. However, they confront challenges, including high computational demands and limited interpretability. This raises a critical question: Is there a more efficient attention mechanism for Transformer-based operator learning? This paper proposes the Position-induced Transformer (PiT), built on an innovative position-attention mechanism, which demonstrates significant advantages over the classical self-attention in operator learning. Position-attention draws inspiration from numerical methods for PDEs. Different from self-attention, position-attention…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

junfeng-chen/position_induced_transformer
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsInertial Sensor and Navigation · Indoor and Outdoor Localization Technologies · Spatial Cognition and Navigation

MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding