Positional Knowledge is All You Need: Position-induced Transformer (PiT) for Operator Learning
Junfeng Chen, Kailiang Wu

TL;DR
The paper introduces PiT, a novel position-attention transformer for operator learning in PDEs, which improves efficiency and performance over traditional self-attention mechanisms by focusing solely on spatial position relations.
Contribution
It proposes the position-induced attention mechanism, inspired by numerical PDE methods, offering a more efficient and interpretable alternative to self-attention in neural operators.
Findings
PiT outperforms state-of-the-art neural operators on PDE benchmarks.
PiT has better discretization convergence than Fourier neural operator.
Position-attention reduces computational costs significantly.
Abstract
Operator learning for Partial Differential Equations (PDEs) is rapidly emerging as a promising approach for surrogate modeling of intricate systems. Transformers with the self-attention mechanisma powerful tool originally designed for natural language processinghave recently been adapted for operator learning. However, they confront challenges, including high computational demands and limited interpretability. This raises a critical question: Is there a more efficient attention mechanism for Transformer-based operator learning? This paper proposes the Position-induced Transformer (PiT), built on an innovative position-attention mechanism, which demonstrates significant advantages over the classical self-attention in operator learning. Position-attention draws inspiration from numerical methods for PDEs. Different from self-attention, position-attention…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInertial Sensor and Navigation · Indoor and Outdoor Localization Technologies · Spatial Cognition and Navigation
MethodsAttention Is All You Need · Linear Layer · Multi-Head Attention · Dense Connections · Position-Wise Feed-Forward Layer · Dropout · Label Smoothing · Residual Connection · Absolute Position Encodings · Byte Pair Encoding
