SEA: State-Exchange Attention for High-Fidelity Physics Based Transformers
Parsa Esmati, Amirhossein Dadashzadeh, Vahid Goodarzi, Nicolas, Larrosa, Nicol\`o Grilli

TL;DR
This paper introduces the State-Exchange Attention (SEA) module, a transformer-based approach that improves the accuracy of physics-based predictions in dynamical systems by enabling multidirectional information exchange between fields, significantly reducing rollout errors.
Contribution
The paper presents the novel SEA module with cross-field attention and an efficient mesh autoencoder, achieving state-of-the-art error reduction in physics-based transformers for dynamical systems.
Findings
Outperforms existing models with 88-91% error reduction
SEA module reduces errors by 97% in highly dependent state variables
Demonstrates superior rollout accuracy over baseline models
Abstract
Current approaches using sequential networks have shown promise in estimating field variables for dynamical systems, but they are often limited by high rollout errors. The unresolved issue of rollout error accumulation results in unreliable estimations as the network predicts further into the future, with each step's error compounding and leading to an increase in inaccuracy. Here, we introduce the State-Exchange Attention (SEA) module, a novel transformer-based module enabling information exchange between encoded fields through multi-head cross-attention. The cross-field multidirectional information exchange design enables all state variables in the system to exchange information with one another, capturing physical relationships and symmetries between fields. Additionally, we introduce an efficient ViT-like mesh autoencoder to generate spatially coherent mesh embeddings for a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsNeural Networks and Applications · Neural Networks and Reservoir Computing · Advanced Memory and Neural Computing
MethodsDense Connections · Layer Normalization · Residual Connection · Position-Wise Feed-Forward Layer · Attention Is All You Need · Adam · Linear Layer · Softmax · Multi-Head Attention · Dropout
