Equiformer: Equivariant Graph Attention Transformer for 3D Atomistic Graphs
Yi-Lun Liao, Tess Smidt

TL;DR
Equiformer introduces an equivariant Transformer-based graph neural network for 3D atomistic graphs, effectively encoding 3D geometric information and achieving strong empirical results on molecular datasets.
Contribution
The paper presents a simple, effective equivariant Transformer architecture with a novel attention mechanism for 3D graphs, improving generalization and performance.
Findings
Achieves competitive results on QM9, MD17, and OC20 datasets.
Introduces equivariant graph attention with MLP-based attention.
Demonstrates strong empirical performance with minimal architectural modifications.
Abstract
Despite their widespread success in various domains, Transformer networks have yet to perform well across datasets in the domain of 3D atomistic graphs such as molecules even when 3D-related inductive biases like translational invariance and rotational equivariance are considered. In this paper, we demonstrate that Transformers can generalize well to 3D atomistic graphs and present Equiformer, a graph neural network leveraging the strength of Transformer architectures and incorporating SE(3)/E(3)-equivariant features based on irreducible representations (irreps). First, we propose a simple and effective architecture by only replacing original operations in Transformers with their equivariant counterparts and including tensor products. Using equivariant operations enables encoding equivariant information in channels of irreps features without complicating graph structures. With minimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks
MethodsMulti-Head Attention · Attention Is All You Need · Graph Neural Network · Linear Layer · Softmax · Residual Connection · Adam · Byte Pair Encoding · Layer Normalization · Absolute Position Encodings
