E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory
Lin Huang, Chengxiang Huang, Ziang Wang, Yiyue Du, Chu Wang, Haocheng Lu, Yunyang Li, Xiaoli Liu, Arthur Jiang, Jia Zhang

TL;DR
E2Former-V2 introduces a scalable, efficient equivariant graph neural network architecture that leverages algebraic sparsity and custom GPU kernels to improve computational efficiency while maintaining accuracy in 3D molecular modeling.
Contribution
The paper presents E2Former-V2, a novel architecture combining algebraic sparsity with hardware-aware execution, enabling scalable and efficient equivariant attention for 3D atomistic systems.
Findings
Achieves 20× TFLOPS improvement with custom kernels.
Maintains comparable predictive performance on molecular datasets.
Demonstrates scalable training of large equivariant transformers.
Abstract
Equivariant Graph Neural Networks (EGNNs) have become a widely used approach for modeling 3D atomistic systems. However, mainstream architectures face critical scalability bottlenecks due to the explicit construction of geometric features or dense tensor products on \textit{every} edge. To overcome this, we introduce \textbf{E2Former-V2}, a scalable architecture that integrates algebraic sparsity with hardware-aware execution. We first propose \textbf{E}quivariant \textbf{A}xis-\textbf{A}ligned \textbf{S}parsification (EAAS). EAAS builds on Wigner- convolution by exploiting an change of basis to transform computationally expensive dense tensor contractions into efficient, sparse parity re-indexing operations. Building on this representation, we introduce \textbf{On-the-Fly Equivariant Attention}, a fully node-centric mechanism implemented…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Topic Modeling
