E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory

Lin Huang; Chengxiang Huang; Ziang Wang; Yiyue Du; Chu Wang; Haocheng Lu; Yunyang Li; Xiaoli Liu; Arthur Jiang; Jia Zhang

arXiv:2601.16622·cs.LG·January 26, 2026

E2Former-V2: On-the-Fly Equivariant Attention with Linear Activation Memory

Lin Huang, Chengxiang Huang, Ziang Wang, Yiyue Du, Chu Wang, Haocheng Lu, Yunyang Li, Xiaoli Liu, Arthur Jiang, Jia Zhang

PDF

Open Access

TL;DR

E2Former-V2 introduces a scalable, efficient equivariant graph neural network architecture that leverages algebraic sparsity and custom GPU kernels to improve computational efficiency while maintaining accuracy in 3D molecular modeling.

Contribution

The paper presents E2Former-V2, a novel architecture combining algebraic sparsity with hardware-aware execution, enabling scalable and efficient equivariant attention for 3D atomistic systems.

Findings

01

Achieves 20× TFLOPS improvement with custom kernels.

02

Maintains comparable predictive performance on molecular datasets.

03

Demonstrates scalable training of large equivariant transformers.

Abstract

Equivariant Graph Neural Networks (EGNNs) have become a widely used approach for modeling 3D atomistic systems. However, mainstream architectures face critical scalability bottlenecks due to the explicit construction of geometric features or dense tensor products on \textit{every} edge. To overcome this, we introduce \textbf{E2Former-V2}, a scalable architecture that integrates algebraic sparsity with hardware-aware execution. We first propose \textbf{E}quivariant \textbf{A}xis-\textbf{A}ligned \textbf{S}parsification (EAAS). EAAS builds on Wigner- $6 j$ convolution by exploiting an $SO (3) \to SO (2)$ change of basis to transform computationally expensive dense tensor contractions into efficient, sparse parity re-indexing operations. Building on this representation, we introduce \textbf{On-the-Fly Equivariant Attention}, a fully node-centric mechanism implemented…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Topic Modeling