TL;DR
EquiformerV3 is an advanced $SE(3)$-equivariant graph attention Transformer that improves efficiency, expressivity, and generality for 3D atomistic modeling, achieving state-of-the-art results on multiple benchmarks.
Contribution
It introduces software optimizations, new equivariant normalization, improved hyperparameters, and SwiGLU-$S^2$ activations to enhance the capabilities of $SE(3)$-equivariant graph neural networks.
Findings
Achieves 1.75x speedup over previous version.
Enables accurate modeling of potential energy surfaces.
Sets new state-of-the-art on OC20, OMat24, and Matbench Discovery.
Abstract
As -equivariant graph neural networks mature as a core tool for 3D atomistic modeling, improving their efficiency, expressivity, and physical consistency has become a central challenge for large-scale applications. In this work, we introduce EquiformerV3, the third generation of the -equivariant graph attention Transformer, designed to advance all three dimensions: efficiency, expressivity, and generality. Building on EquiformerV2, we have the following three key advances. First, we optimize the software implementation, achieving speedup. Second, we introduce simple and effective modifications to EquiformerV2, including equivariant merged layer normalization, improved feedforward network hyper-parameters, and attention with smooth radius cutoff. Third, we propose SwiGLU- activations to incorporate many-body interactions for better theoretical expressivity…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
