EquiformerV2: Improved Equivariant Transformer for Scaling to Higher-Degree Representations

Yi-Lun Liao; Brandon Wood; Abhishek Das; Tess Smidt

arXiv:2306.12059 · cs.LG · March 8, 2024 · 82 cites

TL;DR

EquiformerV2 advances equivariant Transformer architectures for 3D atomistic systems by scaling to higher degrees, improving efficiency, accuracy, and data utilization, and demonstrating superior performance on multiple large-scale datasets.

Contribution

The paper introduces EquiformerV2, a scalable and efficient equivariant Transformer architecture with novel architectural improvements for higher-degree representations.

Findings

01 Outperforms previous state-of-the-art methods on the OC20 dataset

02 Achieves up to 9% better force prediction accuracy

03 Halves the number of DFT calculations needed to compute adsorption energies

Abstract

Equivariant Transformers such as Equiformer have demonstrated the efficacy of applying Transformers to the domain of 3D atomistic systems. However, they are limited to small degrees of equivariant representations due to their computational complexity. In this paper, we investigate whether these architectures can scale well to higher degrees. Starting from Equiformer, we first replace $SO(3)$ convolutions with eSCN convolutions to efficiently incorporate higher-degree tensors. Then, to better leverage the power of higher degrees, we propose three architectural improvements: attention re-normalization, separable $S^{2}$ activation, and separable layer normalization. Putting this all together, we propose EquiformerV2, which outperforms previous state-of-the-art methods on the large-scale OC20 dataset by up to $9\%$ on forces, $4\%$ on energies, offers better speed-accuracy trade-offs, and…
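
To make the scaling concern in the abstract concrete, the following is a minimal sketch of how the size of an equivariant feature grows with the maximum degree. It relies only on the standard fact that a degree-$l$ irreducible representation has $2l + 1$ components, so keeping degrees $0$ through $L_{max}$ gives $(L_{max} + 1)^2$ components per channel. The function names are hypothetical, and the $O(L^6)$ versus $O(L^3)$ scalings are an assumption taken from the asymptotic complexities reported for $SO(3)$ tensor products and eSCN convolutions in the eSCN paper, not figures stated on this page.

```python
def irreps_dim(l_max: int) -> int:
    """Components per channel when all degrees 0..l_max are kept.

    Each degree l contributes 2l + 1 components, so the total is (l_max + 1) ** 2.
    """
    return sum(2 * l + 1 for l in range(l_max + 1))


def asymptotic_cost_ratio(l_max: int) -> int:
    """Ratio L^6 / L^3 of the two assumed asymptotic scalings.

    This is a rough growth-rate comparison, not a measured speedup.
    """
    return l_max ** 6 // l_max ** 3


# Print feature size and the assumed scaling ratio for a few maximum degrees.
for l_max in (1, 2, 4, 6, 8):
    print(l_max, irreps_dim(l_max), asymptotic_cost_ratio(l_max))
```

The per-channel feature size grows quadratically in the maximum degree, which suggests why the cost of the convolution itself becomes the bottleneck when moving to higher degrees.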

Code & Models

Models

🤗 facebook/OMAT24 · model · ♡ 94

Taxonomy

Topics: Machine Learning in Materials Science · Advanced Neural Network Applications · Topic Modeling