An Equivariant Pretrained Transformer for Unified 3D Molecular Representation Learning
Rui Jiao, Xiangzhe Kong, Li Zhang, Ziyang Yu, Fangyuan Ren, Wenjuan, Tan, Wenbing Huang, Yang Liu

TL;DR
This paper introduces EPT, a unified 3D molecular representation model that leverages cross-domain data and equivariant transformers, significantly improving performance in molecular and protein tasks and aiding drug discovery.
Contribution
The paper presents EPT, a novel all-atom, E(3)-equivariant transformer pretrained on diverse 3D molecular data, enabling cross-domain learning and superior downstream task performance.
Findings
EPT outperforms previous methods in ligand binding affinity prediction.
EPT achieves competitive results in protein and molecular property prediction.
EPT successfully identifies potential anti-COVID-19 drug candidates.
Abstract
Pretraining on a large number of unlabeled 3D molecules has showcased superiority in various scientific applications. However, prior efforts typically focus on pretraining models in a specific domain, either proteins or small molecules, missing the opportunity to leverage cross-domain knowledge. To mitigate this gap, we introduce Equivariant Pretrained Transformer (EPT), an all-atom foundation model that can be pretrained from multiple domain 3D molecules. Built upon an E(3)-equivariant transformer, EPT is able to not only process atom-level information but also incorporate block-level features (e.g. residuals in proteins). Additionally, we employ a block-level denoising task, rather than the conventional atom-level denoising, as the pretraining objective. To pretrain EPT, we construct a large-scale dataset of 5.89M entries, comprising small molecules, proteins, protein-protein…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMolecular spectroscopy and chirality · Spectroscopy Techniques in Biomedical and Chemical Research · Neural Networks and Applications
MethodsLinear Layer · Dense Connections · Label Smoothing · Focus · Adam · Attention Is All You Need · Softmax · Multi-Head Attention · Layer Normalization · Dropout
