Geometry-aware Line Graph Transformer Pre-training for Molecular Property Prediction
Peizhen Bai, Xianyuan Liu, Haiping Lu

TL;DR
This paper introduces Galformer, a self-supervised pre-training framework that leverages both 2D topological and 3D geometric information of molecules to improve molecular property prediction accuracy.
Contribution
It proposes a dual-modality line graph transformer backbone and novel pre-training tasks that incorporate 2D and 3D molecular data for better representation learning.
Findings
Galformer outperforms six state-of-the-art baselines on twelve benchmarks.
It effectively captures both topological and geometric molecular features.
Experimental results demonstrate superior performance in classification and regression tasks.
Abstract
Molecular property prediction with deep learning has gained much attention over the past years. Owing to the scarcity of labeled molecules, there has been growing interest in self-supervised learning methods that learn generalizable molecular representations from unlabeled data. Molecules are typically treated as 2D topological graphs in modeling, but it has been discovered that their 3D geometry is of great importance in determining molecular functionalities. In this paper, we propose the Geometry-aware line graph transformer (Galformer) pre-training, a novel self-supervised learning framework that aims to enhance molecular representation learning with 2D and 3D modalities. Specifically, we first design a dual-modality line graph transformer backbone to encode the topological and geometric information of a molecule. The designed backbone incorporates effective structural encodings to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science
MethodsAttention Is All You Need · Softmax · Dense Connections · Linear Layer · Byte Pair Encoding · Dropout · Adam · Position-Wise Feed-Forward Layer · Multi-Head Attention · Residual Connection
