Geometry-aware Line Graph Transformer Pre-training for Molecular   Property Prediction

Peizhen Bai; Xianyuan Liu; Haiping Lu

arXiv:2309.00483·cs.LG·September 4, 2023

Geometry-aware Line Graph Transformer Pre-training for Molecular Property Prediction

Peizhen Bai, Xianyuan Liu, Haiping Lu

PDF

Open Access

TL;DR

This paper introduces Galformer, a self-supervised pre-training framework that leverages both 2D topological and 3D geometric information of molecules to improve molecular property prediction accuracy.

Contribution

It proposes a dual-modality line graph transformer backbone and novel pre-training tasks that incorporate 2D and 3D molecular data for better representation learning.

Findings

01

Galformer outperforms six state-of-the-art baselines on twelve benchmarks.

02

It effectively captures both topological and geometric molecular features.

03

Experimental results demonstrate superior performance in classification and regression tasks.

Abstract

Molecular property prediction with deep learning has gained much attention over the past years. Owing to the scarcity of labeled molecules, there has been growing interest in self-supervised learning methods that learn generalizable molecular representations from unlabeled data. Molecules are typically treated as 2D topological graphs in modeling, but it has been discovered that their 3D geometry is of great importance in determining molecular functionalities. In this paper, we propose the Geometry-aware line graph transformer (Galformer) pre-training, a novel self-supervised learning framework that aims to enhance molecular representation learning with 2D and 3D modalities. Specifically, we first design a dual-modality line graph transformer backbone to encode the topological and geometric information of a molecule. The designed backbone incorporates effective structural encodings to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science

MethodsAttention Is All You Need · Softmax · Dense Connections · Linear Layer · Byte Pair Encoding · Dropout · Adam · Position-Wise Feed-Forward Layer · Multi-Head Attention · Residual Connection