BatmanNet: Bi-branch Masked Graph Transformer Autoencoder for Molecular   Representation

Zhen Wang; Zheng Feng; Yanjun Li; Bowen Li; Yongrui Wang; Chulin Sha,; Min He; Xiaolin Li

arXiv:2211.13979·cs.LG·November 7, 2023

BatmanNet: Bi-branch Masked Graph Transformer Autoencoder for Molecular Representation

Zhen Wang, Zheng Feng, Yanjun Li, Bowen Li, Yongrui Wang, Chulin Sha,, Min He, Xiaolin Li

PDF

1 Repo

TL;DR

BatmanNet is a novel bi-branch masked graph transformer autoencoder that effectively learns molecular representations by reconstructing missing nodes and edges, leading to state-of-the-art results in drug discovery tasks.

Contribution

The paper introduces BatmanNet, a simple yet effective self-supervised model with dual autoencoders for local and global molecular structure learning, improving over complex existing methods.

Findings

01

Achieves state-of-the-art results on 13 benchmark datasets.

02

Effectively captures molecular structure and semantic information.

03

Improves performance in multiple drug discovery tasks.

Abstract

Although substantial efforts have been made using graph neural networks (GNNs) for AI-driven drug discovery (AIDD), effective molecular representation learning remains an open challenge, especially in the case of insufficient labeled molecules. Recent studies suggest that big GNN models pre-trained by self-supervised learning on unlabeled datasets enable better transfer performance in downstream molecular property prediction tasks. However, the approaches in these studies require multiple complex self-supervised tasks and large-scale datasets, which are time-consuming, computationally expensive, and difficult to pre-train end-to-end. Here, we design a simple yet effective self-supervised strategy to simultaneously learn local and global information about molecules, and further propose a novel bi-branch masked graph transformer autoencoder (BatmanNet) to learn molecular representations.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

wz-create/batmannet
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsMulti-Head Attention · Attention Is All You Need · Laplacian EigenMap · Layer Normalization · Laplacian Positional Encodings · Adam · Softmax · Dropout · Byte Pair Encoding · Position-Wise Feed-Forward Layer