A 3D-Shape Similarity-based Contrastive Approach to Molecular   Representation Learning

Austin Atsango; Nathaniel L. Diamant; Ziqing Lu; Tommaso Biancalani,; Gabriele Scalia; Kangway V. Chuang

arXiv:2211.02130·cs.LG·November 7, 2022·1 cites

A 3D-Shape Similarity-based Contrastive Approach to Molecular Representation Learning

Austin Atsango, Nathaniel L. Diamant, Ziqing Lu, Tommaso Biancalani,, Gabriele Scalia, Kangway V. Chuang

PDF

Open Access

TL;DR

This paper introduces MolCLaSS, a contrastive learning method for graph neural networks that captures 3D molecular shape information indirectly through similarity matching, improving molecular property prediction.

Contribution

It presents a novel contrastive learning framework that encodes 3D shape similarity without directly modeling 3D poses, enhancing molecular representations.

Findings

01

MolCLaSS effectively captures 3D shape features.

02

It improves scaffold hopping capabilities.

03

The method outperforms traditional 2D-based models.

Abstract

Molecular shape and geometry dictate key biophysical recognition processes, yet many graph neural networks disregard 3D information for molecular property prediction. Here, we propose a new contrastive-learning procedure for graph neural networks, Molecular Contrastive Learning from Shape Similarity (MolCLaSS), that implicitly learns a three-dimensional representation. Rather than directly encoding or targeting three-dimensional poses, MolCLaSS matches a similarity objective based on Gaussian overlays to learn a meaningful representation of molecular shape. We demonstrate how this framework naturally captures key aspects of three-dimensionality that two-dimensional representations cannot and provides an inductive framework for scaffold hopping.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Chemical Synthesis and Analysis

MethodsContrastive Learning