Hashing based Contrastive Learning for Virtual Screening

Jin Han; Yun Hong; Wu-Jun Li

arXiv:2407.19790·cs.AI·July 30, 2024

Hashing based Contrastive Learning for Virtual Screening

Jin Han, Yun Hong, Wu-Jun Li

PDF

Open Access 1 Video

TL;DR

This paper introduces DrugHash, a hashing-based contrastive learning method for virtual screening that uses binary hash codes to significantly reduce memory and computation costs while achieving state-of-the-art accuracy.

Contribution

DrugHash is the first to apply end-to-end binary hashing contrastive learning for virtual screening, improving efficiency and accuracy over existing real-valued vector methods.

Findings

01

Achieves state-of-the-art accuracy in virtual screening.

02

Reduces memory usage by 32 times.

03

Speeds up retrieval by 3.5 times.

Abstract

Virtual screening (VS) is a critical step in computer-aided drug discovery, aiming to identify molecules that bind to a specific target receptor like protein. Traditional VS methods, such as docking, are often too time-consuming for screening large-scale molecular databases. Recent advances in deep learning have demonstrated that learning vector representations for both proteins and molecules using contrastive learning can outperform traditional docking methods. However, given that target databases often contain billions of molecules, real-valued vector representations adopted by existing methods can still incur significant memory and time costs in VS. To address this problem, in this paper we propose a hashing-based contrastive learning method, called DrugHash, for VS. DrugHash treats VS as a retrieval task that uses efficient binary hash codes for retrieval. In particular, DrugHash…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Hashing Based Contrastive Learning for Virtual Screening· underline

Taxonomy

TopicsAdvanced Image and Video Retrieval Techniques · AI in cancer detection · Advanced Data Compression Techniques

MethodsContrastive Learning · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings