Unlocking Potential Binders: Multimodal Pretraining DEL-Fusion for Denoising DNA-Encoded Libraries
Chunbin Gu, Mutian He, Hanqun Cao, Guangyong Chen, Chang-yu Hsieh,, Pheng Ann Heng

TL;DR
This paper introduces MPDF, a multimodal pretraining framework that fuses multi-scale compound features to improve denoising in DNA-encoded library screening, enhancing drug discovery processes.
Contribution
The paper presents a novel multimodal pretraining and fusion approach that captures compound features at multiple levels, significantly improving denoising performance in DEL screening.
Findings
MPDF outperforms existing methods on three DEL datasets.
Enhanced multi-scale feature integration improves denoising accuracy.
Provides new insights into high-affinity molecule identification.
Abstract
In the realm of drug discovery, DNA-encoded library (DEL) screening technology has emerged as an efficient method for identifying high-affinity compounds. However, DEL screening faces a significant challenge: noise arising from nonspecific interactions within complex biological systems. Neural networks trained on DEL libraries have been employed to extract compound features, aiming to denoise the data and uncover potential binders to the desired therapeutic target. Nevertheless, the inherent structure of DEL, constrained by the limited diversity of building blocks, impacts the performance of compound encoders. Moreover, existing methods only capture compound features at a single level, further limiting the effectiveness of the denoising strategy. To mitigate these issues, we propose a Multimodal Pretraining DEL-Fusion model (MPDF) that enhances encoder capabilities through pretraining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBiomedical Text Mining and Ontologies · Natural Language Processing Techniques · Semantic Web and Ontologies
MethodsLib
