Integrating Chemical Language and Molecular Graph in Multimodal Fused Deep Learning for Drug Property Prediction
Xiaohua Lu, Liangxu Xie, Lei Xu, Rongzhi Mao, Shan Chang, Xiaojun Xu

TL;DR
This paper introduces a multimodal deep learning approach combining chemical language and molecular graphs to improve drug property prediction, demonstrating enhanced accuracy, robustness, and generalization over mono-modal methods.
Contribution
The study presents a novel triple-modal fusion framework integrating SMILES, ECFP, and molecular graphs with multiple fusion strategies, advancing molecular property prediction capabilities.
Findings
Outperforms mono-modal models in accuracy and noise resistance
Demonstrates strong generalization in protein-ligand binding prediction
Uses diverse data sources to enhance model robustness
Abstract
Accurately predicting molecular properties is a challenging but essential task in drug discovery. Recently, many mono-modal deep learning methods have been successfully applied to molecular property prediction. However, the inherent limitation of mono-modal learning arises from relying solely on one modality of molecular representation, which restricts a comprehensive understanding of drug molecules and hampers their resilience against data noise. To overcome the limitations, we construct multimodal deep learning models to cover different molecular representations. We convert drug molecules into three molecular representations, SMILES-encoded vectors, ECFP fingerprints, and molecular graphs. To process the modal information, Transformer-Encoder, bi-directional gated recurrent units (BiGRU), and graph convolutional network (GCN) are utilized for feature learning respectively, which can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Chemical Synthesis and Analysis
MethodsSparse Evolutionary Training
