A Molecular Multimodal Foundation Model Associating Molecule Graphs with Natural Language
Bing Su, Dazhao Du, Zhao Yang, Yujie Zhou, Jiangmeng Li, Anyi Rao, Hao, Sun, Zhiwu Lu, Ji-Rong Wen

TL;DR
This paper introduces a multimodal AI model that links molecular graphs with natural language, improving understanding and prediction of molecular properties by leveraging both visual and textual data.
Contribution
It presents a novel contrastive learning framework that jointly pretrains on molecular graphs and related texts, bridging modalities for enhanced molecular understanding.
Findings
Improved performance in cross-modal retrieval and molecule captioning
Enhanced molecular property prediction accuracy
Capability to generate molecular graphs from natural language
Abstract
Although artificial intelligence (AI) has made significant progress in understanding molecules in a wide range of fields, existing models generally acquire the single cognitive ability from the single molecular modality. Since the hierarchy of molecular knowledge is profound, even humans learn from different modalities including both intuitive diagrams and professional texts to assist their understanding. Inspired by this, we propose a molecular multimodal foundation model which is pretrained from molecular graphs and their semantically related textual data (crawled from published Scientific Citation Index papers) via contrastive learning. This AI model represents a critical attempt that directly bridges molecular graphs and natural language. Importantly, through capturing the specific and complementary information of the two modalities, our proposed model can better grasp molecular…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Biomedical Text Mining and Ontologies
