ChemMLLM: Chemical Multimodal Large Language Model

Qian Tan; Dongzhan Zhou; Peng Xia; Wanhao Liu; Wanli Ouyang; Lei Bai; Yuqiang Li; Tianfan Fu

arXiv:2505.16326·cs.LG·August 5, 2025

ChemMLLM: Chemical Multimodal Large Language Model

Qian Tan, Dongzhan Zhou, Peng Xia, Wanhao Liu, Wanli Ouyang, Lei Bai, Yuqiang Li, Tianfan Fu

PDF

Open Access 1 Repo

TL;DR

ChemMLLM is a novel multimodal large language model tailored for chemical molecule understanding and generation, integrating text, SMILES, and images, and demonstrating superior performance on curated chemical tasks.

Contribution

The paper introduces ChemMLLM, the first unified chemical multimodal LLM, with designed tasks and datasets, advancing cross-modal understanding and generation in chemistry.

Findings

01

ChemMLLM outperforms baseline models on all tasks.

02

Achieves 116.75% improvement in molecule image optimization.

03

Code is publicly available for reproducibility.

Abstract

Multimodal large language models (MLLMs) have made impressive progress in many applications in recent years. However, chemical MLLMs that can handle cross-modal understanding and generation remain underexplored. To fill this gap, we propose ChemMLLM, a unified chemical multimodal large language model for molecule understanding and generation. Also, we design five multimodal tasks across text, molecular SMILES strings, and image, and curate the datasets. We benchmark ChemMLLM against a range of general leading MLLMs and Chemical LLMs on these tasks. Experimental results show that ChemMLLM achieves superior performance across all evaluated tasks. For example, in molecule image optimization task, ChemMLLM outperforms the best baseline (GPT-4o) by 116.75\% (4.27 vs 1.97 property improvement). The code is publicly available at https://github.com/bbsbz/ChemMLLM.git.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bbsbz/chemmllm
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Topic Modeling