Multimodal Large Language Models for Inverse Molecular Design with Retrosynthetic Planning
Gang Liu, Michael Sun, Wojciech Matusik, Meng Jiang, Jie Chen

TL;DR
Llamole is a novel multimodal large language model that interleaves text and graph generation, enabling advanced molecular inverse design and retrosynthetic planning, outperforming existing models in controllability and planning accuracy.
Contribution
This paper introduces Llamole, the first multimodal LLM capable of integrated text and graph generation for molecular design and retrosynthesis, combining neural networks with search algorithms.
Findings
Llamole outperforms 14 adapted LLMs across 12 metrics.
Llamole enables flexible, controllable molecular generation.
Effective integration of graph neural networks with LLMs for chemical applications.
Abstract
While large language models (LLMs) have integrated images, adapting them to graphs remains challenging, limiting their applications in materials and drug design. This difficulty stems from the need for coherent autoregressive generation across texts and graphs. To address this, we introduce Llamole, the first multimodal LLM capable of interleaved text and graph generation, enabling molecular inverse design with retrosynthetic planning. Llamole integrates a base LLM with the Graph Diffusion Transformer and Graph Neural Networks for multi-conditional molecular generation and reaction inference within texts, while the LLM, with enhanced molecular understanding, flexibly controls activation among the different graph modules. Additionally, Llamole integrates A* search with LLM-based cost functions for efficient retrosynthetic planning. We create benchmarking datasets and conduct extensive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
- 🤗liuganghuggingface/Llamole-Pretrained-GraphDiTmodel· 1 dl1 dl
- 🤗liuganghuggingface/Llamole-Pretrained-GraphEncodermodel· 3 dl3 dl
- 🤗liuganghuggingface/Llamole-Pretrained-GNNPredictormodel· 1 dl1 dl
- 🤗liuganghuggingface/Llamole-Llama-3.1-8B-Instruct-Adaptermodel· 279 dl279 dl
- 🤗liuganghuggingface/Llamole-Qwen2-7B-Instruct-Adaptermodel· 5 dl· ♡ 15 dl♡ 1
- 🤗liuganghuggingface/Llamole-Mistral-7B-Instruct-v0.3-Adaptermodel· 3 dl· ♡ 13 dl♡ 1
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Chemical Synthesis and Analysis
MethodsAttention Is All You Need · Dense Connections · Adam · Linear Layer · Residual Connection · Position-Wise Feed-Forward Layer · Label Smoothing · Dropout · Byte Pair Encoding · Absolute Position Encodings
