BatGPT-Chem: A Foundation Large Model For Retrosynthesis Prediction
Yifei Yang, Runhan Shi, Zuchao Li, Shu Jiang, Bao-Liang Lu, Yang Yang,, Hai Zhao

TL;DR
BatGPT-Chem is a large language model designed for retrosynthesis prediction, integrating extensive chemical data and advanced training techniques to improve accuracy, generalization, and zero-shot capabilities in synthetic chemistry analysis.
Contribution
This paper introduces BatGPT-Chem, a 15-billion-parameter model that unifies chemical tasks using natural language and SMILES, advancing AI-driven retrosynthesis prediction.
Findings
Outperforms existing AI methods in retrosynthesis tasks
Demonstrates strong zero-shot prediction capabilities
Achieves significant improvements on benchmark tests
Abstract
Retrosynthesis analysis is pivotal yet challenging in drug discovery and organic chemistry. Despite the proliferation of computational tools over the past decade, AI-based systems often fall short in generalizing across diverse reaction types and exploring alternative synthetic pathways. This paper presents BatGPT-Chem, a large language model with 15 billion parameters, tailored for enhanced retrosynthesis prediction. Integrating chemical tasks via a unified framework of natural language and SMILES notation, this approach synthesizes extensive instructional data from an expansive chemical database. Employing both autoregressive and bidirectional training techniques across over one hundred million instances, BatGPT-Chem captures a broad spectrum of chemical knowledge, enabling precise prediction of reaction conditions and exhibiting strong zero-shot capabilities. Superior to existing AI…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
