DeepRetro: Retrosynthetic Pathway Discovery using Iterative LLM Reasoning
Shreyas Vinaya Sathyanarayana, Sharanabasava D. Hiremath, Rahil Shah, Rishikesh Panda, Rahul Jana, Riya Singh, Rida Irfan, Ashwin Murali, Bharath Ramsundar

TL;DR
DeepRetro is an innovative open-source framework that combines large language models, traditional retrosynthetic tools, and expert feedback to discover complex natural product synthesis pathways, advancing computational organic chemistry.
Contribution
It introduces a hybrid, iterative system integrating LLMs with chemical validity checks and human feedback for improved retrosynthesis of complex molecules.
Findings
Achieves strong performance on standard benchmarks.
Proposes novel pathways for complex natural products.
Facilitates human-machine collaboration in synthesis planning.
Abstract
The synthesis of complex natural products remains one of the grand challenges of organic chemistry. We present DeepRetro, a major advancement in computational retrosynthesis that enables the discovery of viable synthetic routes for complex molecules typically considered beyond the reach of existing retrosynthetic methods. DeepRetro is a novel, open-source framework that tightly integrates large language models (LLMs), traditional retrosynthetic engines, and expert human feedback in an iterative design loop. Prior approaches rely solely on template-based methods or unconstrained LLM outputs. In contrast, DeepRetro combines the precision of template-based methods with the generative flexibility of LLMs, controlled by rigorous chemical validity checks and enhanced by recursive refinement. This hybrid system dynamically explores and revises synthetic pathways, guided by both algorithmic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Semantic Web and Ontologies · Biomedical Text Mining and Ontologies
