Models Matter: The Impact of Single-Step Retrosynthesis on Synthesis Planning
Paula Torren-Peraire, Alan Kai Hassen, Samuel Genheden, Jonas, Verhoeven, Djork-Arne Clevert, Mike Preuss, Igor Tetko

TL;DR
This paper investigates how the choice of single-step retrosynthesis models affects multi-step synthesis planning, revealing that model performance on benchmarks does not always translate to better synthesis routes and emphasizing the importance of integrated evaluation.
Contribution
It introduces a combined approach of using multiple single-step models within multi-step planning and analyzes their impact, highlighting the need for evaluation within synthesis planning contexts.
Findings
High single-step performance does not guarantee successful route finding.
Using different single-step models can increase synthesis success rate by up to 28%.
Single-step models produce unique, diverse synthesis routes.
Abstract
Retrosynthesis consists of breaking down a chemical compound recursively step-by-step into molecular precursors until a set of commercially available molecules is found with the goal to provide a synthesis route. Its two primary research directions, single-step retrosynthesis prediction, which models the chemical reaction logic, and multi-step synthesis planning, which tries to find the correct sequence of reactions, are inherently intertwined. Still, this connection is not reflected in contemporary research. In this work, we combine these two major research directions by applying multiple single-step retrosynthesis models within multi-step synthesis planning and analyzing their impact using public and proprietary reaction data. We find a disconnection between high single-step performance and potential route-finding success, suggesting that single-step models must be evaluated within…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Chemical Synthesis and Analysis
