FragmentFlow: Scalable Transition State Generation for Large Molecules
Ron Shprints, Peter Holderrieth, Juno Nam, Rafael G\'omez-Bombarelli, Tommi Jaakkola

TL;DR
FragmentFlow introduces a divide-and-conquer generative approach for predicting transition state geometries in large molecules, effectively addressing size-related distribution shifts and enabling scalable, high-throughput reactivity analysis.
Contribution
It presents FragmentFlow, a novel method that predicts reactive core transition states and reconstructs full geometries, improving scalability and accuracy for large molecules.
Findings
Correctly identifies 90% of transition states in large molecules.
Requires 30% fewer saddle-point optimization steps than classical methods.
Operates effectively on molecules with up to 33 heavy atoms.
Abstract
Transition states (TSs) are central to understanding and quantitatively predicting chemical reactivity and reaction mechanisms. Although traditional TS generation methods are computationally expensive, recent generative modeling approaches have enabled chemically meaningful TS prediction for relatively small molecules. However, these methods fail to generalize to practically relevant reaction substrates because of distribution shifts induced by increasing molecular sizes. Furthermore, TS geometries for larger molecules are not available at scale, making it infeasible to train generative models from scratch on such molecules. To address these challenges, we introduce FragmentFlow: a divide-and-conquer approach that trains a generative model to predict TS geometries for the reactive core atoms, which define the reaction mechanism. The full TS structure is then reconstructed by…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Cyclization and Aryne Chemistry · Computational Drug Discovery Methods
