Toward Closed-loop Molecular Discovery via Language Model, Property Alignment and Strategic Search
Junkai Ji, Zhangfan Yang, Dong Xu, Ruibin Bai, Jianqiang Li, Tingjun Hou, Zexuan Zhu

TL;DR
This paper introduces Trio, a novel closed-loop molecular design framework combining language modeling, reinforcement learning, and search techniques to generate diverse, valid, and pharmacologically optimized drug candidates more effectively than existing methods.
Contribution
Trio integrates fragment-based language modeling, reinforcement learning, and Monte Carlo tree search to improve interpretability, feasibility, and exploration in molecular generation for drug discovery.
Findings
Achieves +7.85% binding affinity improvement
Enhances drug-likeness by +11.10%
Increases synthetic accessibility by +12.05%
Abstract
Drug discovery is a time-consuming and expensive process, with traditional high-throughput and docking-based virtual screening hampered by low success rates and limited scalability. Recent advances in generative modelling, including autoregressive, diffusion, and flow-based approaches, have enabled de novo ligand design beyond the limits of enumerative screening. Yet these models often suffer from inadequate generalization, limited interpretability, and an overemphasis on binding affinity at the expense of key pharmacological properties, thereby restricting their translational utility. Here we present Trio, a molecular generation framework integrating fragment-based molecular language modeling, reinforcement learning, and Monte Carlo tree search, for effective and interpretable closed-loop targeted molecular design. Through the three key components, Trio enables context-aware fragment…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Protein Degradation and Inhibitors · Chemical Synthesis and Analysis
