Designing the Haystack: Programmable Chemical Space for Generative Molecular Discovery
Yuchen Zhu, Donghai Zhao, Yangyang Zhang, Yitong Li, Xiaorui Wang, Shuwang Li, Yue Kong, Beichen Zhang, Ricki Chen, Chang Liu, Xingcai Zhang, Tingjun Hou, Chang-Yu Hsieh

TL;DR
This paper introduces SpaceGFN, a novel framework that allows explicit design and adaptive exploration of structured chemical spaces for molecular discovery, enabling more controlled and efficient drug development processes.
Contribution
It presents a programmable chemical space model combined with flow-based sampling and reaction-aware editing, advancing the deliberate design and optimization of molecules.
Findings
Programmable space design enables natural product-like architectures.
Evolution-inspired space improves metabolic and toxicological profiles.
Effective lead optimization under synthetic constraints across multiple drug targets.
Abstract
Chemical space exploration underlies drug discovery, yet most generative models treat chemical space as a fixed, implicitly learned distribution, focusing on sampling molecules rather than deliberately designing the space itself. We introduce SpaceGFN, a generative framework that elevates chemical space to a programmable computational object: a controllable degree of freedom enabling explicit construction and adaptive traversal of structured molecular universes. SpaceGFN decouples space definition from exploration. Users specify building blocks and reaction rules to construct chemically and synthetically coherent spaces, while a GFlowNet performs efficient, property-biased sampling within them. In Discovery mode, we demonstrate programmable space design through two strategies. A pseudo-natural product space assembles natural product-like architectures. An evolution-inspired (Evo) space…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Innovative Microfluidic and Catalytic Techniques Innovation
