STRIDE: Structure-guided Generation for Inverse Design of Molecules
Shehtab Zaman, Denis Akhiyarov, Mauricio Araya-Polo, Kenneth Chiu

TL;DR
STRIDE is a novel molecule generation method that guides the creation of new molecules using known structures without retraining, effectively addressing data scarcity in industrial molecule design.
Contribution
It introduces a structure-guided generative workflow that produces novel molecules from limited data without retraining, applicable to small datasets.
Findings
Generated molecules have 21.7% lower synthetic accessibility scores.
Ionization potential is reduced by 5.9% in guided molecules.
Method works without retraining on small, specialized datasets.
Abstract
Machine learning and especially deep learning has had an increasing impact on molecule and materials design. In particular, given the growing access to an abundance of high-quality small molecule data for generative modeling for drug design, results for drug discovery have been promising. However, for many important classes of materials such as catalysts, antioxidants, and metal-organic frameworks, such large datasets are not available. Such families of molecules with limited samples and structural similarities are especially prevalent for industrial applications. As is well-known, retraining and even fine-tuning are challenging on such small datasets. Novel, practically applicable molecules are most often derivatives of well-known molecules, suggesting approaches to addressing data scarcity. To address this problem, we introduce , a generative molecule workflow that…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Catalysis and Oxidation Reactions
MethodsSparse Evolutionary Training
