ChemBO: Bayesian Optimization of Small Organic Molecules with Synthesizable Recommendations
Ksenia Korovina, Sailun Xu, Kirthevasan Kandasamy, Willie Neiswanger,, Barnabas Poczos, Jeff Schneider, Eric P. Xing

TL;DR
ChemBO is a Bayesian optimization framework that efficiently generates synthesizable organic molecules with desired properties, incorporating novel graph-aware kernels for improved molecular similarity assessment.
Contribution
It introduces a sample-efficient Bayesian optimization method with a new optimal-transport based kernel that explicitly considers molecular graph structures.
Findings
ChemBO outperforms existing methods in molecular property optimization
The new kernel improves similarity assessment for molecular graphs
ChemBO produces synthesizable candidate molecules effectively
Abstract
In applications such as molecule design or drug discovery, it is desirable to have an algorithm which recommends new candidate molecules based on the results of past tests. These molecules first need to be synthesized and then tested for objective properties. We describe ChemBO, a Bayesian optimization framework for generating and optimizing organic molecules for desired molecular properties. While most existing data-driven methods for this problem do not account for sample efficiency or fail to enforce realistic constraints on synthesizability, our approach explores the synthesis graph in a sample-efficient way and produces synthesizable candidates. We implement ChemBO as a Gaussian process model and explore existing molecular kernels for it. Moreover, we propose a novel optimal-transport based distance and kernel that accounts for graphical information explicitly. In our experiments,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Process Optimization and Integration
MethodsGaussian Process
