MF-LAL: Drug Compound Generation Using Multi-Fidelity Latent Space Active Learning
Peter Eckmann, Dongxia Wu, Germano Heinzelmann, Michael K. Gilson, Rose Yu

TL;DR
MF-LAL is a novel generative framework that combines multi-fidelity surrogate models with active learning to generate more accurate and effective drug compounds, significantly improving binding free energy scores over existing methods.
Contribution
It introduces a unified framework integrating generative and multi-fidelity surrogate models for drug discovery, enhancing activity prediction accuracy.
Findings
MF-LAL produces compounds with ~50% better binding free energy scores.
The framework outperforms single and multi-fidelity approaches.
It effectively guides compound generation using active learning.
Abstract
Current generative models for drug discovery primarily use molecular docking as an oracle to guide the generation of active compounds. However, such models are often not useful in practice because even compounds with high docking scores do not consistently show real-world experimental activity. More accurate methods for activity prediction exist, such as molecular dynamics based binding free energy calculations, but they are too computationally expensive to use in a generative model. To address this challenge, we propose Multi-Fidelity Latent space Active Learning (MF-LAL), a generative modeling framework that integrates a set of oracles with varying cost-accuracy tradeoffs. Using active learning, we train a surrogate model for each oracle and use these surrogates to guide generation of compounds with high predicted activity. Unlike previous approaches that separately learn the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsMachine Learning and Algorithms
MethodsSparse Evolutionary Training
