MF-LAL: Drug Compound Generation Using Multi-Fidelity Latent Space Active Learning

Peter Eckmann; Dongxia Wu; Germano Heinzelmann; Michael K. Gilson; Rose Yu

arXiv:2410.11226 · cs.LG · June 11, 2025 · 2 cites

TL;DR

MF-LAL is a generative framework that combines multi-fidelity surrogate models with active learning to generate drug compounds with significantly better binding free energy scores than existing methods.

Contribution

MF-LAL integrates the generative model and the multi-fidelity surrogate models into a single framework for drug discovery, which improves the accuracy of activity prediction.

Findings

01. MF-LAL produces compounds with roughly 50% better binding free energy scores than existing methods.

02. The framework outperforms both single-fidelity and multi-fidelity approaches.

03. Active learning effectively guides compound generation.

Abstract

Current generative models for drug discovery primarily use molecular docking as an oracle to guide the generation of active compounds. However, such models are often not useful in practice because even compounds with high docking scores do not consistently show real-world experimental activity. More accurate methods for activity prediction exist, such as molecular dynamics based binding free energy calculations, but they are too computationally expensive to use in a generative model. To address this challenge, we propose Multi-Fidelity Latent space Active Learning (MF-LAL), a generative modeling framework that integrates a set of oracles with varying cost-accuracy tradeoffs. Using active learning, we train a surrogate model for each oracle and use these surrogates to guide generation of compounds with high predicted activity. Unlike previous approaches that separately learn the…
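The loop the abstract describes (one surrogate per oracle, trained by active learning and then used to pick promising candidates) can be illustrated with a generic toy sketch. This is not the authors' implementation: the names `cheap_oracle`, `expensive_oracle`, `RidgeSurrogate`, and `run_active_learning` are hypothetical, random vectors stand in for a learned latent space, analytic functions stand in for docking and binding free energy calculations, and a greedy "query the top-predicted candidates" rule stands in for whatever acquisition strategy MF-LAL actually uses.

```python
import numpy as np


def expensive_oracle(Z):
    """Hypothetical high-fidelity score (stand-in for a costly calculation)."""
    return -np.sum((Z - 0.5) ** 2, axis=-1)


def cheap_oracle(Z):
    """Hypothetical low-fidelity score: the expensive one plus a systematic error."""
    return expensive_oracle(Z) + 0.3 * np.sin(5 * Z).sum(axis=-1)


class RidgeSurrogate:
    """Ridge regression on quadratic features, standing in for a learned surrogate."""

    def __init__(self, lam=1e-3):
        self.lam = lam
        self.w = None

    @staticmethod
    def _features(Z):
        Z = np.atleast_2d(Z)
        return np.hstack([np.ones((len(Z), 1)), Z, Z ** 2])

    def fit(self, Z, y):
        X = self._features(Z)
        A = X.T @ X + self.lam * np.eye(X.shape[1])
        self.w = np.linalg.solve(A, X.T @ y)

    def predict(self, Z):
        return self._features(Z) @ self.w


def run_active_learning(dim=2, rounds=5, n_candidates=200, budget=(20, 3), seed=0):
    """Train one surrogate per fidelity, spending fewer queries on the costly oracle."""
    rng = np.random.default_rng(seed)
    oracles = [cheap_oracle, expensive_oracle]  # ordered from low to high fidelity
    surrogates = [RidgeSurrogate() for _ in oracles]

    # Small initial labelled set for every fidelity level.
    data = []
    for oracle in oracles:
        Z0 = rng.uniform(0.0, 1.0, size=(8, dim))
        data.append((Z0, oracle(Z0)))

    for _ in range(rounds):
        for k, (oracle, surrogate) in enumerate(zip(oracles, surrogates)):
            Z, y = data[k]
            surrogate.fit(Z, y)
            # Greedy acquisition (an assumption): label the candidates that the
            # current surrogate predicts to score highest.
            candidates = rng.uniform(0.0, 1.0, size=(n_candidates, dim))
            order = np.argsort(surrogate.predict(candidates))
            chosen = candidates[order[-budget[k]:]]
            data[k] = (np.vstack([Z, chosen]), np.concatenate([y, oracle(chosen)]))

    return surrogates, data


if __name__ == "__main__":
    surrogates, data = run_active_learning()
    Z_hi, y_hi = data[-1]
    print("best high-fidelity point:", Z_hi[np.argmax(y_hi)], "score:", y_hi.max())
```

The per-fidelity `budget` tuple is the design point worth noting: the cheap oracle is queried many times per round and the expensive one only a few times, which mirrors the cost-accuracy tradeoff the abstract mentions without claiming to reproduce how MF-LAL allocates queries.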

Code & Models

Repositories

rose-stl-lab/mf-lal (PyTorch, official)

Videos

MF-LAL: Drug Compound Generation Using Multi-Fidelity Latent Space Active Learning · SlidesLive

Taxonomy

Topics: Machine Learning and Algorithms

Methods: Sparse Evolutionary Training