Diversity-Aware Reinforcement Learning for de novo Drug Design

Hampus Gummesson Svensson; Christian Tyrchan; Ola Engkvist; Morteza Haghir Chehreghani

arXiv:2410.10431·cs.LG·November 12, 2025·2 cites

Diversity-Aware Reinforcement Learning for de novo Drug Design

Hampus Gummesson Svensson, Christian Tyrchan, Ola Engkvist, Morteza Haghir Chehreghani

PDF

Open Access

TL;DR

This paper explores how different adaptive reward update mechanisms in reinforcement learning can enhance the diversity of generated drug molecules, addressing the risk of local optima in drug design models.

Contribution

It systematically investigates various intrinsic motivation and penalty strategies for reward functions to improve molecular diversity in drug generation.

Findings

01

Combining structure- and prediction-based methods improves diversity.

02

Adaptive reward mechanisms influence the diversity of generated molecules.

03

Diverse reward strategies lead to more promising drug candidates.

Abstract

Fine-tuning a pre-trained generative model has demonstrated good performance in generating promising drug molecules. The fine-tuning task is often formulated as a reinforcement learning problem, where previous methods efficiently learn to optimize a reward function to generate potential drug molecules. Nevertheless, in the absence of an adaptive update mechanism for the reward function, the optimization process can become stuck in local optima. The efficacy of the optimal molecule in a local optimization may not translate to usefulness in the subsequent drug optimization process or as a potential standalone clinical candidate. Therefore, it is important to generate a diverse set of promising molecules. Prior work has modified the reward function by penalizing structurally similar molecules, primarily focusing on finding molecules with higher rewards. To date, no study has…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods · Evolutionary Algorithms and Applications · Advanced Multi-Objective Optimization Algorithms

MethodsSparse Evolutionary Training