Improving Targeted Molecule Generation through Language Model Fine-Tuning Via Reinforcement Learning

Salma J. Ahmed; Emad A. Mohammed

arXiv:2405.06836·q-bio.BM·May 20, 2025

Improving Targeted Molecule Generation through Language Model Fine-Tuning Via Reinforcement Learning

Salma J. Ahmed, Emad A. Mohammed

PDF

Open Access

TL;DR

This paper presents a novel drug design approach that fine-tunes language models with reinforcement learning to generate targeted, valid, and novel molecules for specific proteins, improving key chemical properties.

Contribution

It introduces a reinforcement learning framework with a composite reward for targeted molecule generation using language models, enhancing drug-likeness and interaction efficacy.

Findings

01

Achieved 65.37 QED score indicating high drug-likeness

02

Generated molecules with 0.041% non-novel compounds

03

Improved molecular validity and target interaction metrics

Abstract

Developing new drugs is laborious and costly, demanding extensive time investment. In this paper, we introduce a de-novo drug design strategy, which harnesses the capabilities of language models to devise targeted drugs for specific proteins. Employing a Reinforcement Learning (RL) framework utilizing Proximal Policy Optimization (PPO), we refine the model to acquire a policy for generating drugs tailored to protein targets. The proposed method integrates a composite reward function, combining considerations of drug-target interaction and molecular validity. Following RL fine-tuning, the proposed method demonstrates promising outcomes, yielding notable improvements in molecular validity, interaction efficacy, and critical chemical properties, achieving 65.37 for Quantitative Estimation of Drug-likeness (QED), 321.55 for Molecular Weight (MW), and 4.47 for Octanol-Water Partition…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Materials Science · Chemical Synthesis and Analysis