Entropy-Reinforced Planning with Large Language Models for Drug   Discovery

Xuefeng Liu; Chih-chan Tien; Peng Ding; Songhao Jiang; Rick L. Stevens

arXiv:2406.07025·cs.LG·April 1, 2025·2 cites

Entropy-Reinforced Planning with Large Language Models for Drug Discovery

Xuefeng Liu, Chih-chan Tien, Peng Ding, Songhao Jiang, Rick L. Stevens

PDF

Open Access 1 Repo

TL;DR

This paper introduces ERP, an entropy-reinforced planning method for Transformer decoding in drug discovery, which improves molecule generation quality by balancing exploration and exploitation, outperforming existing methods across multiple benchmarks.

Contribution

ERP is a novel entropy-reinforced planning algorithm that enhances Transformer decoding for molecule generation, achieving consistent improvements over state-of-the-art methods.

Findings

01

ERP outperforms current state-of-the-art algorithms by 1-5% in drug discovery benchmarks.

02

ERP demonstrates robustness across different Transformer models and objectives.

03

ERP also surpasses existing methods in code generation benchmarks.

Abstract

The objective of drug discovery is to identify chemical compounds that possess specific pharmaceutical properties toward a binding target. Existing large language models (LLMS) can achieve high token matching scores in terms of likelihood for molecule generation. However, relying solely on LLM decoding often results in the generation of molecules that are either invalid due to a single misused token, or suboptimal due to unbalanced exploration and exploitation as a consequence of the LLMs prior experience. Here we propose ERP, Entropy-Reinforced Planning for Transformer Decoding, which employs an entropy-reinforced planning algorithm to enhance the Transformer decoding process and strike a balance between exploitation and exploration. ERP aims to achieve improvements in multiple properties compared to direct sampling from the Transformer. We evaluated ERP on the SARS-CoV-2 virus…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

xuefeng-cs/erp
jaxOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputational Drug Discovery Methods

MethodsAttention Is All You Need · Softmax · Layer Normalization · Linear Layer · Byte Pair Encoding · Label Smoothing · Adam · Residual Connection · Multi-Head Attention · Position-Wise Feed-Forward Layer