Empowering LLMs for Structure-Based Drug Design via Exploration-Augmented Latent Inference
Xuanning Hu, Anchen Li, Qianli Xing, Jinglong Ji, Hao Tuo, Bo Yang

TL;DR
This paper introduces ELILLM, a novel framework that enhances large language models for structure-based drug design by systematic exploration of latent spaces guided by Bayesian optimization and surrogate models, leading to improved molecule generation.
Contribution
ELILLM reinterprets LLM generation as a latent space exploration process, integrating Bayesian optimization and knowledge-guided decoding to improve drug design capabilities.
Findings
ELILLM achieves higher binding affinity scores than baseline methods.
The framework enables controlled exploration of chemical space.
ELILLM generates chemically valid and synthetically feasible molecules.
Abstract
Large Language Models (LLMs) possess strong representation and reasoning capabilities, but their application to structure-based drug design (SBDD) is limited by insufficient understanding of protein structures and unpredictable molecular generation. To address these challenges, we propose Exploration-Augmented Latent Inference for LLMs (ELILLM), a framework that reinterprets the LLM generation process as an encoding, latent space exploration, and decoding workflow. ELILLM explicitly explores portions of the design problem beyond the model's current knowledge while using a decoding module to handle familiar regions, generating chemically valid and synthetically reasonable molecules. In our implementation, Bayesian optimization guides the systematic exploration of latent embeddings, and a position-aware surrogate model efficiently predicts binding affinity distributions to inform the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Protein Structure and Dynamics
