A Collaborative Framework Integrating Large Language Model and Chemical Fragment Space: Mutual Inspiration for Lead Design
Hao Tuo, Yan Li, Xuanning Hu, Haishi Zhao, Xueyan Liu, Bo Yang

TL;DR
AutoLeadDesign is a novel framework that combines large language models and chemical fragment space to enhance the exploration and design of lead compounds in drug discovery, outperforming baseline methods and demonstrating clinical relevance.
Contribution
The paper introduces AutoLeadDesign, a new AI-driven framework that integrates domain knowledge from language models with chemical fragments for efficient lead compound design.
Findings
AutoLeadDesign outperforms baseline methods in experiments.
Successfully designed lead compounds for PRMT5 and SARS-CoV-2 PLpro.
Generated compounds show mechanism-validated inhibitory patterns.
Abstract
Combinatorial optimization algorithm is essential in computer-aided drug design by progressively exploring chemical space to design lead compounds with high affinity to target protein. However current methods face inherent challenges in integrating domain knowledge, limiting their performance in identifying lead compounds with novel and valid binding mode. Here, we propose AutoLeadDesign, a lead compounds design framework that inspires extensive domain knowledge encoded in large language models with chemical fragments to progressively implement efficient exploration of vast chemical space. The comprehensive experiments indicate that AutoLeadDesign outperforms baseline methods. Significantly, empirical lead design campaigns targeting two clinically relevant targets (PRMT5 and SARS-CoV-2 PLpro) demonstrate AutoLeadDesign's competence in de novo generation of lead compounds achieving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSemantic Web and Ontologies · Scientific Computing and Data Management
