AgentDrug: Utilizing Large Language Models in An Agentic Workflow for Zero-Shot Molecular Editing
Khiem Le, Ting Hua, Nitesh V. Chawla

TL;DR
AgentDrug introduces a structured, agentic workflow utilizing large language models with feedback loops for zero-shot molecular editing, significantly improving accuracy in property optimization tasks in drug discovery.
Contribution
This work presents a novel agentic framework that combines LLMs with cheminformatics feedback loops for enhanced molecular editing accuracy.
Findings
AgentDrug achieves over 20% accuracy improvement on single-property tasks.
The method outperforms previous approaches on multi-property optimization.
Larger models further enhance editing accuracy across benchmarks.
Abstract
Molecular editing-modifying a given molecule to improve desired properties-is a fundamental task in drug discovery. While LLMs hold the potential to solve this task using natural language to drive the editing, straightforward prompting achieves limited accuracy. In this work, we propose AgentDrug, an agentic workflow that leverages LLMs in a structured refinement process to achieve significantly higher accuracy. AgentDrug defines a nested refinement loop: the inner loop uses feedback from cheminformatics toolkits to validate molecular structures, while the outer loop guides the LLM with generic feedback and a gradient-based objective to steer the molecule toward property improvement. We evaluate AgentDrug on benchmarks with both single- and multi-property editing under loose and strict thresholds. Results demonstrate significant performance gains over previous methods. With Qwen-2.5-3B,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods
