ChemAgent: Self-updating Library in Large Language Models Improves   Chemical Reasoning

Xiangru Tang; Tianyu Hu; Muyang Ye; Yanjun Shao; Xunjian Yin; Siru; Ouyang; Wangchunshu Zhou; Pan Lu; Zhuosheng Zhang; Yilun Zhao; Arman Cohan,; Mark Gerstein

arXiv:2501.06590·cs.CL·January 14, 2025

ChemAgent: Self-updating Library in Large Language Models Improves Chemical Reasoning

Xiangru Tang, Tianyu Hu, Muyang Ye, Yanjun Shao, Xunjian Yin, Siru, Ouyang, Wangchunshu Zhou, Pan Lu, Zhuosheng Zhang, Yilun Zhao, Arman Cohan,, Mark Gerstein

PDF

1 Repo

TL;DR

ChemAgent introduces a self-updating library framework that enhances large language models' chemical reasoning by decomposing tasks, referencing structured memory, and refining solutions over time, leading to significant performance improvements.

Contribution

The paper presents ChemAgent, a novel self-updating library system that improves LLM chemical reasoning through dynamic memory and task decomposition, outperforming existing methods.

Findings

01

Achieves up to 46% performance improvement on chemical reasoning datasets

02

Demonstrates effective task decomposition and memory referencing in LLMs

03

Shows potential for applications in drug discovery and materials science

Abstract

Chemical reasoning usually involves complex, multi-step processes that demand precise calculations, where even minor errors can lead to cascading failures. Furthermore, large language models (LLMs) encounter difficulties handling domain-specific formulas, executing reasoning steps accurately, and integrating code effectively when tackling chemical reasoning tasks. To address these challenges, we present ChemAgent, a novel framework designed to improve the performance of LLMs through a dynamic, self-updating library. This library is developed by decomposing chemical tasks into sub-tasks and compiling these sub-tasks into a structured collection that can be referenced for future queries. Then, when presented with a new problem, ChemAgent retrieves and refines pertinent information from the library, which we call memory, facilitating effective task decomposition and the generation of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gersteinlab/chemagent
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsLib