TL;DR
ChemAgent introduces a self-updating library framework that enhances large language models' chemical reasoning by decomposing tasks, referencing structured memory, and refining solutions over time, leading to significant performance improvements.
Contribution
The paper presents ChemAgent, a novel self-updating library system that improves LLM chemical reasoning through dynamic memory and task decomposition, outperforming existing methods.
Findings
Achieves up to 46% performance improvement on chemical reasoning datasets
Demonstrates effective task decomposition and memory referencing in LLMs
Shows potential for applications in drug discovery and materials science
Abstract
Chemical reasoning usually involves complex, multi-step processes that demand precise calculations, where even minor errors can lead to cascading failures. Furthermore, large language models (LLMs) encounter difficulties handling domain-specific formulas, executing reasoning steps accurately, and integrating code effectively when tackling chemical reasoning tasks. To address these challenges, we present ChemAgent, a novel framework designed to improve the performance of LLMs through a dynamic, self-updating library. This library is developed by decomposing chemical tasks into sub-tasks and compiling these sub-tasks into a structured collection that can be referenced for future queries. Then, when presented with a new problem, ChemAgent retrieves and refines pertinent information from the library, which we call memory, facilitating effective task decomposition and the generation of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsLib
