ChemATP: A Training-Free Chemical Reasoning Framework for Large Language Models
Mingxu Zhang, Dazhong Shen, Qi Zhang, Ying Sun

TL;DR
ChemATP introduces a training-free framework that enhances large language models' chemical reasoning by explicitly retrieving atom-level knowledge, improving accuracy and interpretability without retraining.
Contribution
It constructs the first atom-level textual knowledge base enabling frozen LLMs to explicitly retrieve chemical priors, addressing limitations of surface-level prompting.
Findings
ChemATP outperforms existing training-free methods.
ChemATP rivals state-of-the-art training-based models.
The framework improves chemical reasoning accuracy.
Abstract
Large Language Models (LLMs) exhibit strong general reasoning but struggle in molecular science due to the lack of explicit chemical priors in standard string representations. Current solutions face a fundamental dilemma. Training-based methods inject priors into parameters, but this static coupling hinders rapid knowledge updates and often compromises the model's general reasoning capabilities. Conversely, existing training-free methods avoid these issues but rely on surface-level prompting, failing to provide the fine-grained atom-level priors essential for precise chemical reasoning. To address this issue, we introduce ChemATP, a framework that decouples chemical knowledge from the reasoning engine. By constructing the first atom-level textual knowledge base, ChemATP enables frozen LLMs to explicitly retrieve and reason over this information dynamically. This architecture ensures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Topic Modeling
