ChemCrow: Augmenting large-language models with chemistry tools
Andres M Bran, Sam Cox, Oliver Schilter, Carlo Baldassari, Andrew D, White, Philippe Schwaller

TL;DR
ChemCrow is a novel system that integrates chemistry tools with large-language models, enabling autonomous planning and execution of complex chemical tasks such as synthesis and discovery, thus enhancing AI-assisted scientific research.
Contribution
This work introduces ChemCrow, the first LLM-based chemistry agent that combines 18 expert tools to improve performance and capabilities in chemical synthesis, drug discovery, and materials design.
Findings
ChemCrow successfully synthesized an insect repellent and three organocatalysts.
It guided the discovery of a new chromophore.
GPT-4 evaluations could not reliably distinguish ChemCrow's outputs from correct completions.
Abstract
Over the last decades, excellent computational chemistry tools have been developed. Integrating them into a single platform with enhanced accessibility could help reaching their full potential by overcoming steep learning curves. Recently, large-language models (LLMs) have shown strong performance in tasks across domains, but struggle with chemistry-related problems. Moreover, these models lack access to external knowledge sources, limiting their usefulness in scientific applications. In this study, we introduce ChemCrow, an LLM chemistry agent designed to accomplish tasks across organic synthesis, drug discovery, and materials design. By integrating 18 expert-designed tools, ChemCrow augments the LLM performance in chemistry, and new capabilities emerge. Our agent autonomously planned and executed the syntheses of an insect repellent, three organocatalysts, and guided the discovery of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
MethodsMulti-Head Attention · Attention Is All You Need · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Softmax · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections
