AgentCAT: An LLM Agent for Extracting and Analyzing Catalytic Reaction Data from Chemical Engineering Literature
Wei Yang, Zihao Liu, Tao Tan, Xiao Hu, Hong Xie, Lulu Li Xin Li, Jianyu Han, Defu Lian, Mao Ye

TL;DR
AgentCAT is a large language model-based system that extracts, analyzes, and visualizes catalytic reaction data from chemical engineering literature, addressing data bottlenecks and supporting interactive, natural language-driven analysis.
Contribution
It introduces a schema-guided extraction pipeline, a dependency-aware knowledge graph, and a natural language querying module for catalytic reaction data from scientific papers.
Findings
Effective extraction from ~800 papers demonstrated
Knowledge graph links catalysts, mechanisms, and outcomes
Supports natural language exploration and visualization
Abstract
This paper presents a large language model (LLM) agent named AgentCAT, which extracts and analyzes catalytic reaction data from chemical engineering papers, %and supports natural language based interactive analysis of the extracted data. AgentCAT serves as an alternative to overcome the long-standing data bottleneck in chemical engineering field, and its natural language based interactive data analysis functionality is friendly to the community. AgentCAT also presents a formal abstraction and challenge analysis of the catalytic reaction data extraction task in an artificial intelligence-friendly manner. This abstraction would help the artificial intelligence community understand this problem and in turn would attract more attention to address it. Technically, the complex catalytic process leads to complicated dependency structure in catalytic reaction data with respect to elementary…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Ammonia Synthesis and Nitrogen Reduction · Catalysts for Methane Reforming
