A Multi-Agent System Enables Versatile Information Extraction from the Chemical Literature
Yufan Chen, Ching Ting Leung, Bowen Yu, Jianwei Sun, Yong Huang, Linyan Li, Hao Chen, Hanyu Gao

TL;DR
This paper presents a multimodal multi-agent system leveraging large language models to automate and improve chemical information extraction from literature, significantly advancing the construction of chemical databases.
Contribution
It introduces a novel multi-agent framework combining MLLMs with specialized tools for robust chemical information extraction, outperforming previous models by a large margin.
Findings
Achieved an F1 score of 76.27% on a chemical reaction graphics dataset.
Surpassed the previous state-of-the-art F1 score of 39.13%.
Demonstrated versatility across multiple chemical information extraction tasks.
Abstract
To fully expedite AI-powered chemical research, high-quality chemical databases are the foundation. Automatic extraction of chemical information from the literature is essential for constructing reaction databases, but it is currently limited by the multimodality and style variability of chemical information. In this work, we developed a multimodal large language model (MLLM)-based multi-agent system for robust and automated chemical information extraction. It utilizes the MLLM's strong reasoning capability to understand the structure of diverse chemical graphics and decompose the extraction task into sub-tasks. It then coordinates a set of specialized agents, each combining the capabilities of the MLLM with the precise, domain-specific strengths of dedicated tools and web services, to solve the subtasks accurately and integrate the results into a unified output. Our system achieved an…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
