Skill-Based Autonomous Agents for Material Creep Database Construction
Yue Wu, Tianhao Su, Shunbo Hu, Deng Pan

TL;DR
This paper presents an autonomous agent framework utilizing Large Language Models to extract and verify high-fidelity material creep data from scientific literature, significantly accelerating database construction without human intervention.
Contribution
The work introduces a modular, skill-based agent architecture that automates complex data extraction and validation from PDFs, achieving over 90% success in constructing a physically consistent creep database.
Findings
Over 90% success rate in graphical data digitization
High correlation ($R^2 > 0.99$) between visual and textual data extraction
Demonstrated scalability for large-scale scientific literature mining
Abstract
The advancement of data-driven materials science is currently constrained by a fundamental bottleneck: the vast majority of historical experimental data remains locked within the unstructured text and rasterized figures of legacy scientific literature. Manual curation of this knowledge is prohibitively labor-intensive and prone to human error. To address this challenge, we introduce an autonomous, agent-based framework powered by Large Language Models (LLMs) designed to excavate high-fidelity datasets from scientific PDFs without human intervention. By deploying a modular "skill-based" architecture, the agent orchestrates complex cognitive tasks - including semantic filtering, multi-modal information extraction, and physics-informed validation. We demonstrate the efficacy of this framework by constructing a physically self-consistent database for material creep mechanics, a domain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Model Reduction and Neural Networks · Generative Adversarial Networks and Image Synthesis
