Automated Extraction of Multicomponent Alloy Data Using Large Language Models for Sustainable Design
Aravindan Kamatchi Sundaram, Mohit Chakraborty, Sai Mani Kumar Devathi, B. Pabitramohan Prusty, Rohit Batra

TL;DR
This paper presents an LLM-based pipeline that accurately extracts multicomponent alloy data from literature, creating large databases to support sustainable materials design and selection.
Contribution
It introduces a novel LLM-driven method for extracting detailed alloy data from text and tables, surpassing existing approaches in accuracy and scope.
Findings
Retrieved over 180,000 alloy data entries from literature.
Achieved F1-scores of 0.83 for text and 0.88 for tables in data extraction.
Generated the largest publicly available multicomponent alloy database.
Abstract
The design of sustainable materials requires access to materials performance and sustainability data from literature corpus in an organized, structured and automated manner. Natural language processing approaches, particularly large language models (LLMs), have been explored for materials data extraction from the literature, yet often suffer from limited accuracy or narrow scope. In this work, an LLM-based pipeline is developed to accurately extract alloy-related information from both textual descriptions and tabular data across the literature on high-entropy (or multicomponent) alloys (HEA). Specifically two databases with 37,711 and 148,069 entries respectively are retrieved; one from the literature text, consisting of alloy composition, processing conditions, characterization methods, and reported properties, and other from the literature tables, consisting of property names, values,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · High Entropy Alloys Studies · Material Selection and Properties
