Large Language Models for Superconductor Discovery
Suman Itani, Yibo Zhang, Ranjit Itani, Jiadong Zang

TL;DR
This paper demonstrates how large language models can be used to extract data, predict properties, and design new superconducting materials, significantly advancing materials discovery processes.
Contribution
It introduces a large experimental database and fine-tuned LLMs for classification, prediction, and inverse design of superconductors, outperforming traditional models.
Findings
Fine-tuned LLMs achieve comparable or better performance than traditional models.
The inverse design model generates 28% novel, chemically plausible candidates.
Predicted high-Tc candidates in existing databases suggest new materials for experimental validation.
Abstract
Large language models (LLMs) offer new opportunities for automated data extraction and property prediction across materials science, yet their use in superconductivity research remains limited. Here we construct a large experimental database of 78,203 records, covering 19,058 unique compositions, extracted from scientific literature using an LLM-driven workflow. Each entry includes chemical composition, critical temperature, measurement pressure, structural descriptors, and critical fields. We fine-tune several open-source LLMs for three tasks: (i) classifying superconductors vs. non-superconductors, (ii) predicting the superconducting transition temperature directly from composition or structure-informed inputs, and (iii) inverse design of candidate compositions conditioned on target Tc. The fine-tuned LLMs achieve performance comparable to traditional feature-based models and in some…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Inorganic Chemistry and Materials · Inorganic Fluorides and Related Compounds
