aLLoyM: A large language model for alloy phase diagram prediction

Yuna Oikawa; Guillaume Deffrennes; Taichi Abe; Ryo Tamura; Koji Tsuda

arXiv:2507.22558·cond-mat.mtrl-sci·April 30, 2026

aLLoyM: A large language model for alloy phase diagram prediction

Yuna Oikawa, Guillaume Deffrennes, Taichi Abe, Ryo Tamura, Koji Tsuda

PDF

1 Repo

TL;DR

aLLoyM is a specialized large language model trained on alloy phase diagram data, capable of generating and understanding phase diagrams, thus aiding materials discovery.

Contribution

This work introduces aLLoyM, a fine-tuned LLM for alloy phase diagrams, with publicly available datasets and models to advance materials science research.

Findings

01

Fine-tuning improves phase diagram question-answering accuracy.

02

aLLoyM can generate novel phase diagrams from component data.

03

Public release of datasets and models supports further research.

Abstract

Large Language Models (LLMs) are general-purpose tools with wide-ranging applications, including in materials science. In this work, we introduce aLLoyM, a fine-tuned LLM specifically trained on alloy compositions, temperatures, and their corresponding phase information. To develop aLLoyM, we curated question-and-answer (Q&A) pairs for binary and ternary phase diagrams using the open-source Computational Phase Diagram Database (CPDDB) and assessments based on CALPHAD (CALculation of PHAse Diagrams). We fine-tuned Mistral, an open-source pre-trained LLM, for two distinct Q&A formats: multiple-choice and short-answer. Benchmark evaluations demonstrate that fine-tuning substantially enhances performance on multiple-choice phase diagram questions. Moreover, the short-answer model of aLLoyM exhibits the ability to generate novel phase diagrams from its components alone, underscoring its…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://huggingface.co
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.