Data-driven Prediction of Ionic Conductivity in Solid-State Electrolytes with Machine Learning and Large Language Models
Haewon Kim, Taekgi Lee, Seongeun Hong, Kyeong-Ho Kim, and Yongchul G. Chung

TL;DR
This paper demonstrates that combining geometric descriptors with machine learning and large language models enables efficient, accurate prediction of ionic conductivity in solid-state electrolytes, facilitating accelerated materials discovery.
Contribution
It introduces a dual approach using gradient-boosted trees with geometric descriptors and fine-tuned LLMs on CIF metadata for predicting ionic conductivity.
Findings
Gradient-boosted tree regressor achieved MAE of 0.543 in log(S cm-1).
SHAP analysis identified probe-occupiable volume and lattice parameters as key factors.
LLama-3.1-8B-Instruct achieved MAE of 0.657 using only CIF metadata.
Abstract
Solid-state electrolytes (SSEs) are attractive for next-generation lithium-ion batteries due to improved safety and stability but their low room-temperature ionic conductivity hinders practical application. Experimental synthesis and testing of new SSEs remain time-consuming and resource intensive. Machine learning (ML) offers an accelerated route for SSE discovery; however, composition-only models neglect structural factors important for ion transport while graph neural networks (GNNs) are challenged by the scarcity of structure-labeled conductivity data and the prevalence of crystallographic disorder in CIFs. Here, we train two complementary predictors on the same room-temperature, structure-labeled dataset (n = 499). A gradient-boosted tree regressor (GBR) combining stoichiometric and geometric descriptors achieves best performance (MAE = 0.543 in log(S cm-1)), and Shapley Additive…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
