Robust Machine Learning Framework for Reliable Discovery of High-Performance Half-Heusler Thermoelectrics
Shoeb Athar, Adrien Mecibah, and Philippe Jund

TL;DR
This paper develops a robust machine learning workflow for predicting thermoelectric performance in half-Heusler compounds, emphasizing model generalizability and physical insight, leading to the discovery of new high-performance candidates.
Contribution
It introduces a PCA-based data splitting method, integrates Bayesian hyperparameter optimization with feature filtering, and combines physical insight tools, enhancing ML model reliability for thermoelectric material discovery.
Findings
Improved model generalizability for zT prediction.
Identification of key physical drivers like dopant concentration and heat of vaporization.
Discovery of several novel high-zT thermoelectric candidates.
Abstract
Machine learning (ML) can facilitate efficient thermoelectric (TE) material discovery essential to address the environmental crisis. However, ML models often suffer from poor experimental generalizability despite high metrics. This study presents a robust workflow, applied to the half-Heusler (hH) structural prototype, for figure of merit (zT) prediction, to improve the generalizability of ML models. To resolve challenges in dataset handling and feature filtering, we first introduce a rigorous PCA-based splitting method that ensures training and test sets are unbiased and representative of the full chemical space. We then integrate Bayesian hyperparameter optimization with k-best feature filtering across three architectures-Random Forest, XGBoost, and Neural Networks - while employing SISSO symbolic regression for physical insight and comparison. Using SHAP and SISSO analysis, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Thermoelectric Materials and Devices · Machine Learning in Materials Science · Heusler alloys: electronic and magnetic properties
