Robust FCC solute diffusion predictions from ab-initio machine learning methods
Henry Wu, Aren Lorenson, Ben Anderson, Liam Witteman, Haotian Wu,, Bryce Meredig, and Dane Morgan

TL;DR
This study compares four machine learning methods to predict FCC solute diffusion barriers, demonstrating that GKRR and ANN excel in accuracy and extrapolation, aiding materials design.
Contribution
It introduces a systematic evaluation of ML methods for diffusion barrier prediction and highlights the effectiveness of GKRR and ANN in extrapolating to new hosts.
Findings
GKRR and ANN achieved 0.15 eV cross-validation errors.
These methods predicted diffusion barriers within 0.2 eV with limited data.
The approach enables comprehensive impurity diffusion predictions across FCC hosts.
Abstract
We evaluate the performance of four machine learning methods for modeling and predicting FCC solute diffusion barriers. More than 200 FCC solute diffusion barriers from previous density functional theory (DFT) calculations served as our dataset to train four machine learning methods: linear regression (LR), decision tree (DT), Gaussian kernel ridge regression (GKRR), and artificial neural network (ANN). We separately optimize key physical descriptors favored by each method to model diffusion barriers. We also assess the ability of each method to extrapolate when faced with new hosts with limited known data. GKRR and ANN were found to perform the best, showing 0.15 eV cross-validation errors and predicting impurity diffusion in new hosts to within 0.2 eV when given only 5 data points from the host. We demonstrate the success of a combined DFT + data mining approach towards solving…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
