Pair distribution function analysis for oxide defect identification through feature extraction and supervised learning
Shuyan Zhang, Jie Gong, Sharon Chu, Daniel Xiao, B. Reeja Jayan, Alan, J. H. McGaughey

TL;DR
This paper develops a machine learning approach using feature extraction from pair distribution functions to identify and quantify defects in TiO₂, combining physics-based models with neural networks for improved material characterization.
Contribution
It introduces a novel pipeline integrating physical feature extraction and supervised learning to accurately predict defect types and concentrations from experimental PDFs.
Findings
Autoencoder with physics-based initialization yields highest accuracy.
The method outperforms brute-force predictions on experimental data.
Features extracted effectively correlate with defect concentrations.
Abstract
Feature extraction and a neural network model are applied to predict the defect types and concentrations in experimental TiO samples. A dataset of TiO structures with vacancies and interstitials of oxygen and titanium is built and the structures are relaxed using energy minimization. The features of the calculated pair distribution functions (PDFs) of these defected structures are extracted using linear methods (principal component analysis, non-negative matrix factorization) and non-linear methods (autoencoder, convolutional neural network). The extracted features are used as the inputs to a neural network that maps the feature weights to the concentration of each defect type. The performance of this machine learning pipeline is validated by predicting the defect concentrations based on experimentally-measured TiO PDFs and comparing the results to brute-force predictions. A…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · X-ray Diffraction in Crystallography · Geochemistry and Geologic Mapping
