Unsupervised Machine-Learning Pipeline for Data-Driven Defect Detection and Characterisation: Application to Displacement Cascades
Samuel Del Fr\'e, Andr\'ee de Backer, Christophe Domain, Ludovic Thuinet, Charlotte S. Becquart

TL;DR
This paper introduces an unsupervised machine learning pipeline that detects and characterizes atomic defects from molecular dynamics data, providing a comprehensive, threshold-free analysis of irradiation-induced damage in materials.
Contribution
The work presents a novel unsupervised ML workflow combining autoencoders, UMAP, and HDBSCAN to identify and classify defects directly from atomic simulation data, without prior templates or thresholds.
Findings
Successfully identifies defect outliers with 99.7% accuracy
Clusters defect types correlating with physical defect motifs
Achieves high correlation (R2 > 0.89) between defect counts and cluster sizes
Abstract
Neutron irradiation produces, within a few picoseconds, displacement cascades that are sequences of atomic collisions generating point and extended defects which subsequently affects the long-term evolution of materials. The diversity of these defects, characterized morphologically and statistically, defines what is called the "primary damage". In this work, we present a fully unsupervised machine learning (ML) workflow that detects and classifies these defects directly from molecular dynamics data. Local environments are encoded by the Smooth Overlap of Atomic Positions (SOAP) vector, anomalous atoms are isolated with autoencoder neural networks (AE), embedded with Uniform Manifold Approximation and Projection (UMAP) and clustered using Hierarchical Density-Based Spatial Clustering of Applications with Noise (HDBSCAN). Applied to 80 keV displacement cascades in Ni,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
