Beyond Diamond: Interpretable Machine Learning Reveals Design Principles for Quantum Defect Host Materials
Mohammed Mahshook, Rudra Banerjee

TL;DR
This paper develops a machine learning framework to identify and predict suitable quantum defect host materials from vast chemical spaces, revealing new candidates and underlying design principles.
Contribution
It introduces a composition-only ML approach using Rashomon set ensembles to extract consensus design rules for quantum defect hosts, enabling efficient screening of large material databases.
Findings
Identified 122 high-confidence candidate materials, including unexplored compounds.
Validated dielectric screening as a coherence proxy with R^2 = 0.89.
Revealed key features like filled valence shells and chemical composition that favor quantum hosting.
Abstract
Solid-state spin defects in wide-bandgap semiconductors are leading candidates for quantum information processing, but systematic identification of suitable host materials remains limited by the cost of first-principles screening across vast chemical spaces. We address this with a composition-only machine learning framework built on heterogeneous Rashomon set ensembles: by contrasting the feature attributions of seven diverse classifiers, we extract consensus design rules that no single model identifies alone-filled valence s-, d-, and f-shells, low chemical heterogeneity, and enrichment in C, S, Si, and O favor quantum compatibility. Screening approximately 45,000 thermodynamically stable compounds, we identify 122 high-confidence candidates (confidence > 0.95), recovering most experimentally verified hosts (C, SiC, ZnO, ZnS) and predicting unexplored materials including TiO,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
