SWORD: Symmetry and Wyckoff-sequence of Ordered and Disordered crystals
Yuyao Huang, Wei Nong, Shuya Yamazaki, Martin Hoffmann Petersen, Jianghai Wang, Ruiming Zhu, Kedar Hippalgaonkar

TL;DR
SWORD is a symmetry-aware, Wyckoff-based string representation for ordered and disordered crystals that improves structure comparison, deduplication, and novelty assessment in large crystallographic databases.
Contribution
It introduces a novel, symmetry-aware representation that explicitly encodes disorder and co-occupancy, enabling more reliable structure analysis and database curation.
Findings
SWORD remains invariant under symmetry transformations.
It effectively identifies duplicates in large databases.
It correlates unrelaxed structures with relaxed states.
Abstract
Novelty in materials discovery requires candidates to be distinct, non-redundant, and thermodynamically plausible. While crystallographic databases continue to expand in both size and complexity, making efficient and reliable novelty assessment has become increasingly difficult. This becomes particularly acute when crystallographic disorder is involved, as partial occupancies greatly enlarge the structure-composition space and obscure the identification of genuinely distinct structures. Here, we introduce SWORD, a symmetry-aware, Wyckoff-based string representation compatible with both ordered and disordered crystals. SWORD provides (i) standardization of symmetry-equivalent structural descriptions into a consistent label, (ii) explicitly represents co-occupying species on partially occupied sites, and (iii) quantifies complex disorder through a degree of mixing descriptor that captures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
