Data-Driven Topological Analysis of Polymorphic Crystal Structures
Sourin Dey, Nicholas Miklaucic, Sadman Sadeed Omee, Rongzhi Dong, Lai Wei, Qinyang Li, Nihang Fu, and Jianjun Hu

TL;DR
This paper uses data-driven methods to analyze polymorphic crystal structures, revealing topological motifs and clustering techniques that improve understanding and prediction of polymorphism in materials.
Contribution
It introduces a topological analysis approach to polymorphs, uncovering recurring motifs and developing clustering methods based on polyhedral topology, advancing materials design.
Findings
Recurring topological motifs across polymorph pairs
Polyhedral environments are consistent despite symmetry differences
Topology-based clustering effectively groups similar polymorphs
Abstract
Polymorphism, the ability of a compound to crystallize in multiple distinct structures, plays a vital role in determining the physical, chemical, and functional properties of materials. Accurate identification and prediction of polymorphic structures are critical for materials design, drug development, and device optimization, as unknown or overlooked polymorphs may lead to unexpected performance or stability issues. Despite its significance, predicting polymorphism directly from a chemical composition remains a challenging problem due to the complex interplay between molecular conformations, crystal packing, and symmetry constraints. In this study, we conduct a comprehensive data-driven analysis of polymorphic materials from the Materials Project database, uncovering key statistical patterns in their composition, space group distributions, and polyhedral building blocks. We discover…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
