TL;DR
Crystalite is a lightweight Transformer that efficiently models crystalline materials using novel inductive biases, achieving state-of-the-art results with faster sampling than existing methods.
Contribution
It introduces Subatomic Tokenization and the Geometry Enhancement Module to improve crystal modeling efficiency and accuracy.
Findings
Achieves state-of-the-art results on crystal structure prediction benchmarks.
Attains the best S.U.N. discovery score among evaluated baselines.
Samples substantially faster than geometry-heavy alternatives.
Abstract
Generative models for crystalline materials often rely on equivariant graph neural networks, which capture geometric structure well but are costly to train and slow to sample. We present Crystalite, a lightweight diffusion Transformer for crystal modeling built around two simple inductive biases. The first is Subatomic Tokenization, a compact chemically structured atom representation that replaces high-dimensional one-hot encodings and is better suited to continuous diffusion. The second is the Geometry Enhancement Module (GEM), which injects periodic minimum-image pair geometry directly into attention through additive geometric biases. Together, these components preserve the simplicity and efficiency of a standard Transformer while making it better matched to the structure of crystalline materials. Crystalite achieves state-of-the-art results on crystal structure prediction benchmarks,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
