Lang2Str: Two-Stage Crystal Structure Generation with LLMs and Continuous Flow Models
Cong Liu, Chengyue Gong, Zhenyu Liu, Jiale Zhao, Yuxuan Zhang

TL;DR
Lang2Str introduces a two-stage generative framework combining large language models and flow-based models to improve the diversity, validity, and control in crystal structure generation for material discovery.
Contribution
The paper presents a novel two-stage framework that leverages LLMs for high-level descriptions and flow models for precise structure decoding, enhancing flexibility and accuracy in material generation.
Findings
Achieves competitive performance in initio material generation.
Generated structures closely match ground truth in geometry and energy.
Framework enables fine-grained control over material design.
Abstract
Generative models hold great promise for accelerating material discovery but are often limited by their inflexible single-stage generative process in designing valid and diverse materials. To address this, we propose a two-stage generative framework, Lang2Str, that combines the strengths of large language models (LLMs) and flow-based models for flexible and precise material generation. Our method frames the generative process as a conditional generative task, where an LLM provides high-level conditions by generating descriptions of material unit cells' geometric layouts and properties. These descriptions, informed by the LLM's extensive background knowledge, ensure reasonable structure designs. A conditioned flow model then decodes these textual conditions into precise continuous coordinates and unit cell parameters. This staged approach combines the structured reasoning of LLMs and the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Generative Adversarial Networks and Image Synthesis · Material Selection and Properties
