Bridging Text and Crystal Structures: Literature-driven Contrastive Learning for Materials Science
Yuta Suzuki, Tatsunori Taniai, Ryo Igarashi, Kotaro Saito, Naoya Chiba, Yoshitaka Ushiku, and Kanta Ono

TL;DR
This paper introduces CLaSP, a contrastive learning framework that creates crossmodal embeddings between crystal structures and texts, enabling intuitive materials retrieval and analysis based on property-related descriptions.
Contribution
CLaSP is the first to align crystal structure embeddings with textual descriptions, facilitating property-based material exploration using large-scale literature data.
Findings
Effective text-based crystal structure screening demonstrated
Embedding space visualization reveals meaningful structure-property relationships
Leverages over 400,000 structures and publication records for training
Abstract
Understanding structure-property relationships is an essential yet challenging aspect of materials discovery and development. To facilitate this process, recent studies in materials informatics have sought latent embedding spaces of crystal structures to capture their similarities based on properties and functionalities. However, abstract feature-based embedding spaces are human-unfriendly and prevent intuitive and efficient exploration of the vast materials space. Here we introduce Contrastive Language--Structure Pre-training (CLaSP), a learning paradigm for constructing crossmodal embedding spaces between crystal structures and texts. CLaSP aims to achieve material embeddings that 1) capture property- and functionality-related similarities between crystal structures and 2) allow intuitive retrieval of materials via user-provided description texts as queries. To compensate for the lack…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLinguistic research and analysis · Linguistic Education and Pedagogy
