MatterChat: A Multi-Modal LLM for Material Science
Yingheng Tang, Wenbin Xu, Jie Cao, Weilu Gao, Steve Farrell, Benjamin, Erichson, Michael W. Mahoney, Andy Nonaka, Zhi Yao

TL;DR
MatterChat is a novel multi-modal large language model that integrates atomic structural data with textual information to improve material property prediction and scientific reasoning in materials science.
Contribution
It introduces a structure-aware multi-modal LLM that unifies material structures and language, using a bridging module to align pretrained models, enhancing flexibility and reducing training costs.
Findings
Outperforms general-purpose LLMs like GPT-4 in material property prediction
Enhances human-AI interaction in materials science tasks
Supports advanced scientific reasoning and synthesis steps
Abstract
Understanding and predicting the properties of inorganic materials is crucial for accelerating advancements in materials science and driving applications in energy, electronics, and beyond. Integrating material structure data with language-based information through multi-modal large language models (LLMs) offers great potential to support these efforts by enhancing human-AI interaction. However, a key challenge lies in integrating atomic structures at full resolution into LLMs. In this work, we introduce MatterChat, a versatile structure-aware multi-modal LLM that unifies material structural data and textual inputs into a single cohesive model. MatterChat employs a bridging module to effectively align a pretrained machine learning interatomic potential with a pretrained LLM, reducing training costs and enhancing flexibility. Our results demonstrate that MatterChat significantly improves…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Materials Characterization Techniques · Mineral Processing and Grinding · Machine Learning in Materials Science
MethodsAttention Is All You Need · Absolute Position Encodings · Linear Layer · Byte Pair Encoding · Layer Normalization · Residual Connection · Dense Connections · Label Smoothing · Multi-Head Attention · Position-Wise Feed-Forward Layer
