MatMMFuse: Multi-Modal Fusion model for Material Property Prediction
Abhiroop Bhattacharya, Sylvain G. Cloutier

TL;DR
MatMMFuse is a multi-modal fusion model that combines graph-based crystal structure encoding with text embeddings from large language models to improve material property prediction accuracy and zero-shot performance.
Contribution
This work introduces a novel multi-modal fusion approach using attention mechanisms to integrate structure-aware and global information for material property prediction.
Findings
Significant improvement over single-modality models in predicting key properties.
40% better formation energy prediction compared to CGCNN.
68% better formation energy prediction compared to SciBERT.
Abstract
The recent progress of using graph based encoding of crystal structures for high throughput material property prediction has been quite successful. However, using a single modality model prevents us from exploiting the advantages of an enhanced features space by combining different representations. Specifically, pre-trained Large language models(LLMs) can encode a large amount of knowledge which is beneficial for training of models. Moreover, the graph encoder is able to learn the local features while the text encoder is able to learn global information such as space group and crystal symmetry. In this work, we propose Material Multi-Modal Fusion(MatMMFuse), a fusion based model which uses a multi-head attention mechanism for the combination of structure aware embedding from the Crystal Graph Convolution Network (CGCNN) and text embeddings from the SciBERT model. We train our model in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Advanced Graph Neural Networks · Inorganic Chemistry and Materials
MethodsAttention Is All You Need · Softmax · Linear Layer · Multi-Head Attention · Attentive Walk-Aggregating Graph Neural Network · Convolution
