Information fusion strategy integrating pre-trained language model and contrastive learning for materials knowledge mining
Yongqian Peng, Zhouran Zhang, Longhui Zhang, Fengyuan Zhao, Yahao Li, Yicong Ye, Shuxin Bai

TL;DR
This paper introduces a novel information fusion framework combining pre-trained language models and contrastive learning to enhance materials property prediction by integrating literature-based knowledge with physical descriptors.
Contribution
It presents an innovative architecture that fuses textual and quantitative data for materials knowledge mining, outperforming traditional methods in complex property prediction.
Findings
Achieved R2 of 0.849 on titanium alloys
Achieved R2 of 0.680 on refractory alloys
Demonstrated superior performance over baseline models
Abstract
Machine learning has revolutionized materials design, yet predicting complex properties like alloy ductility remains challenging due to the influence of processing conditions and microstructural features that resist quantification through traditional reductionist approaches. Here, we present an innovative information fusion architecture that integrates domain-specific texts from materials science literature with quantitative physical descriptors to overcome these limitations. Our framework employs MatSciBERT for advanced textual comprehension and incorporates contrastive learning to automatically extract implicit knowledge regarding processing parameters and microstructural characteristics. Through rigorous ablation studies and comparative experiments, the model demonstrates superior performance, achieving coefficient of determination (R2) values of 0.849 and 0.680 on titanium alloy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Image Processing and 3D Reconstruction · Geoscience and Mining Technology
MethodsContrastive Learning · Sparse Evolutionary Training
