AlloyBERT: Alloy Property Prediction with Large Language Models
Akshat Chaudhari, Chakradhar Guntuboina, Hongshuo Huang, and Amir, Barati Farimani

TL;DR
AlloyBERT leverages large language models to predict alloy properties from textual descriptions, outperforming traditional models and demonstrating the potential of NLP techniques in materials science.
Contribution
This paper introduces AlloyBERT, a novel transformer-based model that predicts alloy properties from textual data, combining pre-trained language models with domain-specific fine-tuning.
Findings
Achieved low MSE scores on alloy property datasets.
Outperformed shallow models in property prediction accuracy.
Established a text-based framework for alloy property prediction.
Abstract
The pursuit of novel alloys tailored to specific requirements poses significant challenges for researchers in the field. This underscores the importance of developing predictive techniques for essential physical properties of alloys based on their chemical composition and processing parameters. This study introduces AlloyBERT, a transformer encoder-based model designed to predict properties such as elastic modulus and yield strength of alloys using textual inputs. Leveraging the pre-trained RoBERTa encoder model as its foundation, AlloyBERT employs self-attention mechanisms to establish meaningful relationships between words, enabling it to interpret human-readable input and predict target alloy properties. By combining a tokenizer trained on our textual data and a RoBERTa encoder pre-trained and fine-tuned for this specific task, we achieved a mean squared error (MSE) of 0.00015 on the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science
MethodsAttention Is All You Need · Sparse Evolutionary Training · Linear Layer · Layer Normalization · WordPiece · Multi-Head Attention · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Softmax · Dense Connections
