Searching for chromate replacements using natural language processing and machine learning algorithms
Shujing Zhao, Nick Birbilis

TL;DR
This study employs NLP and machine learning models like Word2Vec and BERT to analyze a large corpus of scientific literature, aiming to identify potential chromate replacements for corrosion protection, achieving insights comparable to human experts.
Contribution
The paper introduces a novel application of NLP and machine learning to extract knowledge from scientific texts for materials discovery, specifically for chromate replacements.
Findings
Successful extraction of relevant information from 80 million records
Identification of potential chromate replacements through automated literature analysis
Demonstration of NLP models achieving expert-level insights
Abstract
The past few years has seen the application of machine learning utilised in the exploration of new materials. As in many fields of research - the vast majority of knowledge is published as text, which poses challenges in either a consolidated or statistical analysis across studies and reports. Such challenges include the inability to extract quantitative information, and in accessing the breadth of non-numerical information. To address this issue, the application of natural language processing (NLP) has been explored in several studies to date. In NLP, assignment of high-dimensional vectors, known as embeddings, to passages of text preserves the syntactic and semantic relationship between words. Embeddings rely on machine learning algorithms and in the present work, we have employed the Word2Vec model, previously explored by others, and the BERT model - applying them towards a unique…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHydrogen embrittlement and corrosion behaviors in metals
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · WordPiece · Layer Normalization · Dropout · Dense Connections · Linear Warmup With Linear Decay · Softmax · Refunds@Expedia|||How do I get a full refund from Expedia?
