Topic Modeling and Link-Prediction for Material Property Discovery
Ryan C. Barron, Maksim E. Eren, Valentin Stanev, Cynthia Matuszek, Boian S. Alexandrov

TL;DR
This paper introduces an AI-driven hierarchical link prediction framework that combines matrix factorization techniques to uncover hidden associations in scientific literature networks, aiding material property discovery across diverse physics domains.
Contribution
The paper presents a novel integration of Hierarchical Nonnegative Matrix Factorization, Boolean matrix factorization, and Logistic matrix factorization for topic modeling and link prediction in large scientific corpora.
Findings
Successfully maps materials to coherent scientific topics.
Predicts hidden links between materials and topics, suggesting new hypotheses.
Validates the method by recovering known associations in superconductivity literature.
Abstract
Link prediction infers missing or future relations between graph nodes, based on connection patterns. Scientific literature networks and knowledge graphs are typically large, sparse, and noisy, and often contain missing links between entities. We present an AI-driven hierarchical link prediction framework that integrates matrix factorization to infer hidden associations and steer discovery in complex material domains. Our method combines Hierarchical Nonnegative Matrix Factorization (HNMFk) and Boolean matrix factorization (BNMFk) with automatic model selection, as well as Logistic matrix factorization (LMF), we use to construct a three-level topic tree from a 46,862-document corpus focused on 73 transition-metal dichalcogenides (TMDs). These materials are studied in a variety of physics fields with many current and potential applications. An ensemble BNMFk + LMF approach fuses…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Graph Neural Networks · Machine Learning in Materials Science · Complex Network Analysis Techniques
