Predicting Scale-Up of Metal-Organic Framework Syntheses with Large Language Models
Peter Walther, Hongrui Sheng, Xinxin Liu, Bin Feng, Reid Coyle, Xinhua Yan, Kyle Smith, Harrison Kayal, Shyam Chand Pal, Zhiling Zheng

TL;DR
This paper presents ESU-MOF, a dataset and method using large language models to predict the scalability of MOF syntheses, aiding industrial deployment.
Contribution
It introduces a novel dataset and positive-unlabeled learning approach to accurately predict MOF synthesis scalability from literature data.
Findings
Achieved 91.4% accuracy in predicting MOF scale-up potential.
Enabled rapid, data-driven triage for industrial MOF discovery.
Demonstrated the effectiveness of LLMs in materials synthesis prediction.
Abstract
Scalable synthesis remains the gate between MOF discovery and industrial deployment, as scale-up know-how is fragmented across disparate reports. We introduce ESU-MOF, a literature-mined dataset and a positive-unlabeled learning strategy that fine-tunes large language models to predict scalability potential with 91.4% accuracy, enabling rapid data-driven triage for industrial MOF discovery.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
