Predicting Scale-Up of Metal-Organic Framework Syntheses with Large Language Models

Peter Walther; Hongrui Sheng; Xinxin Liu; Bin Feng; Reid Coyle; Xinhua Yan; Kyle Smith; Harrison Kayal; Shyam Chand Pal; Zhiling Zheng

arXiv:2604.20899·cond-mat.mtrl-sci·April 24, 2026

TL;DR

This paper presents ESU-MOF, a literature-mined dataset and positive-unlabeled learning method that fine-tunes large language models to predict whether metal-organic framework (MOF) syntheses can be scaled up, aiding industrial deployment.

Contribution

The paper introduces a literature-mined dataset and a positive-unlabeled learning approach for predicting the scalability of MOF syntheses from published reports.

Findings

1. Achieved 91.4% accuracy in predicting MOF scale-up potential.
2. Enabled rapid, data-driven triage for industrial MOF discovery.
3. Demonstrated the effectiveness of LLMs in materials synthesis prediction.

Abstract

Scalable synthesis remains the gate between MOF discovery and industrial deployment, as scale-up know-how is fragmented across disparate reports. We introduce ESU-MOF, a literature-mined dataset and a positive-unlabeled learning strategy that fine-tunes large language models to predict scalability potential with 91.4% accuracy, enabling rapid data-driven triage for industrial MOF discovery.
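
The abstract does not spell out how the positive-unlabeled strategy works, so the following is only a minimal sketch of one classic positive-unlabeled recipe (the Elkan-Noto calibration), not the authors' method. It assumes a classifier has already been trained to separate syntheses with reported scale-up (labeled) from all others (unlabeled) and outputs a score approximating P(labeled | x). All names and numbers below are hypothetical and chosen for illustration.

```python
from statistics import mean


def estimate_label_frequency(scores_on_known_positives):
    """Estimate c = P(labeled | positive) as the mean classifier score
    on held-out examples that are known to be positive."""
    if not scores_on_known_positives:
        raise ValueError("need at least one held-out labeled positive")
    return mean(scores_on_known_positives)


def pu_corrected_probability(score, c):
    """Turn a score approximating P(labeled | x) into an estimate of
    P(positive | x) by dividing by c, clipped to the range [0, 1]."""
    return max(0.0, min(1.0, score / c))


# Hypothetical classifier scores on held-out syntheses with reported scale-up.
held_out_positive_scores = [0.5, 0.25, 0.75]
c = estimate_label_frequency(held_out_positive_scores)

# Hypothetical scores for unlabeled syntheses (no scale-up report found).
unlabeled_scores = {"synthesis_A": 0.4, "synthesis_B": 0.1, "synthesis_C": 0.6}
corrected = {
    name: pu_corrected_probability(score, c)
    for name, score in unlabeled_scores.items()
}

# Rank unlabeled syntheses by estimated scalability for triage.
ranking = sorted(corrected, key=corrected.get, reverse=True)
```

The point of the correction is that an unlabeled synthesis is not a confirmed negative: the absence of a scale-up report lowers the raw score, and dividing by the estimated label frequency compensates for that. A fine-tuned language model could stand in for the scoring classifier, but that pairing is an assumption here.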
