Using Machine Learning and Data Mining to Leverage Community Knowledge for the Engineering of Stable Metal-Organic Frameworks
Aditya Nandy, Chenru Duan, and Heather J. Kulik

TL;DR
This paper employs machine learning and data mining on thousands of reports to understand and predict the stability of metal-organic frameworks, aiming to accelerate the development of practical, stable MOF materials for various applications.
Contribution
It introduces a large-scale NLP and image analysis pipeline combined with ML models to predict MOF stability, providing new insights and strategies for engineering more stable MOFs.
Findings
ML models predict stability faster than traditional methods
Analysis reveals key structural features influencing stability
Strategies identified to enhance stability of 3d-MOFs
Abstract
Although the tailored metal active sites and porous architectures of MOFs hold great promise for engineering challenges ranging from gas separations to catalysis, a lack of understanding of how to improve their stability limits their use in practice. To overcome this limitation, we extract thousands of published reports of the key aspects of MOF stability necessary for their practical application: the ability to withstand high temperatures without degrading and the capacity to be activated by removal of solvent molecules. From nearly 4,000 manuscripts, we use natural language processing and automated image analysis to obtain over 2,000 solvent-removal stability measures and 3,000 thermal degradation temperatures. We analyze the relationships between stability properties and the chemical and geometric structures in this set to identify limits of prior heuristics derived from smaller sets…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsGaussian Process
