MOFClassifier: A Machine Learning Approach for Validating Computation-Ready Metal-Organic Frameworks
Guobin Zhao, Pengyu Zhao, Yongchul G. Chung

TL;DR
MOFClassifier is a machine learning tool that improves the validation of computation-ready MOF structures by accurately classifying them and identifying errors, thereby enhancing high-throughput material screening.
Contribution
It introduces a novel PU-CGCNN model that outperforms rule-based methods in classifying MOFs and detecting subtle structural errors.
Findings
Achieved ROC of 0.979, surpassing previous 0.912
Effectively identifies errors undetectable by rule-based methods
Reduces false negatives in MOF classification
Abstract
The computational discovery and design of new crystalline materials, particularly metal-organic frameworks (MOFs), heavily relies on high-quality, computation-ready structural data. However, recent studies have revealed significant error rates within existing MOF databases, posing a critical data problem that hinders efficient high-throughput computational screening. While rule-based algorithms like MOSAEC, MOFChecker, and the Chen and Manz method (Chen-Manz) have been developed to address this, they often suffer from inherent limitations and misclassification of structures. To overcome this challenge, we developed MOFClassifier, a novel machine learning approach built upon a positive-unlabeled crystal graph convolutional neural network (PU-CGCNN) model. MOFClassifier learns intricate patterns from perfect crystal structures to predict a crystal-likeness score (CLscore), effectively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
