A Machine Learning and Explainable AI Framework Tailored for Unbalanced Experimental Catalyst Discovery
Parastoo Semnani, Mihail Bogojeski, Florian Bley, Zizheng Zhang, Qiong, Wu, Thomas Kneib, Jan Herrmann, Christoph Weisser, Florina Patcas,, Klaus-Robert M\"uller

TL;DR
This paper presents a robust machine learning and explainable AI framework designed to classify catalyst yields and identify key components, addressing data scarcity and imbalance in experimental catalyst discovery.
Contribution
The framework combines ML techniques tailored for unbalanced data and XAI methods to improve catalyst yield classification and interpretability, a novel approach in this domain.
Findings
Improved classification performance across multiple ML models.
XAI methods identified key catalyst components aligned with chemical intuition.
Framework effectively handles data scarcity and imbalance issues.
Abstract
The successful application of machine learning (ML) in catalyst design relies on high-quality and diverse data to ensure effective generalization to novel compositions, thereby aiding in catalyst discovery. However, due to complex interactions, catalyst design has long relied on trial-and-error, a costly and labor-intensive process leading to scarce data that is heavily biased towards undesired, low-yield catalysts. Despite the rise of ML in this field, most efforts have not focused on dealing with the challenges presented by such experimental data. To address these challenges, we introduce a robust machine learning and explainable AI (XAI) framework to accurately classify the catalytic yield of various compositions and identify the contributions of individual components. This framework combines a series of ML practices designed to handle the scarcity and imbalance of catalyst data. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Data Quality and Management
MethodsALIGN
