Coherent Collections of Rules Describing Exceptional Materials Identified with a Multi-Objective Optimization of Subgroups
Lucas Foppa, Matthias Scheffler

TL;DR
This paper introduces a multi-objective subgroup discovery approach to identify diverse rules describing exceptional materials, demonstrated on perovskites with high bulk modulus, enabling more effective material screening.
Contribution
It presents a novel multi-objective optimization method for subgroup discovery that captures a Pareto region of solutions, improving the identification of exceptional materials over traditional single-objective approaches.
Findings
Identified a Pareto region of subgroup rules balancing size and exceptionality.
Discovered rules that predict high bulk modulus in perovskites.
Successfully screened materials with up to 13% higher bulk modulus than training data.
Abstract
Useful materials are often statistically exceptional and they might be overlooked by AI models that attempt to describe all materials simultaneously. These global models perform well for the majority of (useless) materials, but they do not necessarily capture the useful ones. Subgroup discovery (SGD) identifies rules describing subsets of materials (SGs) associated to exceptional values, e.g., high values, of a materials property of interest. Thus, SGD can better capture exceptional materials compared to most widely used AI techniques. Previous works focused on the SG that maximizes an objective function that establishes one tradeoff between the size of the SG and the exceptionality of the distribution of property values in the SG. However, this optimization does not give a unique solution, but many SGs typically have similar objective-function values. Here, we identify a Pareto region…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · Statistical and Computational Modeling · Manufacturing Process and Optimization
