Using Constraints to Discover Sparse and Alternative Subgroup Descriptions
Jakob Bach

TL;DR
This paper introduces methods to incorporate feature sparsity and alternative subgroup discovery constraints into subgroup discovery, using heuristic and solver-based approaches, including a novel SMT formulation, to improve interpretability and flexibility.
Contribution
It presents a new framework for constrained subgroup discovery with a novel SMT-based optimization approach and analyzes the complexity of these constraints.
Findings
Heuristic methods find high-quality subgroups quickly.
Constraints increase the complexity but are manageable with proposed methods.
Solver-based search effectively handles complex constraints.
Abstract
Subgroup-discovery methods allow users to obtain simple descriptions of interesting regions in a dataset. Using constraints in subgroup discovery can enhance interpretability even further. In this article, we focus on two types of constraints: First, we limit the number of features used in subgroup descriptions, making the latter sparse. Second, we propose the novel optimization problem of finding alternative subgroup descriptions, which cover a similar set of data objects as a given subgroup but use different features. We describe how to integrate both constraint types into heuristic subgroup-discovery methods. Further, we propose a novel Satisfiability Modulo Theories (SMT) formulation of subgroup discovery as a white-box optimization problem, which allows solver-based search for subgroups and is open to a variety of constraint types. Additionally, we prove that both constraint types…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsText and Document Classification Technologies
MethodsSparse Evolutionary Training · Focus
