Optimal Decision Trees For Interpretable Clustering with Constraints (Extended Version)
Pouya Shati, Eldan Cohen, Sheila McIlraith

TL;DR
This paper introduces a SAT-based framework for interpretable constrained clustering that guarantees solution quality and supports domain-specific constraints, addressing limitations of previous methods.
Contribution
It presents the first approach combining interpretability, constraints, and theoretical guarantees in clustering, using a novel SAT-based method.
Findings
Produces high-quality, interpretable clustering solutions
Supports domain-specific clustering constraints
Provides strong theoretical guarantees on solution quality
Abstract
Constrained clustering is a semi-supervised task that employs a limited amount of labelled data, formulated as constraints, to incorporate domain-specific knowledge and to significantly improve clustering accuracy. Previous work has considered exact optimization formulations that can guarantee optimal clustering while satisfying all constraints, however these approaches lack interpretability. Recently, decision-trees have been used to produce inherently interpretable clustering solutions, however existing approaches do not support clustering constraints and do not provide strong theoretical guarantees on solution quality. In this work, we present a novel SAT-based framework for interpretable clustering that supports clustering constraints and that also provides strong theoretical guarantees on solution quality. We also present new insight into the trade-off between interpretability and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsData Mining Algorithms and Applications · Advanced Clustering Algorithms Research · Neural Networks and Applications
