The computational complexity of some explainable clustering problems
Eduardo Sany Laber

TL;DR
This paper investigates the computational difficulty of explainable clustering problems using axis-aligned decision trees, showing some are NP-hard while others are solvable efficiently.
Contribution
It establishes the complexity status of several explainable clustering problems, identifying which can be optimized in polynomial time and which are computationally hard.
Findings
k-means, k-medians, and k-centers are NP-hard to optimize
Spacing cost function can be optimized in polynomial time
Clarifies the computational landscape of explainable clustering problems
Abstract
We study the computational complexity of some explainable clustering problems in the framework proposed by [Dasgupta et al., ICML 2020], where explainability is achieved via axis-aligned decision trees. We consider the -means, -medians, -centers and the spacing cost functions. We prove that the first three are hard to optimize while the latter can be optimized in polynomial time.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRough Sets and Fuzzy Logic · Data Mining Algorithms and Applications · Statistical Methods and Inference
