Interpretable clustering via optimal multiway-split decision trees
Hayato Suzuki, Shunnosuke Ikeda, Yuichi Takano

TL;DR
This paper introduces an interpretable clustering approach using optimal multiway-split decision trees formulated as a 0-1 integer linear program, improving computational efficiency and interpretability over traditional binary trees.
Contribution
The paper presents a novel multiway-split decision tree clustering method that is more tractable and interpretable, incorporating a data-driven discretization of continuous variables.
Findings
Outperforms baseline methods in clustering accuracy
Produces concise, interpretable multiway decision trees
Maintains competitive performance across datasets
Abstract
Clustering serves as a vital tool for uncovering latent data structures, and achieving both high accuracy and interpretability is essential. To this end, existing methods typically construct binary decision trees by solving mixed-integer nonlinear optimization problems, often leading to significant computational costs and suboptimal solutions. Furthermore, binary decision trees frequently result in excessively deep structures, which makes them difficult to interpret. To mitigate these issues, we propose an interpretable clustering method based on optimal multiway-split decision trees, formulated as a 0-1 integer linear optimization problem. This reformulation renders the optimization problem more tractable compared to existing models. A key feature of our method is the integration of a one-dimensional K-means algorithm for the discretization of continuous variables, allowing for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsExplainable Artificial Intelligence (XAI) · Data Mining Algorithms and Applications · Bayesian Modeling and Causal Inference
