Scalable Optimal Multiway-Split Decision Trees with Constraints
Shivaram Subramanian, Wei Sun

TL;DR
This paper introduces a scalable, path-based mixed-integer programming approach for optimal multiway-split decision trees that can handle large datasets, nonlinear metrics, and complex constraints, outperforming existing methods in efficiency and interpretability.
Contribution
The authors develop a novel path-based MIP formulation and a column generation framework that significantly improves scalability and flexibility over existing arc-based models for optimal decision trees.
Findings
Successfully trained on datasets with over 1 million samples.
Achieved up to 24x faster runtimes compared to state-of-the-art MIP methods.
Produced more interpretable multiway-split trees with competitive accuracy.
Abstract
There has been a surge of interest in learning optimal decision trees using mixed-integer programs (MIP) in recent years, as heuristic-based methods do not guarantee optimality and find it challenging to incorporate constraints that are critical for many practical applications. However, existing MIP methods that build on an arc-based formulation do not scale well as the number of binary variables is in the order of , where and refer to the depth of the tree and the size of the dataset. Moreover, they can only handle sample-level constraints and linear metrics. In this paper, we propose a novel path-based MIP formulation where the number of decision variables is independent of . We present a scalable column generation framework to solve the MIP optimally. Our framework produces a multiway-split tree which is more interpretable than the typical binary-split…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Advanced Multi-Objective Optimization Algorithms
