On Computing Optimal Tree Ensembles
Christian Komusiewicz, Pascal Kunz, Frank Sommer, Manuel Sorge

TL;DR
This paper introduces novel algorithms for computing optimal decision tree ensembles, improving tractability results and providing bounds, with potential practical implications for classification and regression tasks.
Contribution
It presents two new algorithms for optimal tree ensembles, including a witness-tree technique and dynamic programming approach, advancing the computational methods for these models.
Findings
Algorithms for optimal tree ensembles with improved runtime bounds.
Dynamic programming approach for computing ensembles.
Ensembles may require exponentially fewer cuts than single trees.
Abstract
Random forests and, more generally, (decision\nobreakdash-)tree ensembles are widely used methods for classification and regression. Recent algorithmic advances allow to compute decision trees that are optimal for various measures such as their size or depth. We are not aware of such research for tree ensembles and aim to contribute to this area. Mainly, we provide two novel algorithms and corresponding lower bounds. First, we are able to carry over and substantially improve on tractability results for decision trees: We obtain an algorithm that, given a training-data set and an size bound , computes a tree ensemble of size at most that classifies the data correctly. The algorithm runs in -time, where the largest domain size, is the largest number of features in which two examples differ, the number of input examples,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Data Mining Algorithms and Applications · Bayesian Modeling and Causal Inference
MethodsAttentive Walk-Aggregating Graph Neural Network
