On Computing Optimal Tree Ensembles

Christian Komusiewicz; Pascal Kunz; Frank Sommer; Manuel Sorge

arXiv:2306.04423·cs.LG·September 25, 2024·1 cites

On Computing Optimal Tree Ensembles

Christian Komusiewicz, Pascal Kunz, Frank Sommer, Manuel Sorge

PDF

Open Access 1 Video

TL;DR

This paper introduces novel algorithms for computing optimal decision tree ensembles, improving tractability results and providing bounds, with potential practical implications for classification and regression tasks.

Contribution

It presents two new algorithms for optimal tree ensembles, including a witness-tree technique and dynamic programming approach, advancing the computational methods for these models.

Findings

01

Algorithms for optimal tree ensembles with improved runtime bounds.

02

Dynamic programming approach for computing ensembles.

03

Ensembles may require exponentially fewer cuts than single trees.

Abstract

Random forests and, more generally, (decision\nobreakdash-)tree ensembles are widely used methods for classification and regression. Recent algorithmic advances allow to compute decision trees that are optimal for various measures such as their size or depth. We are not aware of such research for tree ensembles and aim to contribute to this area. Mainly, we provide two novel algorithms and corresponding lower bounds. First, we are able to carry over and substantially improve on tractability results for decision trees: We obtain an algorithm that, given a training-data set and an size bound $S \in R$ , computes a tree ensemble of size at most $S$ that classifies the data correctly. The algorithm runs in $(4 δ D S)^{S} \cdot p o l y$ -time, where $D$ the largest domain size, $δ$ is the largest number of features in which two examples differ, $n$ the number of input examples,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

On Computing Optimal Tree Ensembles· slideslive

Taxonomy

TopicsMachine Learning and Data Classification · Data Mining Algorithms and Applications · Bayesian Modeling and Causal Inference

MethodsAttentive Walk-Aggregating Graph Neural Network