Fast Sparse Decision Tree Optimization via Reference Ensembles

Hayden McTavish; Chudi Zhong; Reto Achermann; Ilias Karimalis; Jacques; Chen; Cynthia Rudin; Margo Seltzer

arXiv:2112.00798·cs.LG·July 7, 2022

Fast Sparse Decision Tree Optimization via Reference Ensembles

Hayden McTavish, Chudi Zhong, Reto Achermann, Ilias Karimalis, Jacques, Chen, Cynthia Rudin, Margo Seltzer

PDF

Open Access 3 Repos 1 Video

TL;DR

This paper introduces a novel approach for optimizing sparse decision trees by using strategic guesses, significantly reducing computation time and enabling the construction of accurate, interpretable models that rival black box methods.

Contribution

The authors propose a guessing strategy that enhances branch-and-bound decision tree algorithms, making sparse decision tree optimization more practical and scalable for real-world datasets.

Findings

01

Run time reduced by multiple orders of magnitude

02

Ability to match black box model accuracy with sparse trees

03

Effective guessing strategies for feature binning and tree size

Abstract

Sparse decision tree optimization has been one of the most fundamental problems in AI since its inception and is a challenge at the core of interpretable machine learning. Sparse decision tree optimization is computationally hard, and despite steady effort since the 1960's, breakthroughs have only been made on the problem within the past few years, primarily on the problem of finding optimal sparse decision trees. However, current state-of-the-art algorithms often require impractical amounts of computation time and memory to find optimal or near-optimal trees for some real-world datasets, particularly those having several continuous-valued features. Given that the search spaces of these decision tree optimization problems are massive, can we practically hope to find a sparse decision tree that competes in accuracy with a black box machine learning model? We address this problem via…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

Fast Sparse Decision Tree Optimization via Reference Ensembles· underline

Taxonomy

TopicsMachine Learning and Data Classification · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications