Optimal randomized classification trees

Rafael Blanquero; Emilio Carrizosa; Cristina Molero-R\'io; Dolores; Romero Morales

arXiv:2110.11952·stat.ML·October 25, 2021

Optimal randomized classification trees

Rafael Blanquero, Emilio Carrizosa, Cristina Molero-R\'io, Dolores, Romero Morales

PDF

Open Access

TL;DR

This paper introduces a novel continuous optimization approach for constructing randomized classification trees, aiming to improve accuracy and control over misclassification rates compared to traditional greedy CART methods.

Contribution

It proposes a new continuous optimization-based method for building randomized decision trees, addressing limitations of existing optimal tree models.

Findings

01

Demonstrates good performance through computational experiments.

02

Outperforms traditional greedy CART in accuracy.

03

Offers better control over class-specific misclassification rates.

Abstract

Classification and Regression Trees (CARTs) are off-the-shelf techniques in modern Statistics and Machine Learning. CARTs are traditionally built by means of a greedy procedure, sequentially deciding the splitting predictor variable(s) and the associated threshold. This greedy approach trains trees very fast, but, by its nature, their classification accuracy may not be competitive against other state-of-the-art procedures. Moreover, controlling critical issues, such as the misclassification rates in each of the classes, is difficult. To address these shortcomings, optimal decision trees have been recently proposed in the literature, which use discrete decision variables to model the path each observation will follow in the tree. Instead, we propose a new approach based on continuous optimization. Our classifier can be seen as a randomized tree, since at each node of the decision tree a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Statistical Methods and Inference · Machine Learning and Algorithms