Breiman meets Bellman: Non-Greedy Decision Trees with MDPs

Hector Kohler; Riad Akrour; Philippe Preux

arXiv:2309.12701·cs.LG·June 3, 2025

Breiman meets Bellman: Non-Greedy Decision Trees with MDPs

Hector Kohler, Riad Akrour, Philippe Preux

PDF

Open Access 1 Repo

TL;DR

This paper introduces DPDT, a novel decision tree framework that combines Markov Decision Processes with heuristic split generation to produce near-optimal trees efficiently, outperforming traditional greedy and existing optimal methods.

Contribution

DPDT is a new approach that bridges greedy and optimal decision trees using MDPs and heuristics, achieving near-optimal solutions with reduced computational complexity.

Findings

01

DPDT achieves near-optimal training loss on multiple datasets.

02

DPDT outperforms CART and existing optimal decision trees in generalization.

03

DPDT is effective in boosting applications, outperforming baselines.

Abstract

In supervised learning, decision trees are valued for their interpretability and performance. While greedy decision tree algorithms like CART remain widely used due to their computational efficiency, they often produce sub-optimal solutions with respect to a regularized training loss. Conversely, optimal decision tree methods can find better solutions but are computationally intensive and typically limited to shallow trees or binary features. We present Dynamic Programming Decision Trees (DPDT), a framework that bridges the gap between greedy and optimal approaches. DPDT relies on a Markov Decision Process formulation combined with heuristic split generation to construct near-optimal decision trees with significantly reduced computational complexity. Our approach dynamically limits the set of admissible splits at each node while directly optimizing the tree regularized training loss.…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

KohlerHECTOR/DPDTreeEstimator
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Data Classification · Reinforcement Learning in Robotics