End-to-end Learning of Deterministic Decision Trees

Thomas Hehn; Fred A. Hamprecht

arXiv:1712.02743·stat.ML·December 8, 2017

End-to-end Learning of Deterministic Decision Trees

Thomas Hehn, Fred A. Hamprecht

PDF

1 Repo

TL;DR

This paper introduces the first end-to-end trainable scheme for deterministic decision trees, combining probabilistic training with a deterministic test phase, and demonstrates competitive results on image datasets.

Contribution

It proposes a novel model and EM training scheme for fully probabilistic decision trees that become deterministic after annealing, enabling end-to-end learning.

Findings

01

Achieves results comparable or superior to existing oblique decision tree algorithms.

02

Analyzes learned split parameters on image datasets.

03

Shows neural networks can be trained at each split node.

Abstract

Conventional decision trees have a number of favorable properties, including interpretability, a small computational footprint and the ability to learn from little training data. However, they lack a key quality that has helped fuel the deep learning revolution: that of being end-to-end trainable, and to learn from scratch those features that best allow to solve a given supervised learning problem. Recent work (Kontschieder 2015) has addressed this deficit, but at the cost of losing a main attractive trait of decision trees: the fact that each sample is routed along a small subset of tree nodes only. We here propose a model and Expectation-Maximization training scheme for decision trees that are fully probabilistic at train time, but after a deterministic annealing process become deterministic at test time. We also analyze the learned oblique split parameters on image datasets and show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tomsal/endtoenddecisiontrees
pytorch

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.