Approximation-Free Differentiable Oblique Decision Trees

Subrat Prasad Panda; Blaise Genest; Arvind Easwaran

arXiv:2605.07837·cs.LG·May 11, 2026

Approximation-Free Differentiable Oblique Decision Trees

Subrat Prasad Panda, Blaise Genest, Arvind Easwaran

PDF

TL;DR

This paper introduces DTSemNet, a novel method for training oblique decision trees using exact gradients without approximations, improving performance on classification, regression, and reinforcement learning tasks.

Contribution

DTSemNet provides an invertible, semantically equivalent neural network representation of hard oblique decision trees, enabling end-to-end gradient-based training without approximation.

Findings

01

DTSemNet outperforms state-of-the-art differentiable decision trees on benchmarks.

02

The annealed Top-k method effectively replaces STE for gradient estimation in regression.

03

DTSemNet can be used as programmatic policies in reinforcement learning environments.

Abstract

Decision Trees (DTs) are widely used in safety-critical domains such as medical diagnosis, valued for their interpretability and effectiveness on tabular data. However, training accurate oblique DTs is challenging due to complex optimization landscapes and overfitting risks, particularly in regression. Recent advances have introduced differentiable formulations that enable gradient-based training and joint optimization of decision boundaries and leaf regressors. Yet, existing approaches typically rely on approximations, either through probabilistic softening of boundaries (soft DTs) or quantized gradients such as the Straight-Through Estimator (STE). To overcome these limitations, we propose DTSemNet, a novel, semantically equivalent, and invertible representation of hard oblique DTs as neural networks. DTSemNet enables end-to-end training with standard gradient descent, eliminating the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.