Approximation-Free Differentiable Oblique Decision Trees
Subrat Prasad Panda, Blaise Genest, Arvind Easwaran

TL;DR
This paper introduces DTSemNet, a novel method for training oblique decision trees using exact gradients without approximations, improving performance on classification, regression, and reinforcement learning tasks.
Contribution
DTSemNet provides an invertible, semantically equivalent neural network representation of hard oblique decision trees, enabling end-to-end gradient-based training without approximation.
Findings
DTSemNet outperforms state-of-the-art differentiable decision trees on benchmarks.
The annealed Top-k method effectively replaces STE for gradient estimation in regression.
DTSemNet can be used as programmatic policies in reinforcement learning environments.
Abstract
Decision Trees (DTs) are widely used in safety-critical domains such as medical diagnosis, valued for their interpretability and effectiveness on tabular data. However, training accurate oblique DTs is challenging due to complex optimization landscapes and overfitting risks, particularly in regression. Recent advances have introduced differentiable formulations that enable gradient-based training and joint optimization of decision boundaries and leaf regressors. Yet, existing approaches typically rely on approximations, either through probabilistic softening of boundaries (soft DTs) or quantized gradients such as the Straight-Through Estimator (STE). To overcome these limitations, we propose DTSemNet, a novel, semantically equivalent, and invertible representation of hard oblique DTs as neural networks. DTSemNet enables end-to-end training with standard gradient descent, eliminating the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
