Convergence Rates of Oblique Regression Trees for Flexible Function Libraries
Matias D. Cattaneo, Rajita Chandak, Jason M. Klusowski

TL;DR
This paper provides a theoretical analysis of oblique regression trees, demonstrating their ability to adapt to complex models and achieve accuracy comparable to neural networks, thus bridging the gap between empirical success and theoretical understanding.
Contribution
It introduces a theoretical framework for oblique decision trees, showing they satisfy oracle inequalities and can adapt to rich model libraries, with implications for interpretability and accuracy.
Findings
Oblique trees can adapt to linear combinations of ridge functions.
They achieve similar accuracy to neural networks under certain conditions.
The framework accommodates existing computational tools for hyperplane optimization.
Abstract
We develop a theoretical framework for the analysis of oblique decision trees, where the splits at each decision node occur at linear combinations of the covariates (as opposed to conventional tree constructions that force axis-aligned splits involving only a single covariate). While this methodology has garnered significant attention from the computer science and optimization communities since the mid-80s, the advantages they offer over their axis-aligned counterparts remain only empirically justified, and explanations for their success are largely based on heuristics. Filling this long-standing gap between theory and practice, we show that oblique regression trees (constructed by recursively minimizing squared error) satisfy a type of oracle inequality and can adapt to a rich library of regression models consisting of linear combinations of ridge functions and their limit points. This…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Adversarial Robustness in Machine Learning · Statistical Methods and Inference
