TL;DR
The paper introduces Hinge Regression Tree (HRT), a novel method using a Newton-based optimization for oblique decision tree splits, achieving fast convergence and strong approximation capabilities.
Contribution
HRT reframes oblique split learning as a non-linear least-squares problem, providing a theoretically grounded, efficient, and expressive approach for decision trees.
Findings
HRT converges monotonically with fixed and adaptive damping.
HRT matches or outperforms baselines on benchmarks.
HRT has a universal approximation property with explicit rate.
Abstract
Oblique decision trees combine the transparency of trees with the power of multivariate decision boundaries, but learning high-quality oblique splits is NP-hard, and practical methods still rely on slow search or theory-free heuristics. We present the Hinge Regression Tree (HRT), which reframes each split as a non-linear least-squares problem over two linear predictors whose max/min envelope induces ReLU-like expressive power. The resulting alternating fitting procedure is exactly equivalent to a damped Newton (Gauss-Newton) method within fixed partitions. We analyze this node-level optimization and, for a backtracking line-search variant, prove that the local objective decreases monotonically and converges; in practice, both fixed and adaptive damping yield fast, stable convergence and can be combined with optional ridge regularization. We further prove that HRT's model class is a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
