Efficient distributional regression trees learning algorithms for calibrated non-parametric probabilistic forecasts
Quentin Duchemin, Guillaume Obozinski

TL;DR
This paper presents efficient algorithms for training probabilistic regression trees that optimize proper scoring rules like WIS and CRPS, enhancing uncertainty estimation in non-parametric regression.
Contribution
It introduces novel, computationally efficient algorithms for distributional regression trees based on proper scoring rules, improving probabilistic forecasts.
Findings
Algorithms are computationally efficient using known data structures.
Methods achieve competitive performance with existing approaches.
Trees provide interpretable models with group-conditional coverage guarantees.
Abstract
The perspective of developing trustworthy AI for critical applications in science and engineering requires machine learning techniques that are capable of estimating their own uncertainty. In the context of regression, instead of estimating a conditional mean, this can be achieved by producing a predictive interval for the output, or to even learn a model of the conditional probability of an output given input features . While this can be done under parametric assumptions with, e.g. generalized linear model, these are typically too strong, and non-parametric models offer flexible alternatives. In particular, for scalar outputs, learning directly a model of the conditional cumulative distribution function of given can lead to more precise probabilistic estimates, and the use of proper scoring rules such as the weighted interval score (WIS) and the continuous…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
