Stagewise Boosting Distributional Regression
Mattias Wetscher, Johannes Seiler, Reto Stauffer, Nikolaus Umlauf

TL;DR
This paper introduces a novel stagewise boosting algorithm for distributional regression that addresses gradient vanishing issues, incorporates regularization, and is scalable to large datasets, improving model estimation for complex distributions.
Contribution
It proposes a new stagewise boosting method for distributional regression, combining ideas from stagewise regression and gradient boosting, with regularization and stochastic approximations for large data.
Findings
Effective in complex distributional modeling
Handles large datasets with stochastic approximations
Improves stability with correlation filtering
Abstract
Forward stagewise regression is a simple algorithm that can be used to estimate regularized models. The updating rule adds a small constant to a regression coefficient in each iteration, such that the underlying optimization problem is solved slowly with small improvements. This is similar to gradient boosting, with the essential difference that the step size is determined by the product of the gradient and a step length parameter in the latter algorithm. One often overlooked challenge in gradient boosting for distributional regression is the issue of a vanishing small gradient, which practically halts the algorithm's progress. We show that gradient boosting in this case oftentimes results in suboptimal models, especially for complex problems certain distributional parameters are never updated due to the vanishing gradient. Therefore, we propose a stagewise boosting-type algorithm for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models
