Optimization by gradient boosting

G\'erard Biau (LSTA; LPMA); Beno\^it Cadre (ENS Rennes; IRMAR)

arXiv:1707.05023·math.ST·July 18, 2017·5 cites

Optimization by gradient boosting

G\'erard Biau (LSTA, LPMA), Beno\^it Cadre (ENS Rennes, IRMAR)

PDF

Open Access

TL;DR

This paper provides a comprehensive analysis of gradient boosting algorithms, demonstrating their convergence and consistency within a functional optimization framework, emphasizing the role of strong convexity and regularization.

Contribution

It introduces a general framework for analyzing gradient boosting, proving convergence and consistency without early stopping, and highlights the importance of strong convexity and regularization.

Findings

01

Proves convergence of gradient boosting algorithms as iterations increase.

02

Establishes conditions for statistical consistency of boosting predictors.

03

Highlights the role of strong convexity and regularization in boosting performance.

Abstract

Gradient boosting is a state-of-the-art prediction technique that sequentially produces a model in the form of linear combinations of simple predictors---typically decision trees---by solving an infinite-dimensional convex optimization problem. We provide in the present paper a thorough analysis of two widespread versions of gradient boosting, and introduce a general framework for studying these algorithms from the point of view of functional optimization. We prove their convergence as the number of iterations tends to infinity and highlight the importance of having a strongly convex risk functional to minimize. We also present a reasonable statistical context ensuring consistency properties of the boosting predictors as the sample size grows. In our approach, the optimization procedures are run forever (that is, without resorting to an early stopping strategy), and statistical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Methods and Inference · Stochastic Gradient Optimization Techniques · Machine Learning and Algorithms

MethodsEarly Stopping