TRBoost: A Generic Gradient Boosting Machine based on Trust-region Method
Jiaqi Luo, Zihao Wei, Junkai Man, Shixin Xu

TL;DR
TRBoost introduces a trust-region based gradient boosting method that balances the flexibility of first-order methods with the performance of second-order algorithms, applicable to arbitrary loss functions without Hessian positivity constraints.
Contribution
This paper presents TRBoost, a novel gradient boosting algorithm using trust-region optimization, enabling application to any loss function while maintaining competitive performance.
Findings
TRBoost is as general as first-order GBMs.
TRBoost achieves performance comparable to second-order GBMs.
Numerical experiments confirm the effectiveness of TRBoost.
Abstract
Gradient Boosting Machines (GBMs) have demonstrated remarkable success in solving diverse problems by utilizing Taylor expansions in functional space. However, achieving a balance between performance and generality has posed a challenge for GBMs. In particular, gradient descent-based GBMs employ the first-order Taylor expansion to ensure applicability to all loss functions, while Newton's method-based GBMs use positive Hessian information to achieve superior performance at the expense of generality. To address this issue, this study proposes a new generic Gradient Boosting Machine called Trust-region Boosting (TRBoost). In each iteration, TRBoost uses a constrained quadratic model to approximate the objective and applies the Trust-region algorithm to solve it and obtain a new learner. Unlike Newton's method-based GBMs, TRBoost does not require the Hessian to be positive definite,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Machine Learning and ELM · Neural Networks and Reservoir Computing
