Wide Boosting

Michael T. Horrell

arXiv:2007.09855·cs.LG·November 8, 2022

Wide Boosting

Michael T. Horrell

PDF

Open Access 1 Repo

TL;DR

Wide Boosting enhances traditional Gradient Boosting by inserting a matrix multiplication step, enabling it to better handle multi-dimensional correlated outputs and generate more useful embeddings for downstream tasks.

Contribution

The paper introduces Wide Boosting, a simple modification to Gradient Boosting that improves its ability to model multi-dimensional outputs and produce richer data embeddings.

Findings

01

Wide Boosting outperforms standard Gradient Boosting on multi-dimensional output tasks.

02

WB generates more useful embeddings for downstream prediction tasks.

03

The method is inspired by neural network architectures.

Abstract

Gradient Boosting (GB) is a popular methodology used to solve prediction problems by minimizing a differentiable loss function, $L$ . GB performs very well on tabular machine learning (ML) problems; however, as a pure ML solver it lacks the ability to fit models with probabilistic but correlated multi-dimensional outputs, for example, multiple correlated Bernoulli outputs. GB also does not form intermediate abstract data embeddings, one property of Deep Learning that gives greater flexibility and performance on other types of problems. This paper presents a simple adjustment to GB motivated in part by artificial neural networks. Specifically, our adjustment inserts a matrix multiplication between the output of a GB model and the loss, $L$ . This allows the output of a GB model to have increased dimension prior to being fed into the loss and is thus ``wider'' than standard GB…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

mthorrell/wideboost
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Neural Networks and Applications · Advanced Neural Network Applications