# Block-distributed Gradient Boosted Trees

**Authors:** Theodore Vasiloudis, Hyunsu Cho, Henrik Bostr\"om

arXiv: 1904.10522 · 2019-05-30

## TL;DR

This paper introduces block-distributed Gradient Boosted Trees that efficiently handle high-dimensional, sparse data by reducing communication costs and improving scalability across data points and features.

## Contribution

It proposes a novel block-distributed GBT framework that leverages data sparsity and adapts the Quickscorer algorithm for scalable, communication-efficient training on high-dimensional datasets.

## Key findings

- Achieves multiple orders of magnitude reduction in communication cost for sparse data
- Maintains accuracy while significantly reducing training time for high-dimensional data
- Enables cost-effective scale-out without expensive network communication

## Abstract

The Gradient Boosted Tree (GBT) algorithm is one of the most popular machine learning algorithms used in production, for tasks that include Click-Through Rate (CTR) prediction and learning-to-rank. To deal with the massive datasets available today, many distributed GBT methods have been proposed. However, they all assume a row-distributed dataset, addressing scalability only with respect to the number of data points and not the number of features, and increasing communication cost for high-dimensional data. In order to allow for scalability across both the data point and feature dimensions, and reduce communication cost, we propose block-distributed GBTs. We achieve communication efficiency by making full use of the data sparsity and adapting the Quickscorer algorithm to the block-distributed setting. We evaluate our approach using datasets with millions of features, and demonstrate that we are able to achieve multiple orders of magnitude reduction in communication cost for sparse data, with no loss in accuracy, while providing a more scalable design. As a result, we are able to reduce the training time for high-dimensional data, and allow more cost-effective scale-out without the need for expensive network communication.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.10522/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/1904.10522/full.md

## References

12 references — full list in the complete paper: https://tomesphere.com/paper/1904.10522/full.md

---
Source: https://tomesphere.com/paper/1904.10522