Learning Multi-Layered GBDT Via Back Propagation

Zhendong Zhang

arXiv:2109.11863·cs.LG·September 28, 2021

Learning Multi-Layered GBDT Via Back Propagation

Zhendong Zhang

PDF

Open Access

TL;DR

This paper introduces a novel framework for training multi-layered gradient boosting decision trees using back propagation, enabling deep tree-based models to learn hierarchical representations.

Contribution

It proposes a method to approximate GBDT gradients with linear regression, allowing back propagation for multi-layered GBDT, which was previously non-differentiable.

Findings

01

Effective performance improvements demonstrated

02

Enhanced representation learning capability

03

First to optimize multi-layered GBDT via back propagation

Abstract

Deep neural networks are able to learn multi-layered representation via back propagation (BP). Although the gradient boosting decision tree (GBDT) is effective for modeling tabular data, it is non-differentiable with respect to its input, thus suffering from learning multi-layered representation. In this paper, we propose a framework of learning multi-layered GBDT via BP. We approximate the gradient of GBDT based on linear regression. Specifically, we use linear regression to replace the constant value at each leaf ignoring the contribution of individual samples to the tree structure. In this way, we estimate the gradient for intermediate representations, which facilitates BP for multi-layered GBDT. Experiments show the effectiveness of the proposed method in terms of performance and representation ability. To the best of our knowledge, this is the first work of optimizing multi-layered…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Advanced Data Compression Techniques · Advanced Image Processing Techniques

MethodsLinear Regression