Learning Multi-Layered GBDT Via Back Propagation
Zhendong Zhang

TL;DR
This paper introduces a novel framework for training multi-layered gradient boosting decision trees using back propagation, enabling deep tree-based models to learn hierarchical representations.
Contribution
It proposes a method to approximate GBDT gradients with linear regression, allowing back propagation for multi-layered GBDT, which was previously non-differentiable.
Findings
Effective performance improvements demonstrated
Enhanced representation learning capability
First to optimize multi-layered GBDT via back propagation
Abstract
Deep neural networks are able to learn multi-layered representation via back propagation (BP). Although the gradient boosting decision tree (GBDT) is effective for modeling tabular data, it is non-differentiable with respect to its input, thus suffering from learning multi-layered representation. In this paper, we propose a framework of learning multi-layered GBDT via BP. We approximate the gradient of GBDT based on linear regression. Specifically, we use linear regression to replace the constant value at each leaf ignoring the contribution of individual samples to the tree structure. In this way, we estimate the gradient for intermediate representations, which facilitates BP for multi-layered GBDT. Experiments show the effectiveness of the proposed method in terms of performance and representation ability. To the best of our knowledge, this is the first work of optimizing multi-layered…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Advanced Data Compression Techniques · Advanced Image Processing Techniques
MethodsLinear Regression
