Team up GBDTs and DNNs: Advancing Efficient and Effective Tabular Prediction with Tree-hybrid MLPs
Jiahuan Yan, Jintai Chen, Qianxing Wang, Danny Z. Chen, Jian Wu

TL;DR
This paper introduces T-MLP, a hybrid model combining GBDT features and DNN architecture, achieving efficient and effective tabular prediction with less training time and competitive performance across numerous datasets.
Contribution
The paper proposes a novel Tree-hybrid MLP framework that integrates GBDT features and DNNs, addressing model selection and training efficiency issues in tabular data prediction.
Findings
T-MLP matches the performance of well-tuned GBDTs and DNNs on 88 datasets.
T-MLP reduces training time significantly compared to traditional models.
The framework achieves compact model storage and high efficiency.
Abstract
Tabular datasets play a crucial role in various applications. Thus, developing efficient, effective, and widely compatible prediction algorithms for tabular data is important. Currently, two prominent model types, Gradient Boosted Decision Trees (GBDTs) and Deep Neural Networks (DNNs), have demonstrated performance advantages on distinct tabular prediction tasks. However, selecting an effective model for a specific tabular dataset is challenging, often demanding time-consuming hyperparameter tuning. To address this model selection dilemma, this paper proposes a new framework that amalgamates the advantages of both GBDTs and DNNs, resulting in a DNN algorithm that is as efficient as GBDTs and is competitively effective regardless of dataset preferences for GBDTs or DNNs. Our idea is rooted in an observation that deep learning (DL) offers a larger parameter space that can represent a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Music and Audio Processing · Topic Modeling
MethodsPruning
