Task-wise Split Gradient Boosting Trees for Multi-center Diabetes Prediction
Mingcheng Chen, Zhenghui Wang, Zhiyun Zhao, Weinan Zhang, Xiawei Guo,, Jian Shen, Yanru Qu, Jieli Lu, Min Xu, Yu Xu, Tiange Wang, Mian Li, Wei-Wei, Tu, Yong Yu, Yufang Bi, Weiqing Wang, Guang Ning

TL;DR
This paper introduces Task-wise Split Gradient Boosting Trees (TSGB), a novel method for multi-center diabetes prediction that effectively handles data heterogeneity and insufficiency, outperforming existing methods and being deployed in real-world software.
Contribution
The paper proposes TSGB, a new GBDT split method based on task gain, addressing negative task gain issues in multi-task learning for diabetes prediction.
Findings
TSGB outperforms state-of-the-art methods on real-world datasets.
Theoretical analysis of GBDT's learning objective and negative task gain.
Deployment of TSGB in online diabetes risk assessment software.
Abstract
Diabetes prediction is an important data science application in the social healthcare domain. There exist two main challenges in the diabetes prediction task: data heterogeneity since demographic and metabolic data are of different types, data insufficiency since the number of diabetes cases in a single medical center is usually limited. To tackle the above challenges, we employ gradient boosting decision trees (GBDT) to handle data heterogeneity and introduce multi-task learning (MTL) to solve data insufficiency. To this end, Task-wise Split Gradient Boosting Trees (TSGB) is proposed for the multi-center diabetes prediction task. Specifically, we firstly introduce task gain to evaluate each task separately during tree construction, with a theoretical analysis of GBDT's learning objective. Secondly, we reveal a problem when directly applying GBDT in MTL, i.e., the negative task gain…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
