FederBoost: Private Federated Learning for GBDT
Zhihua Tian, Rui Zhang, Xiaoyang Hou, Lingjuan Lyu, Tianyi Zhang, Jian, Liu, Kui Ren

TL;DR
FederBoost introduces a privacy-preserving federated learning framework for gradient boosting decision trees that is efficient, supports both data partition types, and maintains high accuracy comparable to centralized models.
Contribution
It presents a novel federated learning framework for GBDT that eliminates heavy cryptography for vertical data and uses lightweight secure aggregation for horizontal data.
Findings
Achieves accuracy comparable to centralized training.
Runs 4-5 orders of magnitude faster than existing solutions.
Supports both vertical and horizontal data partitioning.
Abstract
Federated Learning (FL) has been an emerging trend in machine learning and artificial intelligence. It allows multiple participants to collaboratively train a better global model and offers a privacy-aware paradigm for model training since it does not require participants to release their original training data. However, existing FL solutions for vertically partitioned data or decision trees require heavy cryptographic operations. In this paper, we propose a framework named FederBoost for private federated learning of gradient boosting decision trees (GBDT). It supports running GBDT over both vertically and horizontally partitioned data. Vertical FederBoost does not require any cryptographic operation and horizontal FederBoost only requires lightweight secure aggregation. The key observation is that the whole training process of GBDT relies on the ordering of the data instead of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Mobile Crowdsensing and Crowdsourcing
