SecureBoost: A Lossless Federated Learning Framework
Kewei Cheng, Tao Fan, Yilun Jin, Yang Liu, Tianjian Chen, Dimitrios, Papadopoulos, Qiang Yang

TL;DR
SecureBoost is a novel federated learning framework that enables lossless, privacy-preserving gradient boosting across multiple parties with vertically partitioned data, maintaining accuracy comparable to centralized models.
Contribution
It introduces a lossless, privacy-preserving federated gradient boosting system with entity alignment and encryption, suitable for vertically partitioned data in industrial applications.
Findings
Achieves the same accuracy as centralized gradient boosting.
Ensures no information leakage during protocol execution.
Scalable and practical for real-world applications like credit risk analysis.
Abstract
The protection of user privacy is an important concern in machine learning, as evidenced by the rolling out of the General Data Protection Regulation (GDPR) in the European Union (EU) in May 2018. The GDPR is designed to give users more control over their personal data, which motivates us to explore machine learning frameworks for data sharing that do not violate user privacy. To meet this goal, in this paper, we propose a novel lossless privacy-preserving tree-boosting system known as SecureBoost in the setting of federated learning. SecureBoost first conducts entity alignment under a privacy-preserving protocol and then constructs boosting trees across multiple parties with a carefully designed encryption strategy. This federated learning system allows the learning process to be jointly conducted over multiple parties with common user samples but different feature sets, which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques
