Federated Boosted Decision Trees with Differential Privacy

Samuel Maddock; Graham Cormode; Tianhao Wang; Carsten Maple; Somesh; Jha

arXiv:2210.02910·cs.CR·November 9, 2022·5 cites

Federated Boosted Decision Trees with Differential Privacy

Samuel Maddock, Graham Cormode, Tianhao Wang, Carsten Maple, Somesh, Jha

PDF

Open Access 2 Repos

TL;DR

This paper introduces a federated learning framework for gradient boosted decision trees that incorporates differential privacy, balancing high utility with strong privacy guarantees in distributed data settings.

Contribution

It presents a novel framework for differentially private federated GBDT models, extending existing methods and optimizing privacy-utility trade-offs.

Findings

01

Achieves high utility while maintaining strong privacy guarantees.

02

Extends existing federated decision tree approaches with differential privacy.

03

Provides a flexible framework adaptable to various hyperparameters.

Abstract

There is great demand for scalable, secure, and efficient privacy-preserving machine learning models that can be trained over distributed data. While deep learning models typically achieve the best results in a centralized non-secure setting, different models can excel when privacy and communication constraints are imposed. Instead, tree-based approaches such as XGBoost have attracted much attention for their high performance and ease of use; in particular, they often achieve state-of-the-art results on tabular data. Consequently, several recent works have focused on translating Gradient Boosted Decision Tree (GBDT) models like XGBoost into federated settings, via cryptographic mechanisms such as Homomorphic Encryption (HE) and Secure Multi-Party Computation (MPC). However, these do not always provide formal privacy guarantees, or consider the full range of hyperparameters and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Stochastic Gradient Optimization Techniques