The Tradeoff Between Privacy and Accuracy in Anomaly Detection Using Federated XGBoost

Mengwei Yang; Linqi Song; Jie Xu; Congduan Li; Guozhen Tan

arXiv:1907.07157·cs.LG·October 15, 2019·23 cites

TL;DR

This paper introduces a federated XGBoost algorithm for anomaly detection that balances privacy and accuracy by aggregating data and focusing updates on misclassified samples, and it reports better results than existing methods.

Contribution

The paper proposes a novel federated XGBoost method that uses data aggregation and sparse updates to improve the privacy-accuracy tradeoff in anomaly detection.

Findings

01 Effective privacy-accuracy balance achieved

02 Outperforms existing anomaly detection methods

03 Sparse model updates improve learning from unbalanced data

Abstract

Privacy has raised considerable concerns recently, especially with the advent of the information explosion and the numerous data mining techniques used to explore the information inside large volumes of data. In this context, a new distributed learning paradigm termed federated learning has recently become prominent for tackling privacy issues in distributed learning: only learning models are transmitted from the distributed nodes to servers, without revealing users' own data, which protects users' privacy. In this paper, we propose a horizontal federated XGBoost algorithm to solve the federated anomaly detection problem, where anomaly detection aims to identify abnormalities in extremely unbalanced datasets and can be considered a special classification problem. Our proposed federated XGBoost algorithm incorporates data aggregation and sparse federated update processes…
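
The horizontal setting described in the abstract, where nodes hold different samples over the same features and only model-related quantities leave each node, can be illustrated with a generic sketch. This is not the paper's algorithm: its data aggregation and sparse federated update steps are not detailed in the truncated abstract, so none of that is reproduced here. The sketch assumes a single feature, a shared set of bin edges (`BIN_EDGES`), a starting prediction of 0.5, and hypothetical helper names (`local_histogram`, `aggregate`, `best_split`). Each node sends per-bin sums of logistic-loss gradients and Hessians, and the server scores candidate splits with the standard XGBoost gain formula (without the complexity penalty).

```python
from bisect import bisect_right

# Assumption: all nodes agree on these bin boundaries in advance (3 edges, 4 bins).
BIN_EDGES = [0.25, 0.5, 0.75]


def local_histogram(xs, ys, base_pred=0.5):
    """Runs on a node: per-bin sums of gradients and Hessians of the logistic loss."""
    n_bins = len(BIN_EDGES) + 1
    g_hist = [0.0] * n_bins
    h_hist = [0.0] * n_bins
    for x, y in zip(xs, ys):
        b = bisect_right(BIN_EDGES, x)                # bin index of this sample
        g_hist[b] += base_pred - y                    # gradient: p - y
        h_hist[b] += base_pred * (1.0 - base_pred)    # Hessian: p * (1 - p)
    return g_hist, h_hist


def aggregate(histograms):
    """Runs on the server: element-wise sum of the nodes' histograms."""
    n_bins = len(BIN_EDGES) + 1
    g_total = [0.0] * n_bins
    h_total = [0.0] * n_bins
    for g_hist, h_hist in histograms:
        for b in range(n_bins):
            g_total[b] += g_hist[b]
            h_total[b] += h_hist[b]
    return g_total, h_total


def best_split(g_hist, h_hist, lam=1.0):
    """Scan bin edges and return (edge, gain) for the highest-gain split."""
    G, H = sum(g_hist), sum(h_hist)
    best_gain, best_edge = 0.0, None
    gl = hl = 0.0
    for b, edge in enumerate(BIN_EDGES):
        gl += g_hist[b]                               # samples with x < edge go left
        hl += h_hist[b]
        gr, hr = G - gl, H - hl
        gain = 0.5 * (gl * gl / (hl + lam) + gr * gr / (hr + lam) - G * G / (H + lam))
        if gain > best_gain:
            best_gain, best_edge = gain, edge
    return best_edge, best_gain


# Toy unbalanced data: label 1 marks an anomaly, and raw samples never leave a node.
node_a = ([0.1, 0.2, 0.3, 0.9], [0, 0, 0, 1])
node_b = ([0.15, 0.4, 0.6, 0.8], [0, 0, 0, 1])

g_hist, h_hist = aggregate([local_histogram(*node_a), local_histogram(*node_b)])
edge, gain = best_split(g_hist, h_hist)
print(edge)  # 0.75: the split that separates the two anomalies from the normal points
```

Because the histograms are plain sums, the server obtains the same split statistics it would compute on the pooled data, while each node reveals only per-bin totals. Those totals can still leak information about individual samples, which is one place where a privacy-accuracy tradeoff arises.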

Code & Models

Repositories

Raymw/Federated-XGBoost (Official)

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Network Security and Intrusion Detection · Digital and Cyber Forensics