The Tradeoff Between Privacy and Accuracy in Anomaly Detection Using Federated XGBoost
Mengwei Yang, Linqi Song, Jie Xu, Congduan Li, Guozhen Tan

TL;DR
This paper introduces a federated XGBoost algorithm for anomaly detection that balances privacy and accuracy by aggregating data and focusing on misclassified samples, demonstrating effectiveness over existing methods.
Contribution
The paper proposes a novel federated XGBoost method with data aggregation and sparse updates to enhance privacy-accuracy tradeoff in anomaly detection.
Findings
Effective privacy-accuracy balance achieved
Outperforms existing anomaly detection methods
Sparse model updates improve learning from unbalanced data
Abstract
Privacy has raised considerable concerns recently, especially with the advent of information explosion and numerous data mining techniques to explore the information inside large volumes of data. In this context, a new distributed learning paradigm termed federated learning becomes prominent recently to tackle the privacy issues in distributed learning, where only learning models will be transmitted from the distributed nodes to servers without revealing users' own data and hence protecting the privacy of users. In this paper, we propose a horizontal federated XGBoost algorithm to solve the federated anomaly detection problem, where the anomaly detection aims to identify abnormalities from extremely unbalanced datasets and can be considered as a special classification problem. Our proposed federated XGBoost algorithm incorporates data aggregation and sparse federated update processes…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Network Security and Intrusion Detection · Digital and Cyber Forensics
