Mitigating Data Absence in Federated Learning Using Privacy-Controllable Data Digests
Chih-Fan Hsu, Ming-Ching Chang, Wei-Chao Chen

TL;DR
This paper introduces FedDig, a privacy-controllable data digest framework for federated learning that effectively mitigates data absence issues and adapts to distribution changes, improving model performance across diverse scenarios.
Contribution
FedDig is a novel framework that manages distribution shifts in federated learning using privacy-adjustable data digests, enhancing robustness and extensibility.
Findings
FedDig outperforms baseline algorithms in multiple data absence scenarios.
FedDig's hyperparameters enable effective adaptation to various data distributions.
The framework is compatible with existing federated learning algorithms.
Abstract
The absence of training data and their distribution changes in federated learning (FL) can significantly undermine model performance, especially in cross-silo scenarios. To address this challenge, we introduce the Federated Learning with Data Digest (FedDig) framework. FedDig manages unexpected distribution changes using a novel privacy-controllable data digest representation. This framework allows FL users to adjust the protection levels of the digest by manipulating hyperparameters that control the mixing of multiple low-dimensional features and applying differential privacy perturbation to these mixed features. Evaluation of FedDig across four diverse public datasets shows that it consistently outperforms five baseline algorithms by substantial margins in various data absence scenarios. We also thoroughly explored FedDig's hyperparameters, demonstrating its adaptability. Notably, the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
