TL;DR
FedAUX enhances federated distillation by utilizing unlabeled auxiliary data with unsupervised pre-training and differential privacy-based weighting, significantly improving training performance over state-of-the-art methods in various data regimes.
Contribution
FedAUX introduces a novel extension to federated distillation that leverages unlabeled auxiliary data through pre-training and privacy-aware weighting, boosting performance in federated learning.
Findings
Outperforms SOTA federated learning baselines in large-scale experiments.
Effectively closes the gap to centralized training performance.
Works well in both iid and non-iid data settings.
Abstract
Federated Distillation (FD) is a popular novel algorithmic paradigm for Federated Learning, which achieves training performance competitive to prior parameter averaging based methods, while additionally allowing the clients to train different model architectures, by distilling the client predictions on an unlabeled auxiliary set of data into a student model. In this work we propose FedAUX, an extension to FD, which, under the same set of assumptions, drastically improves performance by deriving maximum utility from the unlabeled auxiliary data. FedAUX modifies the FD training procedure in two ways: First, unsupervised pre-training on the auxiliary data is performed to find a model initialization for the distributed training. Second, -differentially private certainty scoring is used to weight the ensemble predictions on the auxiliary data according to the certainty…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
