Probabilistic Inference for Learning from Untrusted Sources
Duc Thien Nguyen, Shiau Hoong Lim, Laura Wynter, Desmond Cai

TL;DR
This paper introduces Bayesian inference-based aggregation algorithms for federated learning that adaptively infer the quality of untrusted parties without needing a reference dataset, improving robustness and performance.
Contribution
It presents novel federated learning aggregation methods that infer party quality in the absence of a reference dataset, inspired by crowdsourcing and collaborative filtering techniques.
Findings
Algorithms outperform standard aggregation methods on synthetic data.
Algorithms outperform robust aggregation methods on real data.
Improved robustness to untrusted or corrupted parties.
Abstract
Federated learning brings potential benefits of faster learning, better solutions, and a greater propensity to transfer when heterogeneous data from different parties increases diversity. However, because federated learning tasks tend to be large and complex, and training times non-negligible, it is important for the aggregation algorithm to be robust to non-IID data and corrupted parties. This robustness relies on the ability to identify, and appropriately weight, incompatible parties. Recent work assumes that a \textit{reference dataset} is available through which to perform the identification. We consider settings where no such reference dataset is available; rather, the quality and suitability of the parties needs to be \textit{inferred}. We do so by bringing ideas from crowdsourced predictions and collaborative filtering, where one must infer an unknown ground truth given proposals…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMobile Crowdsensing and Crowdsourcing · Privacy-Preserving Technologies in Data · Data Stream Mining Techniques
