The Built-In Robustness of Decentralized Federated Averaging to Bad Data
Samuele Sabella, Chiara Boldrini, Lorenzo Valerio, Andrea Passarella, Marco Conti

TL;DR
Decentralized federated averaging demonstrates surprising robustness to localized bad data, with the averaging process preventing any single node from disproportionately affecting the overall model, even under severe data corruption scenarios.
Contribution
This paper reveals that decentralized federated averaging inherently resists the influence of corrupted data, a counterintuitive finding that enhances understanding of its robustness without central oversight.
Findings
Decentralized FedAvg is robust to localized bad data.
Robustness increases when corrupted data is concentrated on a single node.
Averaging prevents any node from dominating the learning process.
Abstract
Decentralized federated learning (DFL) enables devices to collaboratively train models over complex network topologies without relying on a central controller. In this setting, local data remains private, but its quality and quantity can vary significantly across nodes. The extent to which a fully decentralized system is vulnerable to poor-quality or corrupted data remains unclear, but several factors could contribute to potential risks. Without a central authority, there can be no unified mechanism to detect or correct errors, and each node operates with a localized view of the data distribution, making it difficult for the node to assess whether its perspective aligns with the true distribution. Moreover, models trained on low-quality data can propagate through the network, amplifying errors. To explore the impact of low-quality data on DFL, we simulate two scenarios with degraded…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Distributed systems and fault tolerance · Complexity and Algorithms in Graphs
