Byzantine Machine Learning: MultiKrum and an optimal notion of robustness
Gilles Bareilles, Wassim Bouaziz, Julien Fageot, El-Mahdi El-Mhamdi

TL;DR
This paper proves the robustness of MultiKrum aggregation in Byzantine machine learning, introduces an optimal robustness coefficient, and compares it with Krum, providing theoretical bounds and empirical validation.
Contribution
It provides the first theoretical proof of MultiKrum's robustness and introduces a new optimal robustness coefficient for aggregation rules.
Findings
MultiKrum is proven to be a robust aggregation rule.
Bounds on MultiKrum's robustness coefficient are established.
MultiKrum outperforms Krum in realistic regimes.
Abstract
Aggregation rules are the cornerstone of distributed (or federated) learning in the presence of adversaries, under the so-called Byzantine threat model. They are also interesting mathematical objects from the point of view of robust mean estimation. The Krum aggregation rule has been extensively studied, and endowed with formal robustness and convergence guarantees. Yet, MultiKrum, a natural extension of Krum, is often preferred in practice for its superior empirical performance, even though no theoretical guarantees were available until now. In this work, we provide the first proof that MultiKrum is a robust aggregation rule, and bound its robustness coefficient. To do so, we introduce , the optimal *robustness coefficient* of an aggregation rule, which quantifies the accuracy of mean estimation in the presence of adversaries in a tighter manner compared with previously…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Ethics and Social Impacts of AI
