Fixing by Mixing: A Recipe for Optimal Byzantine ML under Heterogeneity
Youssef Allouah, Sadegh Farhadkhani, Rachid Guerraoui, Nirupam Gupta,, Rafael Pinot, John Stephan

TL;DR
This paper introduces a novel mechanism called nearest neighbor mixing (NNM) that adapts existing Byzantine ML algorithms to heterogeneous data settings, achieving optimal resilience and superior empirical performance.
Contribution
It proposes NNM, a method that extends homogeneous Byzantine ML solutions to heterogeneous data, ensuring optimal robustness and improved practical results.
Findings
NNM achieves optimal Byzantine resilience under data heterogeneity.
Empirical results outperform existing Byzantine ML solutions.
The method effectively adapts standard algorithms to complex real-world scenarios.
Abstract
Byzantine machine learning (ML) aims to ensure the resilience of distributed learning algorithms to misbehaving (or Byzantine) machines. Although this problem received significant attention, prior works often assume the data held by the machines to be homogeneous, which is seldom true in practical settings. Data heterogeneity makes Byzantine ML considerably more challenging, since a Byzantine machine can hardly be distinguished from a non-Byzantine outlier. A few solutions have been proposed to tackle this issue, but these provide suboptimal probabilistic guarantees and fare poorly in practice. This paper closes the theoretical gap, achieving optimality and inducing good empirical results. In fact, we show how to automatically adapt existing solutions for (homogeneous) Byzantine ML to the heterogeneous setting through a powerful mechanism, we call nearest neighbor mixing (NNM), which…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic Gradient Optimization Techniques · Privacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning
