Contamination Attacks and Mitigation in Multi-Party Machine Learning
Jamie Hayes, Olga Ohrimenko

TL;DR
This paper investigates contamination attacks in multi-party machine learning, demonstrating how malicious data can taint models and proposing adversarial training as a mitigation strategy to protect privacy and model integrity.
Contribution
It introduces the problem of contamination attacks in multi-party ML and proposes adversarial training as an effective defense mechanism.
Findings
Adversarial training prevents models from learning party-specific data trends.
Contamination attacks can significantly degrade model performance.
Adversarial defense also enhances party-level membership privacy.
Abstract
Machine learning is data hungry; the more data a model has access to in training, the more likely it is to perform well at inference time. Distinct parties may want to combine their local data to gain the benefits of a model trained on a large corpus of data. We consider such a case: parties get access to the model trained on their joint data but do not see each others individual datasets. We show that one needs to be careful when using this multi-party model since a potentially malicious party can taint the model by providing contaminated data. We then show how adversarial training can defend against such attacks by preventing the model from learning trends specific to individual parties data, thereby also guaranteeing party-level membership privacy.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data
