How to Sift Out a Clean Data Subset in the Presence of Data Poisoning?
Yi Zeng, Minzhou Pan, Himanshu Jahagirdar, Ming Jin, Lingjuan Lyu and, Ruoxi Jia

TL;DR
This paper investigates the difficulty of identifying clean data in poisoned datasets, analyzes existing defenses' vulnerabilities, and introduces Meta-Sift, a method that effectively isolates clean data for robust machine learning.
Contribution
The paper reveals the limitations of current automated and human methods for detecting clean data in poisoned datasets and proposes Meta-Sift, a bilevel optimization approach that achieves perfect precision in identifying clean data.
Findings
Existing defenses fail with less than 1% poisoned data in the base set.
Automated tools and human inspection often perform worse than random chance.
Meta-Sift achieves 100% precision in extracting clean data across various attacks.
Abstract
Given the volume of data needed to train modern machine learning models, external suppliers are increasingly used. However, incorporating external data poses data poisoning risks, wherein attackers manipulate their data to degrade model utility or integrity. Most poisoning defenses presume access to a set of clean data (or base set). While this assumption has been taken for granted, given the fast-growing research on stealthy poisoning attacks, a question arises: can defenders really identify a clean subset within a contaminated dataset to support defenses? This paper starts by examining the impact of poisoned samples on defenses when they are mistakenly mixed into the base set. We analyze five defenses and find that their performance deteriorates dramatically with less than 1% poisoned points in the base set. These findings suggest that sifting out a base set with high precision is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Machine Learning and Data Classification
MethodsBalanced Selection
