ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning Paradigms
Minzhou Pan, Yi Zeng, Lingjuan Lyu, Xue Lin, Ruoxi Jia

TL;DR
This paper introduces ASSET, a new backdoor detection method that effectively works across supervised, self-supervised, and transfer learning paradigms, outperforming existing methods especially against challenging clean-label attacks.
Contribution
The paper presents ASSET, a novel detection technique that actively induces model behavior differences to identify backdoor samples across various deep learning settings.
Findings
ASSET outperforms existing methods in supervised learning for diverse attacks.
It effectively detects the state-of-the-art clean-label backdoor attack.
ASSET significantly improves detection rates in SSL and TL settings.
Abstract
Backdoor data detection is traditionally studied in an end-to-end supervised learning (SL) setting. However, recent years have seen the proliferating adoption of self-supervised learning (SSL) and transfer learning (TL), due to their lesser need for labeled data. Successful backdoor attacks have also been demonstrated in these new settings. However, we lack a thorough understanding of the applicability of existing detection methods across a variety of learning settings. By evaluating 56 attack settings, we show that the performance of most existing detection methods varies significantly across different attacks and poison ratios, and all fail on the state-of-the-art clean-label attack. In addition, they either become inapplicable or suffer large performance losses when applied to SSL and TL. We propose a new detection method called Active Separation via Offset (ASSET), which actively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsCOVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning
Methodsfail
