ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep   Learning Paradigms

Minzhou Pan; Yi Zeng; Lingjuan Lyu; Xue Lin; Ruoxi Jia

arXiv:2302.11408·cs.LG·August 8, 2023·6 cites

ASSET: Robust Backdoor Data Detection Across a Multiplicity of Deep Learning Paradigms

Minzhou Pan, Yi Zeng, Lingjuan Lyu, Xue Lin, Ruoxi Jia

PDF

Open Access 1 Repo

TL;DR

This paper introduces ASSET, a new backdoor detection method that effectively works across supervised, self-supervised, and transfer learning paradigms, outperforming existing methods especially against challenging clean-label attacks.

Contribution

The paper presents ASSET, a novel detection technique that actively induces model behavior differences to identify backdoor samples across various deep learning settings.

Findings

01

ASSET outperforms existing methods in supervised learning for diverse attacks.

02

It effectively detects the state-of-the-art clean-label backdoor attack.

03

ASSET significantly improves detection rates in SSL and TL settings.

Abstract

Backdoor data detection is traditionally studied in an end-to-end supervised learning (SL) setting. However, recent years have seen the proliferating adoption of self-supervised learning (SSL) and transfer learning (TL), due to their lesser need for labeled data. Successful backdoor attacks have also been demonstrated in these new settings. However, we lack a thorough understanding of the applicability of existing detection methods across a variety of learning settings. By evaluating 56 attack settings, we show that the performance of most existing detection methods varies significantly across different attacks and poison ratios, and all fail on the state-of-the-art clean-label attack. In addition, they either become inapplicable or suffer large performance losses when applied to SSL and TL. We propose a new detection method called Active Separation via Offset (ASSET), which actively…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ruoxi-jia-group/asset
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · Anomaly Detection Techniques and Applications · Adversarial Robustness in Machine Learning

Methodsfail