Contributor-Aware Defenses Against Adversarial Backdoor Attacks
Glenn Dawson, Muhammad Umer, Robi Polikar

TL;DR
This paper introduces a contributor-aware defense framework for neural networks that leverages multiple data sources and semi-supervised learning to effectively mitigate adversarial backdoor attacks without relying on pattern detection.
Contribution
It proposes a novel contributor-aware approach that uses ensemble learning and crowdsourcing principles to defend against backdoor attacks without pattern detection.
Findings
Robustness against multiple simultaneous backdoor attacks.
Effective filtering of false labels from adversarial triggers.
Defense strategy is agnostic to backdoor pattern design.
Abstract
Deep neural networks for image classification are well-known to be vulnerable to adversarial attacks. One such attack that has garnered recent attention is the adversarial backdoor attack, which has demonstrated the capability to perform targeted misclassification of specific examples. In particular, backdoor attacks attempt to force a model to learn spurious relations between backdoor trigger patterns and false labels. In response to this threat, numerous defensive measures have been proposed; however, defenses against backdoor attacks focus on backdoor pattern detection, which may be unreliable against novel or unexpected types of backdoor pattern designs. We introduce a novel re-contextualization of the adversarial setting, where the presence of an adversary implicitly admits the existence of multiple database contributors. Then, under the mild assumption of contributor awareness, it…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
