Towards A Conceptually Simple Defensive Approach for Few-shot   classifiers Against Adversarial Support Samples

Yi Xiang Marcus Tan; Penny Chong; Jiamei Sun; Ngai-man Cheung; Yuval; Elovici; Alexander Binder

arXiv:2110.12357·cs.LG·October 26, 2021

Towards A Conceptually Simple Defensive Approach for Few-shot classifiers Against Adversarial Support Samples

Yi Xiang Marcus Tan, Penny Chong, Jiamei Sun, Ngai-man Cheung, Yuval, Elovici, Alexander Binder

PDF

Open Access

TL;DR

This paper introduces a simple, attack-agnostic detection method for few-shot classifiers to identify adversarial support sets, demonstrating effectiveness across datasets, classifiers, and attack strengths, and highlighting its generalizability.

Contribution

The work proposes a novel, conceptually simple detection approach based on self-similarity for defending few-shot classifiers against support set poisoning attacks.

Findings

01

Effective detection of adversarial support sets on miniImagenet and CUB datasets.

02

Outperforms baseline methods across multiple classifiers and attack strengths.

03

Demonstrates the generalizability of the detection approach with different filtering functions.

Abstract

Few-shot classifiers have been shown to exhibit promising results in use cases where user-provided labels are scarce. These models are able to learn to predict novel classes simply by training on a non-overlapping set of classes. This can be largely attributed to the differences in their mechanisms as compared to conventional deep networks. However, this also offers new opportunities for novel attackers to induce integrity attacks against such models, which are not present in other machine learning setups. In this work, we aim to close this gap by studying a conceptually simple approach to defend few-shot classifiers against adversarial attacks. More specifically, we propose a simple attack-agnostic detection method, using the concept of self-similarity and filtering, to flag out adversarial support sets which destroy the understanding of a victim classifier for a certain class. Our…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning