On the transferability of adversarial examples between convex and 01   loss models

Yunzhe Xue; Meiyan Xie; Usman Roshan

arXiv:2006.07800·cs.LG·July 31, 2020·1 cites

On the transferability of adversarial examples between convex and 01 loss models

Yunzhe Xue, Meiyan Xie, Usman Roshan

PDF

Open Access 1 Repo

TL;DR

This paper investigates how adversarial examples transfer between models using 01 loss and convex loss functions, revealing that non-transferability is influenced by outliers and decision boundary differences, impacting robustness and attack effectiveness.

Contribution

It provides empirical evidence that adversarial transferability is limited between 01 loss and convex models due to boundary differences caused by outliers.

Findings

01

Adversarial examples do not transfer effectively between 01 loss and convex models.

02

Convex substitute model attacks are less effective on 01 loss models.

03

Outliers influence decision boundaries, affecting transferability and robustness.

Abstract

The 01 loss gives different and more accurate boundaries than convex loss models in the presence of outliers. Could the difference of boundaries translate to adversarial examples that are non-transferable between 01 loss and convex models? We explore this empirically in this paper by studying transferability of adversarial examples between linear 01 loss and convex (hinge) loss models, and between dual layer neural networks with sign activation and 01 loss vs sigmoid activation and logistic loss. We first show that white box adversarial examples do not transfer effectively between convex and 01 loss and between 01 loss models compared to between convex models. As a result of this non-transferability we see that convex substitute model black box attacks are less effective on 01 loss than convex models. Interestingly we also see that 01 loss substitute model attacks are ineffective on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zero-one-loss/mlp01
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Integrated Circuits and Semiconductor Failure Analysis

MethodsSigmoid Activation