On the transferability of adversarial examples between convex and 01 loss models
Yunzhe Xue, Meiyan Xie, Usman Roshan

TL;DR
This paper investigates how adversarial examples transfer between models using 01 loss and convex loss functions, revealing that non-transferability is influenced by outliers and decision boundary differences, impacting robustness and attack effectiveness.
Contribution
It provides empirical evidence that adversarial transferability is limited between 01 loss and convex models due to boundary differences caused by outliers.
Findings
Adversarial examples do not transfer effectively between 01 loss and convex models.
Convex substitute model attacks are less effective on 01 loss models.
Outliers influence decision boundaries, affecting transferability and robustness.
Abstract
The 01 loss gives different and more accurate boundaries than convex loss models in the presence of outliers. Could the difference of boundaries translate to adversarial examples that are non-transferable between 01 loss and convex models? We explore this empirically in this paper by studying transferability of adversarial examples between linear 01 loss and convex (hinge) loss models, and between dual layer neural networks with sign activation and 01 loss vs sigmoid activation and logistic loss. We first show that white box adversarial examples do not transfer effectively between convex and 01 loss and between 01 loss models compared to between convex models. As a result of this non-transferability we see that convex substitute model black box attacks are less effective on 01 loss than convex models. Interestingly we also see that 01 loss substitute model attacks are ineffective on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Integrated Circuits and Semiconductor Failure Analysis
MethodsSigmoid Activation
