Cross-domain Cross-architecture Black-box Attacks on Fine-tuned Models   with Transferred Evolutionary Strategies

Yinghua Zhang; Yangqiu Song; Kun Bai; Qiang Yang

arXiv:2208.13182·cs.LG·August 30, 2022

Cross-domain Cross-architecture Black-box Attacks on Fine-tuned Models with Transferred Evolutionary Strategies

Yinghua Zhang, Yangqiu Song, Kun Bai, Qiang Yang

PDF

1 Repo

TL;DR

This paper introduces novel black-box attack methods on fine-tuned models across different domains and architectures, utilizing transferred evolutionary strategies to generate adversarial examples efficiently.

Contribution

It proposes two new BAFT settings and a method using an adversarial generator and latent space search guided by surrogate gradients.

Findings

01

Effective attack on fine-tuned models across domains

02

Efficient attack method with low-dimensional search

03

Demonstrated success on various architectures

Abstract

Fine-tuning can be vulnerable to adversarial attacks. Existing works about black-box attacks on fine-tuned models (BAFT) are limited by strong assumptions. To fill the gap, we propose two novel BAFT settings, cross-domain and cross-domain cross-architecture BAFT, which only assume that (1) the target model for attacking is a fine-tuned model, and (2) the source domain data is known and accessible. To successfully attack fine-tuned models under both settings, we propose to first train an adversarial generator against the source model, which adopts an encoder-decoder architecture and maps a clean input to an adversarial example. Then we search in the low-dimensional latent space produced by the encoder of the adversarial generator. The search is conducted under the guidance of the surrogate gradient obtained from the source model. Experimental results on different domains and different…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hkust-knowcomp/tes
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.