Devling into Adversarial Transferability on Image Classification: Review, Benchmark, and Evaluation

Xiaosen Wang; Zhijin Ge; Bohan Liu; Zheng Fang; Fengfan Zhou; Ruixuan Zhang; Shaokang Wang; Yuyang Luo

arXiv:2602.23117·cs.CV·February 27, 2026

Devling into Adversarial Transferability on Image Classification: Review, Benchmark, and Evaluation

Xiaosen Wang, Zhijin Ge, Bohan Liu, Zheng Fang, Fengfan Zhou, Ruixuan Zhang, Shaokang Wang, Yuyang Luo

PDF

Open Access

TL;DR

This paper reviews and benchmarks adversarial transferability in image classification, proposing a standardized evaluation framework, analyzing strategies to improve transferability, and discussing broader applications beyond images.

Contribution

It introduces a comprehensive benchmark framework for transfer-based attacks, categorizes existing methods, and highlights issues affecting fair comparison and evaluation.

Findings

01

Identified six categories of transfer-based attacks.

02

Proposed a standardized evaluation framework.

03

Highlighted strategies that improve transferability.

Abstract

Adversarial transferability refers to the capacity of adversarial examples generated on the surrogate model to deceive alternate, unexposed victim models. This property eliminates the need for direct access to the victim model during an attack, thereby raising considerable security concerns in practical applications and attracting substantial research attention recently. In this work, we discern a lack of a standardized framework and criteria for evaluating transfer-based attacks, leading to potentially biased assessments of existing approaches. To rectify this gap, we have conducted an exhaustive review of hundreds of related works, organizing various transfer-based attacks into six distinct categories. Subsequently, we propose a comprehensive framework designed to serve as a benchmark for evaluating these attacks. In addition, we delineate common strategies that enhance adversarial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection