Performance Evaluation of Adversarial Attacks: Discrepancies and Solutions
Jing Wu, Mingyi Zhou, Ce Zhu, Yipeng Liu, Mehrtash Harandi, Li Li

TL;DR
This paper introduces the PSC toolkit to standardize and improve the evaluation of adversarial attack algorithms, addressing discrepancies and balancing computational costs for more reliable robustness assessments.
Contribution
The paper proposes a novel Piece-wise Sampling Curving (PSC) toolkit that provides a comprehensive and balanced comparison framework for adversarial attack evaluation.
Findings
PSC reduces evaluation discrepancies in practice
Provides a flexible balance between cost and effectiveness
Enables more reliable robustness assessments
Abstract
Recently, adversarial attack methods have been developed to challenge the robustness of machine learning models. However, mainstream evaluation criteria experience limitations, even yielding discrepancies among results under different settings. By examining various attack algorithms, including gradient-based and query-based attacks, we notice the lack of a consensus on a uniform standard for unbiased performance evaluation. Accordingly, we propose a Piece-wise Sampling Curving (PSC) toolkit to effectively address the aforementioned discrepancy, by generating a comprehensive comparison among adversaries in a given range. In addition, the PSC toolkit offers options for balancing the computational cost and evaluation effectiveness. Experimental results demonstrate our PSC toolkit presents comprehensive comparisons of attack algorithms, significantly reducing discrepancies in practice.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · Domain Adaptation and Few-Shot Learning
