Evaluation of Neural Networks Defenses and Attacks using NDCG and Reciprocal Rank Metrics
Haya Brama, Lihi Dery, Tal Grinshpoun

TL;DR
This paper introduces two novel metrics based on information retrieval concepts to better evaluate the impact of adversarial attacks and defenses on neural network outputs, providing more meaningful insights than traditional accuracy-based metrics.
Contribution
The paper proposes two new metrics inspired by NDCG and reciprocal rank to assess neural network robustness against adversarial attacks, tailored for multiclass classification outputs.
Findings
Proposed metrics outperform traditional accuracy metrics in informativeness.
Metrics effectively distinguish between different attack and defense scenarios.
Evaluation on VGG19 and ImageNet demonstrates practical applicability.
Abstract
The problem of attacks on neural networks through input modification (i.e., adversarial examples) has attracted much attention recently. Being relatively easy to generate and hard to detect, these attacks pose a security breach that many suggested defenses try to mitigate. However, the evaluation of the effect of attacks and defenses commonly relies on traditional classification metrics, without adequate adaptation to adversarial scenarios. Most of these metrics are accuracy-based, and therefore may have a limited scope and low distinctive power. Other metrics do not consider the unique characteristics of neural networks functionality, or measure the effect of the attacks indirectly (e.g., through the complexity of their generation). In this paper, we present two metrics which are specifically designed to measure the effect of attacks, or the recovery effect of defenses, on the output…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
