Showing Many Labels in Multi-label Classification Models: An Empirical Study of Adversarial Examples
Yujiang Liu, Wenjian Luo, Zhijian Chen, Muhammad Luqman Naseem

TL;DR
This paper investigates adversarial attacks in multi-label classification, introducing a new attack type called 'Showing Many Labels' that aims to maximize predicted labels, revealing vulnerabilities in DNNs.
Contribution
The study introduces a novel multi-label adversarial attack method and evaluates its effectiveness across multiple datasets and models, highlighting the vulnerability of DNNs to such attacks.
Findings
Iterative attacks outperform one-step attacks in showing many labels.
It is possible to show all labels in the dataset using the proposed attack.
The attack success rate varies across different algorithms and scenarios.
Abstract
With the rapid development of Deep Neural Networks (DNNs), they have been applied in numerous fields. However, research indicates that DNNs are susceptible to adversarial examples, and this is equally true in the multi-label domain. To further investigate multi-label adversarial examples, we introduce a novel type of attacks, termed "Showing Many Labels". The objective of this attack is to maximize the number of labels included in the classifier's prediction results. In our experiments, we select nine attack algorithms and evaluate their performance under "Showing Many Labels". Eight of the attack algorithms were adapted from the multi-class environment to the multi-label environment, while the remaining one was specifically designed for the multi-label environment. We choose ML-LIW and ML-GCN as target models and train them on four popular multi-label datasets: VOC2007, VOC2012,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Hate Speech and Cyberbullying Detection · Authorship Attribution and Profiling
