Multi-concept adversarial attacks
Vibha Belavadi, Yan Zhou, Murat Kantarcioglu, Bhavani M. Thuraisingham

TL;DR
This paper introduces novel adversarial attack methods that can target specific classifiers in multi-concept models without affecting others, enhancing privacy protection and model robustness.
Contribution
It develops a theoretical framework and practical strategies for simultaneous targeted attacks and protections in multi-concept machine learning models.
Findings
Successfully attacked target classifiers while preserving others' accuracy.
Demonstrated effectiveness in deep learning and linear classifiers.
Outperformed existing single-strategy attack methods.
Abstract
As machine learning (ML) techniques are being increasingly used in many applications, their vulnerability to adversarial attacks becomes well-known. Test time attacks, usually launched by adding adversarial noise to test instances, have been shown effective against the deployed ML models. In practice, one test input may be leveraged by different ML models. Test time attacks targeting a single ML model often neglect their impact on other ML models. In this work, we empirically demonstrate that naively attacking the classifier learning one concept may negatively impact classifiers trained to learn other concepts. For example, for the online image classification scenario, when the Gender classifier is under attack, the (wearing) Glasses classifier is simultaneously attacked with the accuracy dropped from 98.69 to 88.42. This raises an interesting question: is it possible to attack one set…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Malware Detection Techniques · Bacillus and Francisella bacterial research
MethodsTest
