Undermining Image and Text Classification Algorithms Using Adversarial Attacks
Langalibalele Lunga, Suhas Sreehari

TL;DR
This paper demonstrates the vulnerability of text and face recognition models to adversarial attacks using GANs, SMOTE, and gradient-based perturbations, leading to significant accuracy drops and highlighting the need for robust defenses.
Contribution
It introduces the use of GANs and SMOTE for adversarial attacks on text classification, and applies gradient sign attacks on face recognition, revealing critical vulnerabilities.
Findings
20% accuracy decrease in text classification models after attack
30% decrease in face recognition accuracy post-attack
Adversarial attacks significantly compromise model reliability
Abstract
Machine learning models are prone to adversarial attacks, where inputs can be manipulated in order to cause misclassifications. While previous research has focused on techniques like Generative Adversarial Networks (GANs), there's limited exploration of GANs and Synthetic Minority Oversampling Technique (SMOTE) in text and image classification models to perform adversarial attacks. Our study addresses this gap by training various machine learning models and using GANs and SMOTE to generate additional data points aimed at attacking text classification models. Furthermore, we extend our investigation to face recognition models, training a Convolutional Neural Network(CNN) and subjecting it to adversarial attacks with fast gradient sign perturbations on key features identified by GradCAM, a technique used to highlight key image characteristics CNNs use in classification. Our experiments…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Adversarial Robustness in Machine Learning
MethodsSynthetic Minority Over-sampling Technique.
