Undermining Image and Text Classification Algorithms Using Adversarial   Attacks

Langalibalele Lunga; Suhas Sreehari

arXiv:2411.03348·cs.CR·November 8, 2024

Undermining Image and Text Classification Algorithms Using Adversarial Attacks

Langalibalele Lunga, Suhas Sreehari

PDF

Open Access

TL;DR

This paper demonstrates the vulnerability of text and face recognition models to adversarial attacks using GANs, SMOTE, and gradient-based perturbations, leading to significant accuracy drops and highlighting the need for robust defenses.

Contribution

It introduces the use of GANs and SMOTE for adversarial attacks on text classification, and applies gradient sign attacks on face recognition, revealing critical vulnerabilities.

Findings

01

20% accuracy decrease in text classification models after attack

02

30% decrease in face recognition accuracy post-attack

03

Adversarial attacks significantly compromise model reliability

Abstract

Machine learning models are prone to adversarial attacks, where inputs can be manipulated in order to cause misclassifications. While previous research has focused on techniques like Generative Adversarial Networks (GANs), there's limited exploration of GANs and Synthetic Minority Oversampling Technique (SMOTE) in text and image classification models to perform adversarial attacks. Our study addresses this gap by training various machine learning models and using GANs and SMOTE to generate additional data points aimed at attacking text classification models. Furthermore, we extend our investigation to face recognition models, training a Convolutional Neural Network(CNN) and subjecting it to adversarial attacks with fast gradient sign perturbations on key features identified by GradCAM, a technique used to highlight key image characteristics CNNs use in classification. Our experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Adversarial Robustness in Machine Learning

MethodsSynthetic Minority Over-sampling Technique.