Adversarial Attacks on Image Classification Models: Analysis and Defense
Jaydip Sen, Abhiraj Sen, and Ananda Chatterjee

TL;DR
This paper analyzes the impact of the FGSM adversarial attack on CNN image classifiers and proposes a defense mechanism based on modified defensive distillation to improve robustness.
Contribution
It provides a detailed analysis of FGSM attack effects on popular CNN architectures and introduces a novel defense strategy using modified defensive distillation.
Findings
FGSM significantly reduces classification accuracy of CNNs
The proposed defense improves robustness against FGSM attacks
Extensive validation confirms effectiveness of the defense mechanism
Abstract
The notion of adversarial attacks on image classification models based on convolutional neural networks (CNN) is introduced in this work. To classify images, deep learning models called CNNs are frequently used. However, when the networks are subject to adversarial attacks, extremely potent and previously trained CNN models that perform quite effectively on image datasets for image classification tasks may perform poorly. In this work, one well-known adversarial attack known as the fast gradient sign method (FGSM) is explored and its adverse effects on the performances of image classification models are examined. The FGSM attack is simulated on three pre-trained image classifier CNN architectures, ResNet-101, AlexNet, and RegNetY 400MF using randomly chosen images from the ImageNet dataset. The classification accuracies of the models are computed in the absence and presence of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research
MethodsAverage Pooling · Sigmoid Activation · Squeeze-and-Excitation Block · Global Average Pooling · Convolution · Dense Connections · Grouped Convolution · 1x1 Convolution · *Communicated@Fast*How Do I Communicate to Expedia? · Batch Normalization
