Performance degradation of ImageNet trained models by simple image transformations

Harsh Maheshwari

arXiv:2207.08079·cs.CV·July 19, 2022

TL;DR

This paper evaluates how simple image transformations such as shifting, scaling, rotation, and Gaussian noise affect the performance of ImageNet-trained models, and finds that a 10° rotation or a 20% zoom can reduce the top-1 accuracy of models like ResNet152 by more than 1%.

Contribution

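It tests a representative set of convolution- and transformer-based ImageNet-trained PyTorch models under eight simple image transformations (horizontal shift, vertical shift, scaling, rotation, Gaussian noise, cutout, horizontal flip, and vertical flip) and reports the performance drop each one causes.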

Findings

01

Rotating the image by 10° can reduce the top-1 accuracy of models like ResNet152 by more than 1%.

02

Zooming in by 20% can reduce the top-1 accuracy of models like ResNet152 by more than 1%.

03

Simple transformations can notably impair model accuracy.
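
The findings above are stated as drops in top-1 accuracy. As a rough illustration of how such a drop can be computed, the sketch below compares predictions on clean and transformed inputs and reports the difference both in percentage points and relative to the clean accuracy. The function names and the toy labels and predictions are hypothetical and are not results from the paper; the listing does not say which of the two readings the "1%+" figure uses.

```python
def top1_accuracy(predictions, labels):
    """Fraction of samples whose predicted class equals the true label."""
    if len(predictions) != len(labels):
        raise ValueError("predictions and labels must have the same length")
    if not labels:
        raise ValueError("need at least one sample")
    correct = sum(p == y for p, y in zip(predictions, labels))
    return correct / len(labels)


def accuracy_drop(clean_acc, transformed_acc):
    """Return (absolute drop in percentage points, relative drop in percent)."""
    if clean_acc <= 0:
        raise ValueError("clean accuracy must be positive")
    absolute_pp = (clean_acc - transformed_acc) * 100.0
    relative_pct = (clean_acc - transformed_acc) / clean_acc * 100.0
    return absolute_pp, relative_pct


# Hypothetical toy data (10 samples), NOT results from the paper.
labels            = [0, 1, 2, 3, 4, 5, 6, 7, 8, 9]
clean_preds       = [0, 1, 2, 3, 4, 5, 6, 7, 0, 0]  # 8 of 10 correct
transformed_preds = [0, 1, 2, 3, 4, 5, 0, 0, 0, 0]  # 6 of 10 correct

clean_acc = top1_accuracy(clean_preds, labels)
transformed_acc = top1_accuracy(transformed_preds, labels)
absolute_pp, relative_pct = accuracy_drop(clean_acc, transformed_acc)
print(f"clean={clean_acc:.2f} transformed={transformed_acc:.2f}")
print(f"drop: {absolute_pp:.1f} percentage points, {relative_pct:.1f}% relative")
```

The two measures diverge more as clean accuracy falls, so it helps to state which one is meant when quoting a drop.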

Abstract

ImageNet trained PyTorch models are generally preferred as the off-the-shelf models for direct use or for initialisation in most computer vision tasks. In this paper, we simply test a representative set of these convolution- and transformer-based models under many simple image transformations like horizontal shifting, vertical shifting, scaling, rotation, presence of Gaussian noise, cutout, horizontal flip and vertical flip, and report the performance drop caused by such transformations. We find that even simple transformations like rotating the image by 10° or zooming in by 20% can reduce the top-1 accuracy of models like ResNet152 by 1%+. The code is available at https://github.com/harshm121/imagenet-transformation-degradation.
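
The evaluation protocol the abstract describes (apply one transformation at a time, then compare accuracy with the clean baseline) can be sketched without any framework. The example below is a minimal, dependency-free stand-in and is not the authors' PyTorch code: images are nested lists, `predict` is a hypothetical toy classifier, and all helper names are assumptions made for illustration. Rotation and scaling are omitted because they need interpolation.

```python
def hflip(img):
    """Horizontal flip: reverse each row."""
    return [row[::-1] for row in img]


def vflip(img):
    """Vertical flip: reverse the order of the rows."""
    return [row[:] for row in img[::-1]]


def shift_right(img, dx, fill=0):
    """Shift content dx pixels to the right, padding the left edge with `fill`."""
    width = len(img[0])
    dx = max(0, min(dx, width))
    return [[fill] * dx + row[:width - dx] for row in img]


def cutout(img, top, left, size, fill=0):
    """Overwrite a size x size square (clipped to the image) with `fill`."""
    out = [row[:] for row in img]
    for r in range(top, min(top + size, len(out))):
        for c in range(left, min(left + size, len(out[0]))):
            out[r][c] = fill
    return out


def predict(img):
    """Hypothetical toy classifier: class 1 if the left half is brighter."""
    half = len(img[0]) // 2
    left = sum(sum(row[:half]) for row in img)
    right = sum(sum(row[half:]) for row in img)
    return 1 if left > right else 0


def evaluate(predict_fn, images, labels, transform=None):
    """Top-1 accuracy of predict_fn, optionally after transforming each image."""
    correct = 0
    for img, label in zip(images, labels):
        x = transform(img) if transform is not None else img
        correct += int(predict_fn(x) == label)
    return correct / len(labels)


# Toy 4x4 dataset, NOT ImageNet: one bright-left and one bright-right image.
bright_left = [[9, 9, 0, 0] for _ in range(4)]
bright_right = [[0, 0, 9, 9] for _ in range(4)]
images, labels = [bright_left, bright_right], [1, 0]

transforms = {
    "hflip": hflip,
    "vflip": vflip,
    "shift_right_1": lambda im: shift_right(im, 1),
    "cutout_2x2": lambda im: cutout(im, 0, 0, 2),
}

clean_acc = evaluate(predict, images, labels)
drops = {name: clean_acc - evaluate(predict, images, labels, fn)
         for name, fn in transforms.items()}
for name, drop in drops.items():
    print(f"{name}: accuracy drop {drop:.2f}")
```

Keeping the transformation as a separate callable passed to `evaluate` means the same loop measures every perturbation against the same baseline, which is the comparison the abstract reports.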

Code & Models

Repositories

harshm121/imagenet-transformation-degradation
PyTorch · Official

Taxonomy

Topics: Advanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

Methods: FLIP · Test · Convolution