The Effect of Data Ordering in Image Classification

Ethem F. Can; Aysu Ezen-Can

arXiv:2001.05857·cs.CV·September 29, 2020·1 cites

The Effect of Data Ordering in Image Classification

Ethem F. Can, Aysu Ezen-Can

PDF

Open Access

TL;DR

This paper demonstrates that the order in which training data is presented significantly impacts image classification performance, regardless of model architecture or training parameters.

Contribution

It reveals the importance of data ordering in training deep learning models and provides experimental evidence across various metrics on ImageNet.

Findings

01

Data ordering affects classification accuracy

02

Certain data sequences outperform others

03

Impact is consistent across different models and metrics

Abstract

The success stories from deep learning models increase every day spanning different tasks from image classification to natural language understanding. With the increasing popularity of these models, scientists spend more and more time finding the optimal parameters and best model architectures for their tasks. In this paper, we focus on the ingredient that feeds these machines: the data. We hypothesize that the data ordering affects how well a model performs. To that end, we conduct experiments on an image classification task using ImageNet dataset and show that some data orderings are better than others in terms of obtaining higher classification accuracies. Experimental results show that independent of model architecture, learning rate and batch size, ordering of the data significantly affects the outcome. We show these findings using different metrics: NDCG, accuracy @ 1 and accuracy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Machine Learning and Data Classification