Flaws of ImageNet, Computer Vision's Favourite Dataset

Nikita Kisel; Illia Volkov; Katerina Hanzelkova; Klara Janouskova,; Jiri Matas

arXiv:2412.00076·cs.CV·December 3, 2024

Flaws of ImageNet, Computer Vision's Favourite Dataset

Nikita Kisel, Illia Volkov, Katerina Hanzelkova, Klara Janouskova,, Jiri Matas

PDF

Open Access

TL;DR

This paper critically examines the ImageNet-1k dataset, highlighting issues like incorrect labels and domain shifts, and advocates for dataset refinement to improve future computer vision research.

Contribution

It provides a detailed analysis of ImageNet-1k's flaws and initiates a discussion on refining this widely used dataset for better research outcomes.

Findings

01

Identification of label correctness issues

02

Detection of image duplicates and ambiguities

03

Discussion on domain shifts affecting evaluation

Abstract

Since its release, ImageNet-1k dataset has become a gold standard for evaluating model performance. It has served as the foundation for numerous other datasets and training tasks in computer vision. As models have improved in accuracy, issues related to label correctness have become increasingly apparent. In this blog post, we analyze the issues in the ImageNet-1k dataset, including incorrect labels, overlapping or ambiguous class definitions, training-evaluation domain shifts, and image duplicates. The solutions for some problems are straightforward. For others, we hope to start a broader conversation about refining this influential dataset to better serve future research.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsCOVID-19 diagnosis using AI · AI in cancer detection