ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky; Jia Deng; Hao Su; Jonathan Krause and; Sanjeev Satheesh; Sean Ma; Zhiheng Huang; Andrej Karpathy; Aditya; Khosla; Michael Bernstein; Alexander C. Berg; Li Fei-Fei

arXiv:1409.0575·cs.CV·February 2, 2015·53 cites

ImageNet Large Scale Visual Recognition Challenge

Olga Russakovsky, Jia Deng, Hao Su, Jonathan Krause and, Sanjeev Satheesh, Sean Ma, Zhiheng Huang, Andrej Karpathy, Aditya, Khosla, Michael Bernstein, Alexander C. Berg, Li Fei-Fei

PDF

Open Access 5 Repos 2 Models 5 Datasets

TL;DR

The ImageNet Large Scale Visual Recognition Challenge is a benchmark for object classification and detection that has driven significant advances in computer vision over the past decade.

Contribution

This paper introduces the ImageNet benchmark dataset and analyzes key progress and challenges in large-scale image recognition and detection.

Findings

01

Significant improvements in object recognition accuracy

02

Comparison of computer vision performance with human accuracy

03

Lessons learned and future directions for large-scale visual recognition

Abstract

The ImageNet Large Scale Visual Recognition Challenge is a benchmark in object category classification and detection on hundreds of object categories and millions of images. The challenge has been run annually from 2010 to present, attracting participation from more than fifty institutions. This paper describes the creation of this benchmark dataset and the advances in object recognition that have been possible as a result. We discuss the challenges of collecting large-scale ground truth annotation, highlight key breakthroughs in categorical object recognition, provide a detailed analysis of the current state of the field of large-scale image classification and object detection, and compare the state-of-the-art computer vision accuracy with human accuracy. We conclude with lessons learned in the five years of the challenge, and propose future directions and improvements.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Retrieval and Classification Techniques · Advanced Image and Video Retrieval Techniques · Digital Imaging for Blood Diseases

Methods1-Dimensional Convolutional Neural Networks