Some Improvements on Deep Convolutional Neural Network Based Image   Classification

Andrew G. Howard

arXiv:1312.5402·cs.CV·December 20, 2013·ICLR·226 cites

Some Improvements on Deep Convolutional Neural Network Based Image Classification

Andrew G. Howard

PDF

Open Access 3 Repos

TL;DR

This paper presents techniques to enhance deep convolutional neural networks for image classification, achieving significant accuracy improvements in the ImageNet challenge by data augmentation, test-time transformations, and model ensembling.

Contribution

The paper introduces specific data augmentation and model combination strategies that significantly improve CNN performance on large-scale image classification.

Findings

01

Achieved a top 5 error rate of 13.55% on ImageNet without external data.

02

Over 20% relative improvement over previous year's winner.

03

Enhanced accuracy through multiple image transformations and model ensembling.

Abstract

We investigate multiple techniques to improve upon the current state of the art deep convolutional neural network based image classification pipeline. The techiques include adding more image transformations to training data, adding more transformations to generate additional predictions at test time and using complementary models applied to higher resolution images. This paper summarizes our entry in the Imagenet Large Scale Visual Recognition Challenge 2013. Our system achieved a top 5 classification error rate of 13.55% using no external data which is over a 20% relative improvement on the previous year's winner.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Advanced Image and Video Retrieval Techniques

MethodsStochastic Gradient Descent · Step Decay · Convolution · Color Jitter · Random Resized Crop