Multi-column Deep Neural Networks for Image Classification

Dan Cire\c{s}an; Ueli Meier; Juergen Schmidhuber

arXiv:1202.2745·cs.CV·November 15, 2012·64 cites

Multi-column Deep Neural Networks for Image Classification

Dan Cire\c{s}an, Ueli Meier, Juergen Schmidhuber

PDF

Open Access 1 Repo

TL;DR

This paper introduces multi-column deep neural networks inspired by biological vision, achieving near-human performance on MNIST and surpassing humans on traffic sign recognition, with state-of-the-art results on various benchmarks.

Contribution

The paper presents a biologically inspired deep neural network architecture with multiple columns and winner-take-all neurons, achieving superior image classification performance.

Findings

01

Achieved near-human accuracy on MNIST handwritten digit recognition.

02

Outperformed humans by a factor of two on traffic sign recognition.

03

Set new state-of-the-art results on multiple image classification benchmarks.

Abstract

Traditional methods of computer vision and machine learning cannot match human performance on tasks such as the recognition of handwritten digits or traffic signs. Our biologically plausible deep artificial neural network architectures can. Small (often minimal) receptive fields of convolutional winner-take-all neurons yield large network depth, resulting in roughly as many sparsely connected neural layers as found in mammals between retina and visual cortex. Only winner neurons are trained. Several deep neural columns become experts on inputs preprocessed in different ways; their predictions are averaged. Graphics cards allow for fast training. On the very competitive MNIST handwriting benchmark, our method is the first to achieve near-human performance. On a traffic sign recognition benchmark it outperforms humans by a factor of two. We also improve the state-of-the-art on a plethora…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hughperkins/DeepCL
none

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Visual Attention and Saliency Detection · Domain Adaptation and Few-Shot Learning