Towards Dropout Training for Convolutional Neural Networks

Haibing Wu; Xiaodong Gu

arXiv:1512.00242·cs.LG·December 2, 2015

Towards Dropout Training for Convolutional Neural Networks

Haibing Wu, Xiaodong Gu

PDF

TL;DR

This paper investigates dropout in convolutional neural networks, proposing probabilistic weighted pooling as an alternative to max-pooling, leading to improved performance and insights into dropout effects in convolutional and pooling layers.

Contribution

It introduces probabilistic weighted pooling as a test-time alternative to max-pooling and demonstrates its effectiveness, along with a comprehensive analysis of dropout in convolutional layers.

Findings

01

Probabilistic weighted pooling outperforms max-pooling.

02

Dropout in convolutional layers has a significant impact.

03

Achieved state-of-the-art on MNIST, competitive on CIFAR datasets.

Abstract

Recently, dropout has seen increasing use in deep learning. For deep convolutional neural networks, dropout is known to work well in fully-connected layers. However, its effect in convolutional and pooling layers is still not clear. This paper demonstrates that max-pooling dropout is equivalent to randomly picking activation based on a multinomial distribution at training time. In light of this insight, we advocate employing our proposed probabilistic weighted pooling, instead of commonly used max-pooling, to act as model averaging at test time. Empirical evidence validates the superiority of probabilistic weighted pooling. We also empirically show that the effect of convolutional dropout is not trivial, despite the dramatically reduced possibility of over-fitting due to the convolutional architecture. Elaborately designing dropout training simultaneously in max-pooling and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDropout