Multi-Sample Dropout for Accelerated Training and Better Generalization

Hiroshi Inoue

arXiv:1905.09788·cs.NE·October 22, 2020·52 cites

Multi-Sample Dropout for Accelerated Training and Better Generalization

Hiroshi Inoue

PDF

Open Access 5 Repos

TL;DR

This paper introduces multi-sample dropout, an enhanced regularization method that creates multiple dropout samples per iteration, leading to faster training and improved generalization in deep neural networks.

Contribution

It proposes a simple extension to dropout that averages losses over multiple samples, accelerating training and enhancing model performance without significant additional computational cost.

Findings

01

Multi-sample dropout accelerates training on image classification tasks.

02

Networks with multi-sample dropout achieve lower error rates than with original dropout.

03

Additional computation cost is minimal due to shared weights and convolutional layer dominance.

Abstract

Dropout is a simple but efficient regularization technique for achieving better generalization of deep neural networks (DNNs); hence it is widely used in tasks based on DNNs. During training, dropout randomly discards a portion of the neurons to avoid overfitting. This paper presents an enhanced dropout technique, which we call multi-sample dropout, for both accelerating training and improving generalization over the original dropout. The original dropout creates a randomly selected subset (called a dropout sample) from the input in each training iteration while the multi-sample dropout creates multiple dropout samples. The loss is calculated for each sample, and then the sample losses are averaged to obtain the final loss. This technique can be easily implemented by duplicating a part of the network after the dropout layer while sharing the weights among the duplicated fully connected…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification

MethodsConvolution · Dropout