Understanding Dropout as an Optimization Trick

Sangchul Hahn; Heeyoul Choi

arXiv:1806.09783·cs.LG·October 10, 2019

Understanding Dropout as an Optimization Trick

Sangchul Hahn, Heeyoul Choi

PDF

TL;DR

This paper offers a new perspective on dropout as an optimization method that pushes inputs into activation saturation zones, and introduces GAAF, a technique to enhance gradient flow, improving neural network robustness and performance.

Contribution

It provides a novel explanation of dropout's effectiveness and proposes GAAF, a new activation function technique that accelerates gradients in saturation areas.

Findings

01

GAAF improves image classification accuracy.

02

Dropout's effect is partly due to pushing inputs into saturation zones.

03

GAAF enhances robustness by enabling better gradient flow.

Abstract

As one of standard approaches to train deep neural networks, dropout has been applied to regularize large models to avoid overfitting, and the improvement in performance by dropout has been explained as avoiding co-adaptation between nodes. However, when correlations between nodes are compared after training the networks with or without dropout, one question arises if co-adaptation avoidance explains the dropout effect completely. In this paper, we propose an additional explanation of why dropout works and propose a new technique to design better activation functions. First, we show that dropout can be explained as an optimization technique to push the input towards the saturation area of nonlinear activation function by accelerating gradient information flowing even in the saturation area in backpropagation. Based on this explanation, we propose a new technique for activation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsDropout