To Drop or Not to Drop: Robustness, Consistency and Differential Privacy Properties of Dropout
Prateek Jain, Vivek Kulkarni, Abhradeep Thakurta, Oliver Williams

TL;DR
This paper analyzes dropout's effects on neural network training, showing it helps avoid poor local minima and acts as a stabilizer in convex models, leading to improved generalization and privacy-preserving algorithms.
Contribution
It provides a rigorous theoretical analysis of dropout's dual role in non-convex and convex settings, including its stability and privacy benefits.
Findings
Dropout reduces local minima in neural networks.
Dropout stabilizes convex empirical risk minimizers.
Dropout outperforms L2 regularization on benchmark datasets.
Abstract
Training deep belief networks (DBNs) requires optimizing a non-convex function with an extremely large number of parameters. Naturally, existing gradient descent (GD) based methods are prone to arbitrarily poor local minima. In this paper, we rigorously show that such local minima can be avoided (upto an approximation error) by using the dropout technique, a widely used heuristic in this domain. In particular, we show that by randomly dropping a few nodes of a one-hidden layer neural network, the training objective function, up to a certain approximation error, decreases by a multiplicative factor. On the flip side, we show that for training convex empirical risk minimizers (ERM), dropout in fact acts as a "stabilizer" or regularizer. That is, a simple dropout based GD method for convex ERMs is stable in the face of arbitrary changes to any one of the training points. Using the above…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Mobile Crowdsensing and Crowdsourcing · Privacy, Security, and Data Protection
MethodsDropout
