An Analysis of Dropout for Matrix Factorization
Jacopo Cavazza, Connor Lane, Benjamin D. Haeffele, Vittorio Murino,, Ren\'e Vidal

TL;DR
This paper provides a theoretical analysis of dropout in matrix factorization, revealing its equivalence to a deterministic regularization model and highlighting issues with fixed dropout rates affecting factorization size.
Contribution
It introduces a theoretical framework connecting dropout to a regularizer in matrix factorization and proposes increasing dropout rates with factor size to address size control issues.
Findings
Dropout is equivalent to a deterministic regularizer involving column norms.
Using a fixed dropout rate can lead to trivial solutions with larger factorization.
Increasing dropout rate with factor size mitigates the trivial solution problem.
Abstract
Dropout is a simple yet effective algorithm for regularizing neural networks by randomly dropping out units through Bernoulli multiplicative noise, and for some restricted problem classes, such as linear or logistic regression, several theoretical studies have demonstrated the equivalence between dropout and a fully deterministic optimization problem with data-dependent Tikhonov regularization. This work presents a theoretical analysis of dropout for matrix factorization, where Bernoulli random variables are used to drop a factor, thereby attempting to control the size of the factorization. While recent work has demonstrated the empirical effectiveness of dropout for matrix factorization, a theoretical understanding of the regularization properties of dropout in this context remains elusive. This work demonstrates the equivalence between dropout and a fully deterministic model for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Machine Learning and ELM
MethodsDropout
