Convergence rates for pretraining and dropout: Guiding learning parameters using network structure
Vamsi K. Ithapu, Sathya Ravi, Vikas Singh

TL;DR
This paper investigates how network structure influences the convergence rates of pretraining and dropout in deep learning, providing theoretical bounds and practical insights for parameter choices.
Contribution
It introduces a framework linking network structure and learning parameters to convergence rates, incorporating denoising autoencoder and dropout mechanisms.
Findings
Derived convergence rate bounds for deep networks with dropout and autoencoders
Provided guidelines for selecting learning parameters based on network structure
Supported theoretical results with empirical evaluations
Abstract
Unsupervised pretraining and dropout have been well studied, especially with respect to regularization and output consistency. However, our understanding about the explicit convergence rates of the parameter estimates, and their dependence on the learning (like denoising and dropout rate) and structural (like depth and layer lengths) aspects of the network is less mature. An interesting question in this context is to ask if the network structure could "guide" the choices of such learning parameters. In this work, we explore these gaps between network structure, the learning mechanisms and their interaction with parameter convergence rates. We present a way to address these issues based on the backpropagation convergence rates for general nonconvex objectives using first-order information. We then incorporate two learning mechanisms into this general framework -- denoising autoencoder…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNeural Networks and Applications · Evolutionary Algorithms and Applications · Advanced Memory and Neural Computing
MethodsDenoising Autoencoder · Solana Customer Service Number +1-833-534-1729 · Dropout
