Convergence rates for pretraining and dropout: Guiding learning   parameters using network structure

Vamsi K. Ithapu; Sathya Ravi; Vikas Singh

arXiv:1506.03412·cs.LG·February 23, 2017·1 cites

Convergence rates for pretraining and dropout: Guiding learning parameters using network structure

Vamsi K. Ithapu, Sathya Ravi, Vikas Singh

PDF

Open Access

TL;DR

This paper investigates how network structure influences the convergence rates of pretraining and dropout in deep learning, providing theoretical bounds and practical insights for parameter choices.

Contribution

It introduces a framework linking network structure and learning parameters to convergence rates, incorporating denoising autoencoder and dropout mechanisms.

Findings

01

Derived convergence rate bounds for deep networks with dropout and autoencoders

02

Provided guidelines for selecting learning parameters based on network structure

03

Supported theoretical results with empirical evaluations

Abstract

Unsupervised pretraining and dropout have been well studied, especially with respect to regularization and output consistency. However, our understanding about the explicit convergence rates of the parameter estimates, and their dependence on the learning (like denoising and dropout rate) and structural (like depth and layer lengths) aspects of the network is less mature. An interesting question in this context is to ask if the network structure could "guide" the choices of such learning parameters. In this work, we explore these gaps between network structure, the learning mechanisms and their interaction with parameter convergence rates. We present a way to address these issues based on the backpropagation convergence rates for general nonconvex objectives using first-order information. We then incorporate two learning mechanisms into this general framework -- denoising autoencoder…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsNeural Networks and Applications · Evolutionary Algorithms and Applications · Advanced Memory and Neural Computing

MethodsDenoising Autoencoder · Solana Customer Service Number +1-833-534-1729 · Dropout