Stochastic Descent Analysis of Representation Learning Algorithms

Richard M. Golden

arXiv:1412.5744·stat.ML·April 21, 2017

Stochastic Descent Analysis of Representation Learning Algorithms

Richard M. Golden

PDF

Open Access

TL;DR

This paper introduces a new stochastic approximation theorem with verifiable assumptions, enabling rigorous analysis and design of various deep learning algorithms such as adaptive learning and contrastive divergence.

Contribution

It provides a novel stochastic approximation theorem tailored for state-dependent noise, facilitating theoretical understanding of key deep learning algorithms.

Findings

01

The theorem applies to adaptive learning algorithms.

02

It enables analysis of contrastive divergence learning.

03

Supports design of more reliable deep learning methods.

Abstract

Although stochastic approximation learning methods have been widely used in the machine learning literature for over 50 years, formal theoretical analyses of specific machine learning algorithms are less common because stochastic approximation theorems typically possess assumptions which are difficult to communicate and verify. This paper presents a new stochastic approximation theorem for state-dependent noise with easily verifiable assumptions applicable to the analysis and design of important deep learning algorithms including: adaptive learning, contrastive divergence learning, stochastic descent expectation maximization, and active learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Neural Networks and Applications · Gaussian Processes and Bayesian Inference