Estimating or Propagating Gradients Through Stochastic Neurons

Yoshua Bengio

arXiv:1305.2982·cs.LG·May 15, 2013·81 cites

Estimating or Propagating Gradients Through Stochastic Neurons

Yoshua Bengio

PDF

Open Access

TL;DR

This paper explores methods for estimating and propagating gradients through stochastic neurons, introducing two novel solutions that enable unbiased gradient estimation in various deep learning and reinforcement learning contexts.

Contribution

It presents two new approaches for gradient estimation in stochastic neurons, including a biologically plausible unbiased estimator and a method for approximating high-variance estimators.

Findings

01

Unbiased gradient estimator for binary stochastic neurons.

02

Applicability of estimators in reinforcement learning scenarios.

03

Method for learning to predict high-variance estimators.

Abstract

Stochastic neurons can be useful for a number of reasons in deep learning models, but in many cases they pose a challenging problem: how to estimate the gradient of a loss function with respect to the input of such stochastic neurons, i.e., can we "back-propagate" through these stochastic neurons? We examine this question, existing approaches, and present two novel families of solutions, applicable in different settings. In particular, it is demonstrated that a simple biologically plausible formula gives rise to an an unbiased (but noisy) estimator of the gradient with respect to a binary stochastic neuron firing probability. Unlike other estimators which view the noise as a small perturbation in order to estimate gradients by finite differences, this estimator is unbiased even without assuming that the stochastic perturbation is small. This estimator is also interesting because it can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Neural Networks and Applications