Estimating or Propagating Gradients Through Stochastic Neurons
Yoshua Bengio

TL;DR
This paper explores methods for estimating and propagating gradients through stochastic neurons, introducing two novel solutions that enable unbiased gradient estimation in various deep learning and reinforcement learning contexts.
Contribution
It presents two new approaches for gradient estimation in stochastic neurons, including a biologically plausible unbiased estimator and a method for approximating high-variance estimators.
Findings
Unbiased gradient estimator for binary stochastic neurons.
Applicability of estimators in reinforcement learning scenarios.
Method for learning to predict high-variance estimators.
Abstract
Stochastic neurons can be useful for a number of reasons in deep learning models, but in many cases they pose a challenging problem: how to estimate the gradient of a loss function with respect to the input of such stochastic neurons, i.e., can we "back-propagate" through these stochastic neurons? We examine this question, existing approaches, and present two novel families of solutions, applicable in different settings. In particular, it is demonstrated that a simple biologically plausible formula gives rise to an an unbiased (but noisy) estimator of the gradient with respect to a binary stochastic neuron firing probability. Unlike other estimators which view the noise as a small perturbation in order to estimate gradients by finite differences, this estimator is unbiased even without assuming that the stochastic perturbation is small. This estimator is also interesting because it can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques · Neural Networks and Applications
