Monte Carlo Gradient Estimation in Machine Learning

Shakir Mohamed; Mihaela Rosca; Michael Figurnov; Andriy Mnih

arXiv:1906.10652·stat.ML·September 30, 2020·85 cites

Monte Carlo Gradient Estimation in Machine Learning

Shakir Mohamed, Mihaela Rosca, Michael Figurnov, Andriy Mnih

PDF

Open Access 2 Repos

TL;DR

This paper provides a comprehensive survey of Monte Carlo gradient estimation methods in machine learning, discussing their development, assumptions, and applications across various learning paradigms to facilitate further research and improvements.

Contribution

It offers an accessible overview of three key Monte Carlo gradient estimators, their historical context, relationships, and potential for generalization, supporting future advances in the field.

Findings

01

Exploration of pathwise, score function, and measure-valued estimators

02

Analysis of their historical development and assumptions

03

Discussion on combining and generalizing these methods

Abstract

This paper is a broad and accessible survey of the methods we have at our disposal for Monte Carlo gradient estimation in machine learning and across the statistical sciences: the problem of computing the gradient of an expectation of a function with respect to parameters defining the distribution that is integrated; the problem of sensitivity analysis. In machine learning research, this gradient problem lies at the core of many learning problems, in supervised, unsupervised and reinforcement learning. We will generally seek to rewrite such gradients in a form that allows for Monte Carlo estimation, allowing them to be easily and efficiently used and analysed. We explore three strategies--the pathwise, score function, and measure-valued gradient estimators--exploring their historical development, derivation, and underlying assumptions. We describe their use in other fields, show how…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning and Algorithms · Adversarial Robustness in Machine Learning · Stochastic Gradient Optimization Techniques