Gradient Estimators for Implicit Models

Yingzhen Li; Richard E. Turner

arXiv:1705.07107·stat.ML·April 27, 2018·35 cites

Gradient Estimators for Implicit Models

Yingzhen Li, Richard E. Turner

PDF

Open Access 1 Repo

TL;DR

This paper introduces the Stein gradient estimator, a novel method for directly estimating the score function of implicit models, improving training accuracy and sample diversity in applications like GANs and meta-learning.

Contribution

The paper proposes the Stein gradient estimator, enabling direct score function estimation for implicit models, reducing reliance on approximations and enhancing model performance.

Findings

01

Improved sample diversity in entropy-regularized GANs

02

Enhanced meta-learning for approximate inference

03

Demonstrated efficacy of the estimator in various applications

Abstract

Implicit models, which allow for the generation of samples but not for point-wise evaluation of probabilities, are omnipresent in real-world problems tackled by machine learning and a hot topic of current research. Some examples include data simulators that are widely used in engineering and scientific research, generative adversarial networks (GANs) for image synthesis, and hot-off-the-press approximate inference techniques relying on implicit distributions. The majority of existing approaches to learning implicit models rely on approximating the intractable distribution or optimisation objective for gradient-based optimisation, which is liable to produce inaccurate updates and thus poor models. This paper alleviates the need for such approximations by proposing the Stein gradient estimator, which directly estimates the score function of the implicitly defined distribution. The…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

YingzhenLi/SteinGrad
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMarkov Chains and Monte Carlo Methods · Gaussian Processes and Bayesian Inference · Model Reduction and Neural Networks