
TL;DR
This paper establishes a novel connection between Fourier analysis of Boolean functions and the REINFORCE gradient estimator, leading to new low-variance unbiased estimators for binary latent variable models.
Contribution
It introduces a Fourier-based perspective on REINFORCE, enabling the design of variance-reducing baselines using Boolean Fourier theory.
Findings
REINFORCE estimates relate to degree-1 Fourier coefficients.
Variance reduction involves eliminating non-degree-1 Fourier components.
Proposed estimators achieve lower variance while remaining unbiased.
Abstract
We show a connection between the Fourier spectrum of Boolean functions and the REINFORCE gradient estimator for binary latent variable models. We show that REINFORCE estimates (up to a factor) the degree-1 Fourier coefficients of a Boolean function. Using this connection we offer a new perspective on variance reduction in gradient estimation for latent variable models: namely, that variance reduction involves eliminating or reducing Fourier coefficients that do not have degree 1. We then use this connection to develop low-variance unbiased gradient estimators for binary latent variable models such as sigmoid belief networks. The estimator is based upon properties of the noise operator from Boolean Fourier theory and involves a sample-dependent baseline added to the REINFORCE estimator in a way that keeps the estimator unbiased. The baseline can be plugged into existing gradient…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Modeling and Causal Inference · Machine Learning and Algorithms · Markov Chains and Monte Carlo Methods
MethodsREINFORCE
