CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator

Alek Dimitriev; Mingyuan Zhou

arXiv:2110.14002·cs.LG·October 28, 2021·1 cites

CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator

Alek Dimitriev, Mingyuan Zhou

PDF

Open Access 1 Repo 1 Video

TL;DR

CARMS introduces an unbiased, variance-reducing gradient estimator for categorical variables using antithetic sampling and copula-based methods, improving training of discrete models across various tasks.

Contribution

The paper presents CARMS, a novel multi-sample gradient estimator for categorical variables that combines REINFORCE with copula-based antithetic sampling, generalizing previous estimators and reducing variance.

Findings

01

CARMS outperforms existing methods on benchmark generative tasks.

02

It achieves lower variance in gradient estimates.

03

The method is effective across multiple discrete modeling tasks.

Abstract

Accurately backpropagating the gradient through categorical variables is a challenging task that arises in various domains, such as training discrete latent variable models. To this end, we propose CARMS, an unbiased estimator for categorical random variables based on multiple mutually negatively correlated (jointly antithetic) samples. CARMS combines REINFORCE with copula based sampling to avoid duplicate samples and reduce its variance, while keeping the estimator unbiased using importance sampling. It generalizes both the ARMS antithetic estimator for binary variables, which is CARMS for two categories, as well as LOORF/VarGrad, the leave-one-out REINFORCE estimator, which is CARMS with independent samples. We evaluate CARMS on several benchmark datasets on a generative modeling task, as well as a structured output prediction task, and find it to outperform competing methods…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alekdimi/carms
tfOfficial

Videos

CARMS: Categorical-Antithetic-REINFORCE Multi-Sample Gradient Estimator· slideslive

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Machine Learning in Healthcare · Gaussian Processes and Bayesian Inference

MethodsREINFORCE