Hindsight Network Credit Assignment: Efficient Credit Assignment in   Networks of Discrete Stochastic Units

Kenny Young

arXiv:2110.07700·cs.LG·December 17, 2021

Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units

Kenny Young

PDF

Open Access 1 Video

TL;DR

This paper introduces Hindsight Network Credit Assignment (HNCA), a novel gradient estimation method for discrete stochastic neural networks that reduces variance and improves training efficiency compared to traditional methods like REINFORCE.

Contribution

The paper presents HNCA, a new unbiased gradient estimator for networks of discrete stochastic units with lower variance and similar computational cost to backpropagation.

Findings

01

HNCA outperforms REINFORCE in contextual bandit tasks.

02

HNCA effectively trains discrete variational auto-encoders.

03

Theoretical analysis confirms variance reduction of HNCA.

Abstract

Training neural networks with discrete stochastic variables presents a unique challenge. Backpropagation is not directly applicable, nor are the reparameterization tricks used in networks with continuous stochastic variables. To address this challenge, we present Hindsight Network Credit Assignment (HNCA), a novel gradient estimation algorithm for networks of discrete stochastic units. HNCA works by assigning credit to each unit based on the degree to which its output influences its immediate children in the network. We prove that HNCA produces unbiased gradient estimates with reduced variance compared to the REINFORCE estimator, while the computational cost is similar to that of backpropagation. We first apply HNCA in a contextual bandit setting to optimize a reward function that is unknown to the agent. In this setting, we empirically demonstrate that HNCA significantly outperforms…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Hindsight Network Credit Assignment: Efficient Credit Assignment in Networks of Discrete Stochastic Units· underline

Taxonomy

TopicsStochastic Gradient Optimization Techniques · Domain Adaptation and Few-Shot Learning · Machine Learning and ELM

MethodsREINFORCE