GMAC: A Distributional Perspective on Actor-Critic Framework

Daniel Wontae Nam; Younghoon Kim; Chan Y. Park

arXiv:2105.11366·cs.LG·July 16, 2021

GMAC: A Distributional Perspective on Actor-Critic Framework

Daniel Wontae Nam, Younghoon Kim, Chan Y. Park

PDF

Open Access 1 Video

TL;DR

GMAC introduces a distributional actor-critic framework that effectively captures value distributions, addressing instability and sample conflation, and demonstrates improved performance in discrete and continuous environments.

Contribution

It proposes a novel distributional actor-critic method using Cramér distance and a Sample-Replacement algorithm, with Gaussian Mixture Model parameterization for enhanced efficiency.

Findings

01

GMAC accurately models value distributions.

02

It improves performance over traditional actor-critic methods.

03

The method is computationally efficient in various environments.

Abstract

In this paper, we devise a distributional framework on actor-critic as a solution to distributional instability, action type restriction, and conflation between samples and statistics. We propose a new method that minimizes the Cram\'er distance with the multi-step Bellman target distribution generated from a novel Sample-Replacement algorithm denoted SR( $λ$ ), which learns the correct value distribution under multiple Bellman operations. Parameterizing a value distribution with Gaussian Mixture Model further improves the efficiency and the performance of the method, which we name GMAC. We empirically show that GMAC captures the correct representation of value distributions and improves the performance of a conventional actor-critic method with low computational cost, in both discrete and continuous action spaces using Arcade Learning Environment (ALE) and PyBullet environment.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

GMAC: A Distributional Perspective on Actor-Critic Framework· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Model Reduction and Neural Networks · Gaussian Processes and Bayesian Inference