Soft Actor-Critic for Discrete Action Settings

Petros Christodoulou

arXiv:1910.07207 · cs.LG · October 21, 2019 · 209 cites

TL;DR

This paper introduces a discrete-action version of the Soft Actor-Critic algorithm, making it applicable to important discrete settings and demonstrating competitive performance on Atari games without hyperparameter tuning.

Contribution

The paper develops a novel discrete-action Soft Actor-Critic algorithm and shows its competitive performance on Atari benchmarks without hyperparameter tuning.

Findings

01 Competitive performance on Atari games

02 Effective without hyperparameter tuning

03 Extends Soft Actor-Critic to discrete actions

Abstract

Soft Actor-Critic is a state-of-the-art reinforcement learning algorithm for continuous action settings that is not applicable to discrete action settings. Many important settings involve discrete actions, however, and so here we derive an alternative version of the Soft Actor-Critic algorithm that is applicable to discrete action settings. We then show that, even without any hyperparameter tuning, it is competitive with the tuned model-free state-of-the-art on a selection of games from the Atari suite.
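
The abstract does not spell out the derivation, so the following is only a minimal sketch of one way the discrete case can be handled, under the assumption that the policy outputs a probability for every action. With a finite action set, the expectations over actions in the soft value and policy objectives can then be computed exactly as weighted sums instead of being estimated from sampled actions. The function names (`softmax`, `soft_state_value`, `policy_loss`), the fixed temperature `alpha`, and the small constant added inside the logarithm are illustrative assumptions, not taken from the paper or its repositories.

```python
import numpy as np


def softmax(logits):
    """Turn per-action logits into a categorical policy (rows sum to 1)."""
    z = logits - logits.max(axis=-1, keepdims=True)  # subtract max for stability
    e = np.exp(z)
    return e / e.sum(axis=-1, keepdims=True)


def soft_state_value(q_values, probs, alpha):
    """Exact soft value: sum over actions of pi(a|s) * (Q(s,a) - alpha * log pi(a|s))."""
    log_probs = np.log(probs + 1e-8)  # small constant avoids log(0); an assumption
    return (probs * (q_values - alpha * log_probs)).sum(axis=-1)


def policy_loss(q_values, probs, alpha):
    """Batch mean of sum over actions of pi(a|s) * (alpha * log pi(a|s) - Q(s,a))."""
    log_probs = np.log(probs + 1e-8)
    return (probs * (alpha * log_probs - q_values)).sum(axis=-1).mean()


# Toy batch: 2 states, 4 actions, all-zero Q-values and a uniform policy.
q = np.zeros((2, 4))
pi = softmax(np.zeros((2, 4)))
v = soft_state_value(q, pi, alpha=1.0)   # equals the policy entropy here
loss = policy_loss(q, pi, alpha=1.0)     # equals minus the mean soft value
```

In this toy case the Q-values are zero, so the soft value reduces to the entropy of the uniform policy, and the policy loss is its negative. In a real agent `q` and `pi` would come from neural networks and the loss would be minimised with an automatic-differentiation library.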

Taxonomy

Topics: Reinforcement Learning in Robotics · Artificial Intelligence in Games · Digital Games and Media