Action Branching Architectures for Deep Reinforcement Learning

Arash Tavakoli; Fabio Pardo; Petar Kormushev

arXiv:1711.08946·cs.LG·January 28, 2019·6 cites

Action Branching Architectures for Deep Reinforcement Learning

Arash Tavakoli, Fabio Pardo, Petar Kormushev

PDF

Open Access 5 Repos

TL;DR

This paper introduces a neural architecture with shared decision modules and multiple branches for each action dimension, enabling scalable deep reinforcement learning in high-dimensional and continuous action spaces, and demonstrates its effectiveness on control tasks.

Contribution

The paper proposes a novel branching architecture for deep RL that scales linearly with action dimensions, including a new agent called Branching Dueling Q-Network (BDQ).

Findings

01

BDQ scales well with increasing action dimensions.

02

Shared decision modules improve coordination among branches.

03

BDQ performs competitively with DDPG on control tasks.

Abstract

Discrete-action algorithms have been central to numerous recent successes of deep reinforcement learning. However, applying these algorithms to high-dimensional action tasks requires tackling the combinatorial increase of the number of possible actions with the number of action dimensions. This problem is further exacerbated for continuous-action tasks that require fine control of actions via discretization. In this paper, we propose a novel neural architecture featuring a shared decision module followed by several network branches, one for each action dimension. This approach achieves a linear increase of the number of network outputs with the number of degrees of freedom by allowing a level of independence for each individual action dimension. To illustrate the approach, we present a novel agent, called Branching Dueling Q-Network (BDQ), as a branching variant of the Dueling Double…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications