Action Branching Architectures for Deep Reinforcement Learning
Arash Tavakoli, Fabio Pardo, Petar Kormushev

TL;DR
This paper introduces a neural architecture with shared decision modules and multiple branches for each action dimension, enabling scalable deep reinforcement learning in high-dimensional and continuous action spaces, and demonstrates its effectiveness on control tasks.
Contribution
The paper proposes a novel branching architecture for deep RL that scales linearly with action dimensions, including a new agent called Branching Dueling Q-Network (BDQ).
Findings
BDQ scales well with increasing action dimensions.
Shared decision modules improve coordination among branches.
BDQ performs competitively with DDPG on control tasks.
Abstract
Discrete-action algorithms have been central to numerous recent successes of deep reinforcement learning. However, applying these algorithms to high-dimensional action tasks requires tackling the combinatorial increase of the number of possible actions with the number of action dimensions. This problem is further exacerbated for continuous-action tasks that require fine control of actions via discretization. In this paper, we propose a novel neural architecture featuring a shared decision module followed by several network branches, one for each action dimension. This approach achieves a linear increase of the number of network outputs with the number of degrees of freedom by allowing a level of independence for each individual action dimension. To illustrate the approach, we present a novel agent, called Branching Dueling Q-Network (BDQ), as a branching variant of the Dueling Double…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Evolutionary Algorithms and Applications
