DiGrad: Multi-Task Reinforcement Learning with Shared Actions

Parijat Dewangan; S Phaniteja; K Madhava Krishna; Abhishek Sarkar,; Balaraman Ravindran

arXiv:1802.10463·cs.LG·March 1, 2018·5 cites

DiGrad: Multi-Task Reinforcement Learning with Shared Actions

Parijat Dewangan, S Phaniteja, K Madhava Krishna, Abhishek Sarkar,, Balaraman Ravindran

PDF

Open Access

TL;DR

DiGrad introduces a differential policy gradient method enabling efficient multi-task reinforcement learning with shared actions in complex robotic systems, improving stability and data efficiency over existing approaches.

Contribution

The paper proposes a novel DiGrad framework using differential policy gradients for stable, efficient multi-task learning in continuous action spaces with shared actions.

Findings

01

Supports multi-task learning in complex robots

02

Outperforms related methods in continuous action spaces

03

Effective on robotic manipulators and humanoids

Abstract

Most reinforcement learning algorithms are inefficient for learning multiple tasks in complex robotic systems, where different tasks share a set of actions. In such environments a compound policy may be learnt with shared neural network parameters, which performs multiple tasks concurrently. However such compound policy may get biased towards a task or the gradients from different tasks negate each other, making the learning unstable and sometimes less data efficient. In this paper, we propose a new approach for simultaneous training of multiple tasks sharing a set of common actions in continuous action spaces, which we call as DiGrad (Differential Policy Gradient). The proposed framework is based on differential policy gradients and can accommodate multi-task learning in a single actor-critic network. We also propose a simple heuristic in the differential policy gradient update to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Locomotion and Control