Causal Policy Gradient for Whole-Body Mobile Manipulation

Jiaheng Hu; Peter Stone; Roberto Mart\'in-Mart\'in

arXiv:2305.04866·cs.RO·September 29, 2023·1 cites

Causal Policy Gradient for Whole-Body Mobile Manipulation

Jiaheng Hu, Peter Stone, Roberto Mart\'in-Mart\'in

PDF

Open Access

TL;DR

This paper introduces Causal MoMa, a reinforcement learning framework that automatically discovers causal dependencies in robot actions to improve whole-body mobile manipulation, enabling efficient and simultaneous control of locomotion and interaction in complex tasks.

Contribution

It proposes a causal policy gradient method that reduces gradient variance and automatically discovers action dependencies, enhancing training efficiency and policy performance for mobile manipulation tasks.

Findings

01

Successful transfer of policies from simulation to real robots

02

Improved convergence and results over previous RL algorithms

03

Effective control of robot's whole-body in dynamic environments

Abstract

Developing the next generation of household robot helpers requires combining locomotion and interaction capabilities, which is generally referred to as mobile manipulation (MoMa). MoMa tasks are difficult due to the large action space of the robot and the common multi-objective nature of the task, e.g., efficiently reaching a goal while avoiding obstacles. Current approaches often segregate tasks into navigation without manipulation and stationary manipulation without locomotion by manually matching parts of the action space to MoMa sub-objectives (e.g. learning base actions for locomotion objectives and learning arm actions for manipulation). This solution prevents simultaneous combinations of locomotion and interaction degrees of freedom and requires human domain knowledge for both partitioning the action space and matching the action parts to the sub-objectives. In this paper, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSocial Robot Interaction and HRI · Reinforcement Learning in Robotics · Robotic Locomotion and Control

MethodsBalanced Selection