Causal Policy Gradient for Whole-Body Mobile Manipulation
Jiaheng Hu, Peter Stone, Roberto Mart\'in-Mart\'in

TL;DR
This paper introduces Causal MoMa, a reinforcement learning framework that automatically discovers causal dependencies in robot actions to improve whole-body mobile manipulation, enabling efficient and simultaneous control of locomotion and interaction in complex tasks.
Contribution
It proposes a causal policy gradient method that reduces gradient variance and automatically discovers action dependencies, enhancing training efficiency and policy performance for mobile manipulation tasks.
Findings
Successful transfer of policies from simulation to real robots
Improved convergence and results over previous RL algorithms
Effective control of robot's whole-body in dynamic environments
Abstract
Developing the next generation of household robot helpers requires combining locomotion and interaction capabilities, which is generally referred to as mobile manipulation (MoMa). MoMa tasks are difficult due to the large action space of the robot and the common multi-objective nature of the task, e.g., efficiently reaching a goal while avoiding obstacles. Current approaches often segregate tasks into navigation without manipulation and stationary manipulation without locomotion by manually matching parts of the action space to MoMa sub-objectives (e.g. learning base actions for locomotion objectives and learning arm actions for manipulation). This solution prevents simultaneous combinations of locomotion and interaction degrees of freedom and requires human domain knowledge for both partitioning the action space and matching the action parts to the sub-objectives. In this paper, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI · Reinforcement Learning in Robotics · Robotic Locomotion and Control
MethodsBalanced Selection
