Robot Learning of Mobile Manipulation with Reachability Behavior Priors
Snehal Jauhri, Jan Peters, Georgia Chalvatzaki

TL;DR
This paper introduces a novel reinforcement learning approach that integrates reachability priors and residual Q-function modeling to efficiently learn mobile manipulation tasks, demonstrating superior performance in simulation and real robot transfer.
Contribution
The work presents a new Hybrid RL method with Gumbel-Softmax reparameterization, a residual Q-function learning technique called BHyRL, and effective transfer to a real robot for mobile manipulation.
Findings
BHyRL outperforms baseline methods in simulation tasks.
Reachability priors accelerate learning and improve policy performance.
Successful zero-transfer of policies to a real robot, TIAGo++.
Abstract
Mobile Manipulation (MM) systems are ideal candidates for taking up the role of a personal assistant in unstructured real-world environments. Among other challenges, MM requires effective coordination of the robot's embodiments for executing tasks that require both mobility and manipulation. Reinforcement Learning (RL) holds the promise of endowing robots with adaptive behaviors, but most methods require prohibitively large amounts of data for learning a useful control policy. In this work, we study the integration of robotic reachability priors in actor-critic RL methods for accelerating the learning of MM for reaching and fetching tasks. Namely, we consider the problem of optimal base placement and the subsequent decision of whether to activate the arm for reaching a 6D target. For this, we devise a novel Hybrid RL method that handles discrete and continuous actions jointly, resorting…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsBalanced Selection
