M-EMBER: Tackling Long-Horizon Mobile Manipulation via Factorized Domain Transfer
Bohan Wu, Roberto Martin-Martin, Li Fei-Fei

TL;DR
This paper introduces M-EMBER, a factorized approach that decomposes long-horizon mobile manipulation tasks into primitive skills, enabling robots to perform complex activities like cleaning with improved success.
Contribution
M-EMBER is a novel method that combines skill decomposition, reinforcement learning, and skill composition to address long-horizon mobile manipulation challenges.
Findings
Achieves 53% success rate on cleaning_kitchen task
Successfully decomposes tasks into five learned visual skills
Demonstrates effective long-horizon manipulation in real robot
Abstract
In this paper, we propose a method to create visuomotor mobile manipulation solutions for long-horizon activities. We propose to leverage the recent advances in simulation to train visual solutions for mobile manipulation. While previous works have shown success applying this procedure to autonomous visual navigation and stationary manipulation, applying it to long-horizon visuomotor mobile manipulation is still an open challenge that demands both perceptual and compositional generalization of multiple skills. In this work, we develop Mobile-EMBER, or M-EMBER, a factorized method that decomposes a long-horizon mobile manipulation activity into a repertoire of primitive visual skills, reinforcement-learns each skill, and composes these skills to a long-horizon mobile manipulation activity. On a mobile manipulation robot, we find that M-EMBER completes a long-horizon mobile manipulation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Robotics and Sensor-Based Localization · Robotic Path Planning Algorithms
