Meta-Policy Learning over Plan Ensembles for Robust Articulated Object Manipulation
Constantinos Chamzas, Caelan Garrett, Balakumar Sundaralingam, Lydia, E. Kavraki, Dieter Fox

TL;DR
This paper introduces a meta-policy learning approach over plan ensembles that combines geometric planning with learned policies to achieve robust and efficient articulated object manipulation, demonstrated on a robot pushing a door.
Contribution
It proposes a novel method that integrates geometric planning with meta-policy learning to improve robustness and data efficiency in manipulation tasks.
Findings
Meta-policy successfully learned to push a door with minimal data.
Method demonstrated robustness to environmental model uncertainties.
Effective on a 7-DOF robot in simulation.
Abstract
Recent work has shown that complex manipulation skills, such as pushing or pouring, can be learned through state-of-the-art learning based techniques, such as Reinforcement Learning (RL). However, these methods often have high sample-complexity, are susceptible to domain changes, and produce unsafe motions that a robot should not perform. On the other hand, purely geometric model-based planning can produce complex behaviors that satisfy all the geometric constraints of the robot but might not be dynamically feasible for a given environment. In this work, we leverage a geometric model-based planner to build a mixture of path-policies on which a task-specific meta-policy can be learned to complete the task. In our results, we demonstrate that a successful meta-policy can be learned to push a door, while requiring little data and being robust to model uncertainty of the environment. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms
