Meta-Policy Learning over Plan Ensembles for Robust Articulated Object   Manipulation

Constantinos Chamzas; Caelan Garrett; Balakumar Sundaralingam; Lydia; E. Kavraki; Dieter Fox

arXiv:2307.04040·cs.RO·July 11, 2023

Meta-Policy Learning over Plan Ensembles for Robust Articulated Object Manipulation

Constantinos Chamzas, Caelan Garrett, Balakumar Sundaralingam, Lydia, E. Kavraki, Dieter Fox

PDF

Open Access

TL;DR

This paper introduces a meta-policy learning approach over plan ensembles that combines geometric planning with learned policies to achieve robust and efficient articulated object manipulation, demonstrated on a robot pushing a door.

Contribution

It proposes a novel method that integrates geometric planning with meta-policy learning to improve robustness and data efficiency in manipulation tasks.

Findings

01

Meta-policy successfully learned to push a door with minimal data.

02

Method demonstrated robustness to environmental model uncertainties.

03

Effective on a 7-DOF robot in simulation.

Abstract

Recent work has shown that complex manipulation skills, such as pushing or pouring, can be learned through state-of-the-art learning based techniques, such as Reinforcement Learning (RL). However, these methods often have high sample-complexity, are susceptible to domain changes, and produce unsafe motions that a robot should not perform. On the other hand, purely geometric model-based planning can produce complex behaviors that satisfy all the geometric constraints of the robot but might not be dynamically feasible for a given environment. In this work, we leverage a geometric model-based planner to build a mixture of path-policies on which a task-specific meta-policy can be learned to complete the task. In our results, we demonstrate that a successful meta-policy can be learned to push a door, while requiring little data and being robust to model uncertainty of the environment. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Robot Manipulation and Learning · Robotic Path Planning Algorithms