Learning Skills to Patch Plans Based on Inaccurate Models

Alex LaGrassa; Steven Lee; Oliver Kroemer

arXiv:2009.13732·cs.RO·May 18, 2021

Learning Skills to Patch Plans Based on Inaccurate Models

Alex LaGrassa, Steven Lee, Oliver Kroemer

PDF

TL;DR

This paper introduces a hybrid approach that combines model-based planning with local model-free policies to improve task success and reduce data requirements when models are inaccurate.

Contribution

The method patches sub-optimal model-based planners with local learned policies only where model failures occur, enhancing reliability and efficiency.

Findings

01

Outperforms pure planning in reliability.

02

Requires less data than pure imitation learning.

03

Effective in shape insertion and door opening tasks.

Abstract

Planners using accurate models can be effective for accomplishing manipulation tasks in the real world, but are typically highly specialized and require significant fine-tuning to be reliable. Meanwhile, learning is useful for adaptation, but can require a substantial amount of data collection. In this paper, we propose a method that improves the efficiency of sub-optimal planners with approximate but simple and fast models by switching to a model-free policy when unexpected transitions are observed. Unlike previous work, our method specifically addresses when the planner fails due to transition model error by patching with a local policy only where needed. First, we use a sub-optimal model-based planner to perform a task until model failure is detected. Next, we learn a local model-free policy from expert demonstrations to complete the task in regions where the model failed. To show…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.