Data-driven Planning via Imitation Learning

Sanjiban Choudhury; Mohak Bhardwaj; Sankalp Arora; Ashish Kapoor,; Gireeja Ranade; Sebastian Scherer; Debadeepta Dey

arXiv:1711.06391·cs.RO·November 20, 2017

Data-driven Planning via Imitation Learning

Sanjiban Choudhury, Mohak Bhardwaj, Sankalp Arora, Ashish Kapoor,, Gireeja Ranade, Sebastian Scherer, Debadeepta Dey

PDF

TL;DR

This paper introduces a data-driven imitation learning framework that trains planning policies by mimicking a clairvoyant oracle with full environment knowledge, improving efficiency and performance in robot planning tasks.

Contribution

The paper presents a novel imitation learning approach for planning that leverages a clairvoyant oracle to train policies efficiently under partial information.

Findings

01

Learnt policies outperform state-of-the-art algorithms in various environments.

02

The approach is validated on real UAV experiments.

03

Provides performance guarantees for the learned planning policies.

Abstract

Robot planning is the process of selecting a sequence of actions that optimize for a task specific objective. The optimal solutions to such tasks are heavily influenced by the implicit structure in the environment, i.e. the configuration of objects in the world. State-of-the-art planning approaches, however, do not exploit this structure, thereby expending valuable effort searching the action space instead of focusing on potentially good actions. In this paper, we address the problem of enabling planners to adapt their search strategies by inferring such good actions in an efficient manner using only the information uncovered by the search up until that time. We formulate this as a problem of sequential decision making under uncertainty where at a given iteration a planning policy must map the state of the search to a planning action. Unfortunately, the training process for such partial…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.