Generalized Inverse Planning: Learning Lifted non-Markovian Utility for   Generalizable Task Representation

Sirui Xie; Feng Gao; Song-Chun Zhu

arXiv:2011.09854·cs.LG·November 20, 2020

Generalized Inverse Planning: Learning Lifted non-Markovian Utility for Generalizable Task Representation

Sirui Xie, Feng Gao, Song-Chun Zhu

PDF

Open Access

TL;DR

This paper introduces a new framework called Generalized Inverse Planning that learns non-Markovian, lifted utility functions from demonstrations, enabling generalizable task representations across different environments and structural changes.

Contribution

It proposes a novel generalization framework for utility learning that extends beyond traditional IRL, incorporating non-Markovian and lifted utilities for better transferability.

Findings

01

MEIP effectively learns non-Markovian utility functions from demonstrations.

02

The approach generalizes across different planning problems and structural changes.

03

Successful application to folding tasks demonstrates practical utility.

Abstract

In searching for a generalizable representation of temporally extended tasks, we spot two necessary constituents: the utility needs to be non-Markovian to transfer temporal relations invariant to a probability shift, the utility also needs to be lifted to abstract out specific grounding objects. In this work, we study learning such utility from human demonstrations. While inverse reinforcement learning (IRL) has been accepted as a general framework of utility learning, its fundamental formulation is one concrete Markov Decision Process. Thus the learned reward function does not specify the task independently of the environment. Going beyond that, we define a domain of generalization that spans a set of planning problems following a schema. We hence propose a new quest, Generalized Inverse Planning, for utility learning in this domain. We further outline a computational framework,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · AI-based Problem Solving and Planning · Machine Learning and Algorithms