Learning Performance Graphs from Demonstrations via Task-Based   Evaluations

Aniruddh G. Puranic; Jyotirmoy V. Deshmukh; Stefanos Nikolaidis

arXiv:2204.05909·cs.RO·December 20, 2022

Learning Performance Graphs from Demonstrations via Task-Based Evaluations

Aniruddh G. Puranic, Jyotirmoy V. Deshmukh, Stefanos Nikolaidis

PDF

Open Access

TL;DR

This paper introduces an algorithm to automatically learn performance graphs from demonstrations in robot learning, enabling more accurate evaluation of behaviors without manual specification, demonstrated through a highway driving domain.

Contribution

The paper presents a novel algorithm to learn performance graphs directly from demonstrations, reducing manual effort and improving evaluation accuracy in learning from demonstration frameworks.

Findings

01

Learned performance graphs match user-specified priorities in simulations.

02

Reward functions from learned graphs produce similar policies to manual graphs.

03

User study confirms accurate inference of behavior priorities.

Abstract

In the learning from demonstration (LfD) paradigm, understanding and evaluating the demonstrated behaviors plays a critical role in extracting control policies for robots. Without this knowledge, a robot may infer incorrect reward functions that lead to undesirable or unsafe control policies. Recent work has proposed an LfD framework where a user provides a set of formal task specifications to guide LfD, to address the challenge of reward shaping. However, in this framework, specifications are manually ordered in a performance graph (a partial order that specifies relative importance between the specifications). The main contribution of this paper is an algorithm to learn the performance graph directly from the user-provided demonstrations, and show that the reward functions generated using the learned performance graph generate similar policies to those from manually specified…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSoftware Testing and Debugging Techniques · Reinforcement Learning in Robotics · Formal Methods in Verification