Inspiration Learning through Preferences

Nir Baram; Shie Mannor

arXiv:1809.05872 · cs.LG · September 18, 2018

TL;DR

This paper introduces Inspiration Learning, a novel imitation learning framework that enables knowledge transfer between agents with different action spaces using preference-based reinforcement learning and a specialized actor-critic architecture.

Contribution

It proposes a new approach to imitation learning that does not require shared action spaces, utilizing a classifier-based reward and an adapted actor-critic method.
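
As a rough illustration of what a classifier-based reward could look like, the sketch below trains a logistic-regression classifier to tell expert states from agent states and uses its log-probability of "expert-like" as the reward. This is a minimal sketch under stated assumptions, not the paper's implementation: the function names `train_state_classifier` and `classifier_reward`, the linear model, the hyperparameters, and the synthetic 2-D states are all hypothetical. The only point it relies on is that the reward is computed from states alone, so the expert and the agent need not share an action space.

```python
import numpy as np


def train_state_classifier(expert_states, agent_states, lr=0.1, steps=500):
    """Fit logistic regression: expert states get label 1, agent states label 0.

    Hypothetical helper for illustration; only states are used, never actions.
    """
    X = np.vstack([expert_states, agent_states])
    y = np.concatenate([np.ones(len(expert_states)), np.zeros(len(agent_states))])
    w = np.zeros(X.shape[1])
    b = 0.0
    for _ in range(steps):
        p = 1.0 / (1.0 + np.exp(-(X @ w + b)))  # predicted P(expert | state)
        grad = p - y                             # gradient of the log-loss w.r.t. the logit
        w -= lr * (X.T @ grad) / len(y)
        b -= lr * grad.mean()
    return w, b


def classifier_reward(state, w, b):
    """Reward = log P(expert | state), computed stably as log sigmoid(logit)."""
    logit = state @ w + b
    return -np.logaddexp(0.0, -logit)


# Synthetic data (an assumption for the demo): expert states cluster around
# (+2, +2), agent states around (-2, -2).
rng = np.random.default_rng(0)
expert_states = rng.normal(loc=2.0, scale=1.0, size=(200, 2))
agent_states = rng.normal(loc=-2.0, scale=1.0, size=(200, 2))

w, b = train_state_classifier(expert_states, agent_states)
r_expert_like = classifier_reward(np.array([2.0, 2.0]), w, b)
r_agent_like = classifier_reward(np.array([-2.0, -2.0]), w, b)
```

In this toy setup, states that resemble the expert's receive a higher (less negative) reward than states that resemble the agent's, and such a signal could in principle be fed to any actor-critic learner regardless of how the agent's actions are parameterized.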

Findings

01 Successfully extends imitation learning to different action spaces

02 Capable of continuous-to-discrete and primitive-to-macro imitation

03 Demonstrates effective transfer in diverse agent configurations

Abstract

Current imitation learning techniques are too restrictive because they require the agent and expert to share the same action space. However, agents that act differently from the expert can often solve the task just as well. For example, a person lifting a box can be imitated by a ceiling-mounted robot or a desktop-based robotic arm. In both cases, the end goal of lifting the box is achieved, perhaps using different strategies. We denote this setup as Inspiration Learning: knowledge transfer between agents that operate in different action spaces. Since state-action expert demonstrations can no longer be used, Inspiration Learning requires novel methods to guide the agent towards the end goal. In this work, we rely on ideas from Preference-based Reinforcement Learning (PbRL) to design Advantage Actor-Critic algorithms for solving Inspiration Learning tasks. Unlike classic…

Taxonomy

Topics: Reinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Artificial Intelligence in Games