Inspiration Learning through Preferences
Nir Baram, Shie Mannor

TL;DR
This paper introduces Inspiration Learning, a novel imitation learning framework that enables knowledge transfer between agents with different action spaces using preference-based reinforcement learning and a specialized actor-critic architecture.
Contribution
It proposes a new approach to imitation learning that does not require shared action spaces, utilizing a classifier-based reward and an adapted actor-critic method.
Findings
Successfully extends imitation learning to different action spaces
Capable of continuous-to-discrete and primitive-to-macro imitation
Demonstrates effective transfer in diverse agent configurations
Abstract
Current imitation learning techniques are too restrictive because they require the agent and expert to share the same action space. However, oftentimes agents that act differently from the expert can solve the task just as good. For example, a person lifting a box can be imitated by a ceiling mounted robot or a desktop-based robotic-arm. In both cases, the end goal of lifting the box is achieved, perhaps using different strategies. We denote this setup as \textit{Inspiration Learning} - knowledge transfer between agents that operate in different action spaces. Since state-action expert demonstrations can no longer be used, Inspiration learning requires novel methods to guide the agent towards the end goal. In this work, we rely on ideas of Preferential based Reinforcement Learning (PbRL) to design Advantage Actor-Critic algorithms for solving inspiration learning tasks. Unlike classic…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Adaptive Dynamic Programming Control · Artificial Intelligence in Games
