Adaptive Teaching in Heterogeneous Agents: Balancing Surprise in Sparse Reward Scenarios

Emma Clark; Kanghyun Ryu; Negar Mehr

arXiv:2405.14199 · cs.RO · May 24, 2024

TL;DR

This paper introduces a Teacher-Student learning framework that uses the concept of surprise to adapt demonstrations for heterogeneous agents, improving learning efficiency in sparse-reward control tasks.

Contribution

It proposes a novel surprise-based method for tailoring demonstrations to heterogeneous agents, addressing capability discrepancies in Learning from Demonstration.

Findings

01 Enhanced learning efficiency in sparse-reward environments
02 Effective adaptation of demonstrations to agent capabilities
03 Improved control performance in heterogeneous agent scenarios

Abstract

Learning from Demonstration (LfD) can be an efficient way to train systems with analogous agents by enabling "Student" agents to learn from the demonstrations of the most experienced "Teacher" agent, instead of training their policy in parallel. However, when there are discrepancies in agent capabilities, such as divergent actuator power or joint angle constraints, naively replicating demonstrations that are out of bounds for the Student's capability can limit efficient learning. We present a Teacher-Student learning framework specifically tailored to address the challenge of heterogeneity between the Teacher and Student agents. Our framework is based on the concept of "surprise", inspired by its application in exploration incentivization in sparse-reward environments. Surprise is repurposed to enable the Teacher to detect and adapt to differences between itself and the Student.…
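
The abstract does not define surprise formally. As a rough illustration only, the sketch below uses one formulation common in the surprise-based exploration literature: the negative log-likelihood of an observed transition under a dynamics model. Everything here is an assumption made for illustration and is not taken from the paper or its repository: the names `gaussian_surprise`, `mean_surprise`, and `teacher_model`, the isotropic Gaussian with fixed `sigma`, and the toy one-dimensional dynamics in which the Student's actuator has half the Teacher's power.

```python
import math


def gaussian_surprise(predicted_next_state, observed_next_state, sigma=1.0):
    """Negative log-likelihood of the observed next state under an isotropic
    Gaussian centered on the model's prediction (assumed surprise measure)."""
    if len(predicted_next_state) != len(observed_next_state):
        raise ValueError("state dimensions must match")
    dim = len(observed_next_state)
    squared_error = sum(
        (obs - pred) ** 2
        for pred, obs in zip(predicted_next_state, observed_next_state)
    )
    return 0.5 * squared_error / sigma**2 + 0.5 * dim * math.log(
        2 * math.pi * sigma**2
    )


def mean_surprise(transitions, model, sigma=1.0):
    """Average surprise over (state, action, next_state) transitions."""
    total = 0.0
    for state, action, next_state in transitions:
        total += gaussian_surprise(model(state, action), next_state, sigma)
    return total / len(transitions)


def teacher_model(state, action):
    """Hypothetical Teacher dynamics: the action is applied at full strength."""
    return [s + a for s, a in zip(state, action)]


# Transitions generated by the Teacher match its own model exactly.
teacher_transitions = [([0.0], [1.0], [1.0]), ([1.0], [1.0], [2.0])]
# Hypothetical Student with half the actuator power: same actions, smaller moves.
student_transitions = [([0.0], [1.0], [0.5]), ([0.5], [1.0], [1.0])]

teacher_surprise = mean_surprise(teacher_transitions, teacher_model)
student_surprise = mean_surprise(student_transitions, teacher_model)
```

Under these assumptions, the Student's transitions score a higher mean surprise than the Teacher's own, which is the kind of signal a Teacher could use to notice a capability gap. How the paper actually computes and balances surprise may differ.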

Code & Models

Repositories

labicon/Surprise_based_Teaching (PyTorch, official)

Taxonomy

Topics: Reinforcement Learning in Robotics