Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal   Environments

Hugo Caselles-Dupr\'e; Olivier Sigaud; Mohamed Chetouani

arXiv:2206.04546·cs.LG·September 28, 2023·1 cites

Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments

Hugo Caselles-Dupr\'e, Olivier Sigaud, Mohamed Chetouani

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a Bayesian model for goal inference that enhances learning efficiency in multi-goal environments by incorporating pedagogical and pragmatic mechanisms, especially effective with limited demonstrations.

Contribution

It presents a novel Bayesian Goal Inference model that integrates pedagogy and pragmatism, improving multi-goal learning from demonstrations with fewer examples.

Findings

01

Faster learning with BGI-agents compared to standard methods.

02

Reduced goal ambiguity in few demonstrations regimes.

03

Enhanced goal inference accuracy through pedagogical and pragmatic strategies.

Abstract

Learning from demonstration methods usually leverage close to optimal demonstrations to accelerate training. By contrast, when demonstrating a task, human teachers deviate from optimal demonstrations and pedagogically modify their behavior by giving demonstrations that best disambiguate the goal they want to demonstrate. Analogously, human learners excel at pragmatically inferring the intent of the teacher, facilitating communication between the two agents. These mechanisms are critical in the few demonstrations regime, where inferring the goal is more difficult. In this paper, we implement pedagogy and pragmatism mechanisms by leveraging a Bayesian model of Goal Inference from demonstrations (BGI). We highlight the benefits of this model in multi-goal teacher-learner setups with two artificial agents that learn with goal-conditioned Reinforcement Learning. We show that combining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

caselles/neurips22-demonstrations-pedagogy-pragmatism
pytorchOfficial

Videos

Pragmatically Learning from Pedagogical Demonstrations in Multi-Goal Environments· slideslive

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Machine Learning and Data Classification