Generalized Beliefs for Cooperative AI

Darius Muglich; Luisa Zintgraf; Christian Schroeder de Witt; Shimon; Whiteson; Jakob Foerster

arXiv:2206.12765·cs.AI·June 28, 2022

Generalized Beliefs for Cooperative AI

Darius Muglich, Luisa Zintgraf, Christian Schroeder de Witt, Shimon, Whiteson, Jakob Foerster

PDF

Open Access

TL;DR

This paper introduces a belief-based learning model for cooperative AI that enables agents to adapt to new conventions and improve teamplay by maintaining beliefs over unseen policies, enhancing explainability and flexibility.

Contribution

It proposes a novel belief learning framework that decodes and adapts to unseen conventions at test time, addressing limitations of prior symmetry-based approaches.

Findings

01

Improved ad-hoc teamplay through belief-based adaptation.

02

Enhanced interpretability of agent conventions.

03

Effective decoding of unseen policies at test time.

Abstract

Self-play is a common paradigm for constructing solutions in Markov games that can yield optimal policies in collaborative settings. However, these policies often adopt highly-specialized conventions that make playing with a novel partner difficult. To address this, recent approaches rely on encoding symmetry and convention-awareness into policy training, but these require strong environmental assumptions and can complicate policy training. We therefore propose moving the learning of conventions to the belief space. Specifically, we propose a belief learning model that can maintain beliefs over rollouts of policies not seen at training time, and can thus decode and adapt to novel conventions at test time. We show how to leverage this model for both search and training of a best response over various pools of policies to greatly improve ad-hoc teamplay. We also show how our setup…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Explainable Artificial Intelligence (XAI) · Topic Modeling

MethodsTest