Self-Explaining Deviations for Coordination

Hengyuan Hu; Samuel Sokota; David Wu; Anton Bakhtin; Andrei Lupu,; Brandon Cui; Jakob N. Foerster

arXiv:2207.12322·cs.AI·July 26, 2022

Self-Explaining Deviations for Coordination

Hengyuan Hu, Samuel Sokota, David Wu, Anton Bakhtin, Andrei Lupu,, Brandon Cui, Jakob N. Foerster

PDF

Open Access 1 Video

TL;DR

This paper introduces a novel algorithm, IMPROVISED, enabling agents in multi-agent coordination tasks to perform self-explaining deviations that reveal abnormal circumstances, enhancing understanding and strategic play.

Contribution

The paper formalizes self-explaining deviations and presents IMPROVISED, the first method to generate finesse plays in Hanabi, demonstrating improved coordination and theory of mind capabilities.

Findings

01

IMPROVISED successfully performs self-explaining deviations in toy and Hanabi settings.

02

It is the first method to produce finesse plays in Hanabi.

03

Results show enhanced agent coordination and interpretability.

Abstract

Fully cooperative, partially observable multi-agent problems are ubiquitous in the real world. In this paper, we focus on a specific subclass of coordination problems in which humans are able to discover self-explaining deviations (SEDs). SEDs are actions that deviate from the common understanding of what reasonable behavior would be in normal circumstances. They are taken with the intention of causing another agent or other agents to realize, using theory of mind, that the circumstance must be abnormal. We first motivate SED with a real world example and formalize its definition. Next, we introduce a novel algorithm, improvement maximizing self-explaining deviations (IMPROVISED), to perform SEDs. Lastly, we evaluate IMPROVISED both in an illustrative toy setting and the popular benchmark setting Hanabi, where it is the first method to produce so called finesse plays, which are regarded…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Self-Explaining Deviations for Coordination· slideslive

Taxonomy

TopicsData Visualization and Analytics · Advanced Text Analysis Techniques · Semantic Web and Ontologies