Expected Return Symmetries

Darius Muglich; Johannes Forkel; Elise van der Pol; Jakob Foerster

arXiv:2502.01711·cs.MA·March 13, 2025

Expected Return Symmetries

Darius Muglich, Johannes Forkel, Elise van der Pol, Jakob Foerster

PDF

Open Access 1 Video 3 Reviews

TL;DR

This paper introduces expected return symmetries, a broader class of symmetries in multi-agent environments, enabling agents to achieve better coordination without prior environment knowledge.

Contribution

It proposes expected return symmetries, expanding the types of symmetries used for coordination, and demonstrates improved zero-shot coordination results.

Findings

01

Agents trained with expected return symmetries outperform those with environment symmetries.

02

The method requires minimal prior assumptions about the environment.

03

Expected return symmetries encompass environment symmetries as a subgroup.

Abstract

Symmetry is an important inductive bias that can improve model robustness and generalization across many deep learning domains. In multi-agent settings, a priori known symmetries have been shown to address a fundamental coordination failure mode known as mutually incompatible symmetry breaking; e.g. in a game where two independent agents can choose to move "left'' or "right'', and where a reward of +1 or -1 is received when the agents choose the same action or different actions, respectively. However, the efficient and automatic discovery of environment symmetries, in particular for decentralized partially observable Markov decision processes, remains an open problem. Furthermore, environmental symmetry breaking constitutes only one type of coordination failure, which motivates the search for a more accessible and broader symmetry class. In this paper, we introduce such a broader group…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 6Confidence 1

Strengths

1. The organization of this paper is well-structured. 2. The paper conducts experiments in several zero-shot coordination tasks and gives comprehensive analysis.

Weaknesses

As I'm not familiar with this field, it is hard for me to give constructive suggestions to the authors.

Reviewer 02Rating 8Confidence 3

Strengths

- The writing is clear, concise, and detailed. - The approach is well-motivated theoretically, and seemingly well-grounded in prior literature. - Discussion of prior work appears complete. - The experiments are compelling, and validate the claims made in the paper. - Future directions are compelling. - Work is likely to be of interest to the broader field.

Weaknesses

- Limitations are not adequately addressed. I would suggested addressing this explicitly in the conclusion, between the summary and discussion of future directions. - Readability: - Figure 2 graph text is far too small. - What purpose do the black horizontal bars at the top and bottom of Figure 1's image serve? Seems the text would fit much better without these.

Reviewer 03Rating 5Confidence 3

Strengths

+ The introduction of ER symmetry seems to be a new contribution, extending the symmetry-breaking methods beyond simple action and observation relabeling. + The use of various toy examples throughout the paper makes the key definitions and concepts easy to understand. + The proposed approach is solid, and its effectiveness is underpinned by the with well-defined constructions of equivalence classes.

Weaknesses

- Despite the use of the toy examples and the effort by the authors, I feel the writing could be further improved for readability. Some sections are still a bit challenging to follow. - The experiments largely focus on the Hanabi environment, while the results on overcook and cat/dog environments are very brief. More complex environments, especially those with continuous action and state spaces, should be used to demonstrate the effectiveness and scalability of the ER symmetry method, which un

Videos

Expected Return Symmetries· slideslive

Taxonomy

TopicsQuantum chaos and dynamical systems