Expected Return Symmetries
Darius Muglich, Johannes Forkel, Elise van der Pol, Jakob Foerster

TL;DR
This paper introduces expected return symmetries, a broader class of symmetries in multi-agent environments, enabling agents to achieve better coordination without prior environment knowledge.
Contribution
It proposes expected return symmetries, expanding the types of symmetries used for coordination, and demonstrates improved zero-shot coordination results.
Findings
Agents trained with expected return symmetries outperform those with environment symmetries.
The method requires minimal prior assumptions about the environment.
Expected return symmetries encompass environment symmetries as a subgroup.
Abstract
Symmetry is an important inductive bias that can improve model robustness and generalization across many deep learning domains. In multi-agent settings, a priori known symmetries have been shown to address a fundamental coordination failure mode known as mutually incompatible symmetry breaking; e.g. in a game where two independent agents can choose to move "left'' or "right'', and where a reward of +1 or -1 is received when the agents choose the same action or different actions, respectively. However, the efficient and automatic discovery of environment symmetries, in particular for decentralized partially observable Markov decision processes, remains an open problem. Furthermore, environmental symmetry breaking constitutes only one type of coordination failure, which motivates the search for a more accessible and broader symmetry class. In this paper, we introduce such a broader group…
Peer Reviews
Decision·ICLR 2025 Poster
1. The organization of this paper is well-structured. 2. The paper conducts experiments in several zero-shot coordination tasks and gives comprehensive analysis.
As I'm not familiar with this field, it is hard for me to give constructive suggestions to the authors.
- The writing is clear, concise, and detailed. - The approach is well-motivated theoretically, and seemingly well-grounded in prior literature. - Discussion of prior work appears complete. - The experiments are compelling, and validate the claims made in the paper. - Future directions are compelling. - Work is likely to be of interest to the broader field.
- Limitations are not adequately addressed. I would suggested addressing this explicitly in the conclusion, between the summary and discussion of future directions. - Readability: - Figure 2 graph text is far too small. - What purpose do the black horizontal bars at the top and bottom of Figure 1's image serve? Seems the text would fit much better without these.
+ The introduction of ER symmetry seems to be a new contribution, extending the symmetry-breaking methods beyond simple action and observation relabeling. + The use of various toy examples throughout the paper makes the key definitions and concepts easy to understand. + The proposed approach is solid, and its effectiveness is underpinned by the with well-defined constructions of equivalence classes.
- Despite the use of the toy examples and the effort by the authors, I feel the writing could be further improved for readability. Some sections are still a bit challenging to follow. - The experiments largely focus on the Hanabi environment, while the results on overcook and cat/dog environments are very brief. More complex environments, especially those with continuous action and state spaces, should be used to demonstrate the effectiveness and scalability of the ER symmetry method, which un
Videos
Taxonomy
TopicsQuantum chaos and dynamical systems
