Cognitive Insights and Stable Coalition Matching for Fostering Multi-Agent Cooperation
Jiaqi Shao, Tianjun Yuan, Tao Lin, Bing Luo

TL;DR
This paper introduces a coalition matching mechanism that uses cognitive theory of mind insights to form stable, cooperative multi-agent groups, enhancing system performance through belief alignment and specialized abilities.
Contribution
It presents a novel coalition matching algorithm that explicitly considers agents' ToM levels and belief alignment to foster cooperation in multi-agent systems.
Findings
Higher ToM does not always lead to better cooperation.
The proposed matching algorithm finds stable, cooperative coalitions.
Cognitive insights improve multi-agent coordination strategies.
Abstract
Cognitive abilities, such as Theory of Mind (ToM), play a vital role in facilitating cooperation in human social interactions. However, our study reveals that agents with higher ToM abilities may not necessarily exhibit better cooperative behavior compared to those with lower ToM abilities. To address this challenge, we propose a novel matching coalition mechanism that leverages the strengths of agents with different ToM levels by explicitly considering belief alignment and specialized abilities when forming coalitions. Our proposed matching algorithm seeks to find stable coalitions that maximize the potential for cooperative behavior and ensure long-term viability. By incorporating cognitive insights into the design of multi-agent systems, our work demonstrates the potential of leveraging ToM to create more sophisticated and human-like coordination strategies that foster cooperation…
Peer Reviews
Decision·Submitted to ICLR 2025
- The related work is clear and concise. - The paper is well motivated - The empirical results for HumanEval demonstrating the effectiveness of the matching mechanism to match agents to those that they are able to accurately predict beliefs about is promising. This is alongside promising improvements in terms of Pass@1 rates. - The empirical results are similarly promising in terms of problem solving and general reasoning.
- Whilst the authors do mention that the coalition formation is generally an NP-hard problem, they do not offer any ideas about potential future possibilities that would help with the scalability of the framework - I do not understand the prompt referenced in Appendix A and the corresponding LLM output. The belief model is rather vague, and when looking at the output of the alignment scores it seems a bit arbitrary - e.g. the belief model does not mention using an object oriented approach, but i
The paper is easy to follow, with the appendix clearly aiding in understanding how the models function. There is a clear logic as to why each component is added, this is shown through the experimentation and the results. Especially the need for adding coalition matching on top of the theory of mind. There is a clear increase in Pass@1 the iterative programming environment with this model. There is a clear increase in accuracy in the logic and reasoning problems compared to existing methods.
The calculation of the semantic similarity of beliefs and actions is left to the LLM, this does not lend itself to a general approach as the title alludes to. It is made clear throughout the paper that this is applied to LLMs however and I do not see this as a big weakness, but would like to see this made clear in the title if possible. In the debate environment the baselines where both affirmative and negative lead to a bias of the affirmative winning 65.45% of the time, that is they are both
The introduction and related work section motivate the research pretty well. The idea of guiding multi-agent collaboration through ToM and belief alignment is novel. The authors conduct comprehensive evaluations across diverse task scenarios and base LLMs, presenting both quantitative and qualitative results.
The ToM formulation presented in Section 4.1 deviates from the common definition of higher-order ToM. When conducting recursive ToM inferences at level-k, agents are only given their own belief at level-(k-1) rather than the beliefs of other agents. I recommend that the authors refer to [1] for its definition of higher-order mental state inference. The proposed alignment measurement in Section 4.2 may not apply to general high-order ToM inferences in multi-agent systems. For example, “what I th
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEvolutionary Game Theory and Cooperation
