Deviate or Not: Learning Coalition Structures with Multiple-bit Observations in Games
Yixuan Even Xu, Zhe Feng, Fei Fang

TL;DR
This paper introduces a method to efficiently learn hidden coalition structures in multi-agent systems by designing specific games and analyzing agents' deviations, achieving near-optimal learning rounds.
Contribution
It presents the first logarithmic-round algorithm for coalition structure learning in multi-agent systems, matching the theoretical lower bound, and extends it to practical game formats.
Findings
Learning can be achieved in O(log n) rounds with unrestricted game design.
Algorithms are developed for specific game formats with near-optimal complexity.
Theoretical bounds are established for the number of rounds needed to learn coalition structures.
Abstract
We consider the Coalition Structure Learning (CSL) problem in multi-agent systems, motivated by the existence of coalitions in many real-world systems, e.g., trading platforms and auction systems. In this problem, there is a hidden coalition structure within a set of agents, which affects the behavior of the agents in games. Our goal is to actively design a sequence of games for the agents to play, such that observations in these games can be used to learn the hidden coalition structure. In particular, we consider the setting where in each round, we design and present a game together with a strategy profile to the agents, and receive a multiple-bit observation -- for each agent, we observe whether or not they would like to deviate from the specified strategy. We show that we can learn the coalition structure in rounds if we are allowed to design any normal-form game,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning and Algorithms · Reinforcement Learning in Robotics · Advanced Bandit Algorithms Research
