A Game-Theoretic Model and Best-Response Learning Method for Ad Hoc Coordination in Multiagent Systems
Stefano V. Albrecht, Subramanian Ramamoorthy

TL;DR
This paper introduces a game-theoretic model and a novel learning method called HBA for ad hoc coordination in multiagent systems, demonstrating improved flexibility, efficiency, and human interaction outcomes.
Contribution
It formalizes the ad hoc coordination problem using stochastic Bayesian games and proposes HBA, a solution leveraging Bayesian Nash equilibrium for optimal decision-making.
Findings
HBA outperforms alternative algorithms in a logistics domain.
HBA achieves higher human-welfare and winning rates in experiments.
HBA demonstrates comparable efficiency to humans in game scenarios.
Abstract
The ad hoc coordination problem is to design an autonomous agent which is able to achieve optimal flexibility and efficiency in a multiagent system with no mechanisms for prior coordination. We conceptualise this problem formally using a game-theoretic model, called the stochastic Bayesian game, in which the behaviour of a player is determined by its private information, or type. Based on this model, we derive a solution, called Harsanyi-Bellman Ad Hoc Coordination (HBA), which utilises the concept of Bayesian Nash equilibrium in a planning procedure to find optimal actions in the sense of Bellman optimal control. We evaluate HBA in a multiagent logistics domain called level-based foraging, showing that it achieves higher flexibility and efficiency than several alternative algorithms. We also report on a human-machine experiment at a public science exhibition in which the human…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Game Theory and Applications · Auction Theory and Applications
