Sharing Information Without Regret in Managed Stochastic Games
Michael J. Neely

TL;DR
This paper introduces an online algorithm for information sharing in multi-player stochastic games that maximizes collective utility while ensuring individual players do not regret sharing, applicable in non-ergodic scenarios.
Contribution
It develops a novel online algorithm that coordinates players' actions through a game manager to optimize utilities with no regret, extending beyond traditional equilibrium concepts.
Findings
Algorithm guarantees no regret for players over time.
Applicable to non-ergodic, real-world scenarios with arbitrary player actions.
Enhances cooperation without traditional equilibrium assumptions.
Abstract
This paper considers information sharing in a multi-player repeated game. Every round, each player observes a subset of components of a random vector and then takes a control action. The utility earned by each player depends on the full random vector and on the actions of others. An example is a game where different rewards are placed over multiple locations, each player only knows the rewards in a subset of the locations, and players compete to collect the rewards. Sharing information can help others, but can also increase competition for desirable locations. Standard Nash equilibrium and correlated equilibrium concepts are inadequate in this scenario. Instead, this paper develops an algorithm where, every round, all players pass their information and intended actions to a game manager. The manager provides suggested actions for each player that, if taken, maximize a concave function…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Advanced Bandit Algorithms Research · Economic theories and models
