Co-Learning Empirical Games and World Models
Max Olan Smith, Michael P. Wellman

TL;DR
This paper introduces Dyna-PSRO, a novel algorithm that co-learns empirical games and world models to improve strategic decision-making, achieving lower regret and requiring fewer experiences in complex game environments.
Contribution
The paper proposes a new co-learning algorithm, Dyna-PSRO, that integrates empirical game modeling with world dynamics to enhance strategic planning and efficiency.
Findings
Dyna-PSRO achieves lower regret than PSRO in experiments.
Dyna-PSRO requires fewer experiences to learn effective strategies.
Co-learning empirical games and world models improves decision-making in complex games.
Abstract
Game-based decision-making involves reasoning over both world dynamics and strategic interactions among the agents. Typically, empirical models capturing these respective aspects are learned and used separately. We investigate the potential gain from co-learning these elements: a world model for dynamics and an empirical game for strategic interactions. Empirical games drive world models toward a broader consideration of possible game dynamics induced by a diversity of strategy profiles. Conversely, world models guide empirical games to efficiently discover new strategies through planning. We demonstrate these benefits first independently, then in combination as realized by a new algorithm, Dyna-PSRO, that co-learns an empirical game and a world model. When compared to PSRO -- a baseline empirical-game building algorithm, Dyna-PSRO is found to compute lower regret solutions on partially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Artificial Intelligence in Games · Explainable Artificial Intelligence (XAI)
