Offline Equilibrium Finding
Shuxin Li, Xinrun Wang, Youzhi Zhang, Jakub Cerny, Pengdeng Li, Hau, Chan, Bo An

TL;DR
This paper introduces Offline Equilibrium Finding (OEF), a model-based framework that adapts online equilibrium algorithms to multiplayer offline RL settings, demonstrating superior performance over existing methods.
Contribution
It generalizes online equilibrium algorithms to offline multiplayer game settings and provides a model-based approach with theoretical guarantees.
Findings
Model-based OEF algorithms outperform offline RL baselines.
Adapting PSRO, CFR, and JPSRO to OEF is effective.
Combining behavior cloning improves solution quality.
Abstract
Offline reinforcement learning (offline RL) is an emerging field that has recently begun gaining attention across various application domains due to its ability to learn strategies from earlier collected datasets. Offline RL proved very successful, paving a path to solving previously intractable real-world problems, and we aim to generalize this paradigm to a multiplayer-game setting. To this end, we introduce a problem of offline equilibrium finding (OEF) and construct multiple types of datasets across a wide range of games using several established methods. To solve the OEF problem, we design a model-based framework that can directly apply any online equilibrium finding algorithm to the OEF setting while making minimal changes. The three most prominent contemporary online equilibrium finding algorithms are adapted to the context of OEF, creating three model-based variants: OEF-PSRO…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsReinforcement Learning in Robotics · Sports Analytics and Performance · Experimental Behavioral Economics Studies
