Offline Equilibrium Finding

Shuxin Li; Xinrun Wang; Youzhi Zhang; Jakub Cerny; Pengdeng Li; Hau; Chan; Bo An

arXiv:2207.05285·cs.AI·February 9, 2023

Offline Equilibrium Finding

Shuxin Li, Xinrun Wang, Youzhi Zhang, Jakub Cerny, Pengdeng Li, Hau, Chan, Bo An

PDF

Open Access 1 Repo

TL;DR

This paper introduces Offline Equilibrium Finding (OEF), a model-based framework that adapts online equilibrium algorithms to multiplayer offline RL settings, demonstrating superior performance over existing methods.

Contribution

It generalizes online equilibrium algorithms to offline multiplayer game settings and provides a model-based approach with theoretical guarantees.

Findings

01

Model-based OEF algorithms outperform offline RL baselines.

02

Adapting PSRO, CFR, and JPSRO to OEF is effective.

03

Combining behavior cloning improves solution quality.

Abstract

Offline reinforcement learning (offline RL) is an emerging field that has recently begun gaining attention across various application domains due to its ability to learn strategies from earlier collected datasets. Offline RL proved very successful, paving a path to solving previously intractable real-world problems, and we aim to generalize this paradigm to a multiplayer-game setting. To this end, we introduce a problem of offline equilibrium finding (OEF) and construct multiple types of datasets across a wide range of games using several established methods. To solve the OEF problem, we design a model-based framework that can directly apply any online equilibrium finding algorithm to the OEF setting while making minimal changes. The three most prominent contemporary online equilibrium finding algorithms are adapted to the context of OEF, creating three model-based variants: OEF-PSRO…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

securitygames/oef
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Sports Analytics and Performance · Experimental Behavioral Economics Studies