Solving Large Extensive-Form Games with Strategy Constraints

Trevor Davis; Kevin Waugh; Michael Bowling

arXiv:1809.07893·cs.GT·February 7, 2019

Solving Large Extensive-Form Games with Strategy Constraints

Trevor Davis, Kevin Waugh, Michael Bowling

PDF

TL;DR

This paper introduces a generalized Counterfactual Regret Minimization algorithm capable of efficiently computing optimal strategies in large extensive-form games with convex strategy constraints, addressing limitations of previous methods.

Contribution

It presents a novel algorithm that handles convex strategy constraints in large games, extending the applicability of equilibrium computation methods.

Findings

01

Effective risk mitigation in security games

02

Opponent modeling in poker with partial information

03

Algorithm scales to large extensive-form games

Abstract

Extensive-form games are a common model for multiagent interactions with imperfect information. In two-player zero-sum games, the typical solution concept is a Nash equilibrium over the unconstrained strategy set for each player. In many situations, however, we would like to constrain the set of possible strategies. For example, constraints are a natural way to model limited resources, risk mitigation, safety, consistency with past observations of behavior, or other secondary objectives for an agent. In small games, optimal strategies under linear constraints can be found by solving a linear program; however, state-of-the-art algorithms for solving large games cannot handle general constraints. In this work we introduce a generalized form of Counterfactual Regret Minimization that provably finds optimal strategies under any feasible set of convex constraints. We demonstrate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.