Online Convex Optimization for Sequential Decision Processes and   Extensive-Form Games

Gabriele Farina; Christian Kroer; Tuomas Sandholm

arXiv:1809.03075·cs.GT·September 11, 2018

Online Convex Optimization for Sequential Decision Processes and Extensive-Form Games

Gabriele Farina, Christian Kroer, Tuomas Sandholm

PDF

TL;DR

This paper introduces a generalized regret minimization framework for sequential decision processes and extensive-form games with convex sets and losses, extending existing algorithms and enabling scalable solutions for large games.

Contribution

It develops laminar regret decomposition, generalizes CFR to convex settings, and introduces new algorithms for regularized equilibria and opponent exploitation.

Findings

01

Framework scales comparably to fastest CFR variants

02

First algorithm for quantal response equilibria in large games

03

Enables scalable opponent exploitation methods

Abstract

Regret minimization is a powerful tool for solving large-scale extensive-form games. State-of-the-art methods rely on minimizing regret locally at each decision point. In this work we derive a new framework for regret minimization on sequential decision problems and extensive-form games with general compact convex sets at each decision point and general convex losses, as opposed to prior work which has been for simplex decision points and linear losses. We call our framework laminar regret decomposition. It generalizes the CFR algorithm to this more general setting. Furthermore, our framework enables a new proof of CFR even in the known setting, which is derived from a perspective of decomposing polytope regret, thereby leading to an arguably simpler interpretation of the algorithm. Our generalization to convex compact sets and convex losses allows us to develop new algorithms for…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.