Theoretical and Practical Advances on Smoothing for Extensive-Form Games

Christian Kroer; Kevin Waugh; Fatma Kilinc-Karzan; Tuomas; Sandholm

arXiv:1702.04849·cs.GT·May 10, 2017·2 cites

Theoretical and Practical Advances on Smoothing for Extensive-Form Games

Christian Kroer, Kevin Waugh, Fatma Kilinc-Karzan, Tuomas, Sandholm

PDF

Open Access

TL;DR

This paper introduces a novel weighting scheme for the dilated entropy function, significantly improving the convergence rates of first-order methods in solving large-scale extensive-form games and surpassing counterfactual regret minimization in practice.

Contribution

It develops a new distance-generating function with no dependence on the branching factor, enhancing first-order methods' efficiency in extensive-form games.

Findings

01

New weighting scheme improves convergence rate by a factor of Ω(b^dd).

02

First-order methods can outperform CFR+ with the new approach.

03

Practical tuning makes the excessive gap technique faster than CFR+.

Abstract

Sparse iterative methods, in particular first-order methods, are known to be among the most effective in solving large-scale two-player zero-sum extensive-form games. The convergence rates of these methods depend heavily on the properties of the distance-generating function that they are based on. We investigate the acceleration of first-order methods for solving extensive-form games through better design of the dilated entropy function---a class of distance-generating functions related to the domains associated with the extensive-form games. By introducing a new weighting scheme for the dilated entropy function, we develop the first distance-generating function for the strategy spaces of sequential games that has no dependence on the branching factor of the player. This result improves the convergence rate of several first-order methods by a factor of $Ω (b^{d} d)$ , where $b$ is the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Games · Advanced Bandit Algorithms Research · Reinforcement Learning in Robotics