Lazy-CFR: fast and near optimal regret minimization for extensive games   with imperfect information

Yichi Zhou; Tongzheng Ren; Jialian Li; Dong Yan; Jun Zhu

arXiv:1810.04433·cs.LG·December 27, 2018·5 cites

Lazy-CFR: fast and near optimal regret minimization for extensive games with imperfect information

Yichi Zhou, Tongzheng Ren, Jialian Li, Dong Yan, Jun Zhu

PDF

Open Access

TL;DR

Lazy-CFR introduces a novel lazy update technique that reduces computation in extensive games with imperfect information, achieving near-optimal regret bounds and significantly faster convergence than traditional CFR methods.

Contribution

The paper proposes Lazy-CFR, a new variant of CFR that reduces traversal complexity and tightens regret bounds through a novel lazy update technique and analysis.

Findings

01

Lazy-CFR needs only O(√|I|) information set traversals per round.

02

Lazy-CFR achieves almost the same regret bounds as vanilla CFR.

03

Experimental results show Lazy-CFR outperforms vanilla CFR significantly.

Abstract

Counterfactual regret minimization (CFR) is the most popular algorithm on solving two-player zero-sum extensive games with imperfect information and achieves state-of-the-art performance in practice. However, the performance of CFR is not fully understood, since empirical results on the regret are much better than the upper bound proved in \cite{zinkevich2008regret}. Another issue is that CFR has to traverse the whole game tree in each round, which is time-consuming in large scale games. In this paper, we present a novel technique, lazy update, which can avoid traversing the whole game tree in CFR, as well as a novel analysis on the regret of CFR with lazy update. Our analysis can also be applied to the vanilla CFR, resulting in a much tighter regret bound than that in \cite{zinkevich2008regret}. Inspired by lazy update, we further present a novel CFR variant, named Lazy-CFR. Compared…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Bandit Algorithms Research · Artificial Intelligence in Games · Reinforcement Learning in Robotics