Sample-Efficient Regret-Minimizing Double Oracle in Extensive-Form Games
Xiaohang Tang, Chiyuan Wang, Chengdong Ma, Ilija Bogunovic, Stephen, McAleer, Yaodong Yang

TL;DR
This paper introduces AdaDO, an adaptive double oracle method that reduces sample complexity from exponential to polynomial in extensive-form games, supported by a new theoretical framework RMDO and empirical validation.
Contribution
It proposes AdaDO with optimal expansion frequency to improve sample efficiency and introduces RMDO for understanding and guiding sample complexity reduction in double oracle algorithms.
Findings
AdaDO achieves lower sample complexity than existing methods.
Empirical results show AdaDO better approximates NE with fewer samples.
Combining RMDO with warm start enhances convergence and scalability.
Abstract
Extensive-Form Game (EFG) represents a fundamental model for analyzing sequential interactions among multiple agents and the primary challenge to solve it lies in mitigating sample complexity. Existing research indicated that Double Oracle (DO) can reduce the sample complexity dependence on the information set number to the final restricted game size in solving EFG. This is attributed to the early convergence of full-game Nash Equilibrium (NE) through iteratively solving restricted games. However, we prove that the state-of-the-art Extensive-Form Double Oracle (XDO) exhibits \textit{exponential} sample complexity of , due to its exponentially increasing restricted game expansion frequency. Here we introduce Adaptive Double Oracle (AdaDO) to significantly alleviate sample complexity to \textit{polynomial} by deploying the optimal expansion frequency. Furthermore, to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Games · Data Management and Algorithms · Data Mining Algorithms and Applications
