An LP-Based Approach for Bilinear Saddle Point Problem with Instance-dependent Guarantee and Noisy Feedback
Jiashuo Jiang, Mengxiao Zhang

TL;DR
This paper introduces a novel LP-based algorithm for estimating Nash equilibria in two-player zero-sum matrix games with noisy feedback, providing the first instance-dependent sample complexity bounds for general matrices.
Contribution
The paper proposes a new LP-based method that achieves instance-dependent sample complexity bounds for Nash equilibrium estimation in noisy, general-dimension matrix games.
Findings
First instance-dependent sample complexity bound for noisy matrix games
Algorithm identifies NE support and computes NE with controlled bias
Sample complexity depends on problem-specific constants
Abstract
In this work, we study the sample complexity of obtaining a Nash equilibrium (NE) estimate in two-player zero-sum matrix games with noisy feedback. Specifically, we propose a novel algorithm that repeatedly solves linear programs (LPs) to obtain an NE estimate with bias at most with a sample complexity of for general game matrices, where , , are some problem-dependent constants. To our knowledge, this is the first instance-dependent sample complexity bound for finding an NE estimate with bias in general-dimension matrix games with noisy feedback and potentially non-unique equilibria. Our algorithm builds on recent advances in online resource allocation and operates in two stages: (1) identifying the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptimization and Variational Analysis · Game Theory and Applications · Advanced Optimization Algorithms Research
