Near Optimal Convergence to Coarse Correlated Equilibrium in General-Sum Markov Games
Asrin Efe Yorulmaz, Tamer Ba\c{s}ar

TL;DR
This paper presents a new no-regret learning algorithm for general-sum Markov games that significantly improves the convergence rate to Coarse Correlated Equilibrium, especially in high-dimensional settings, matching the best rates known for simpler game types.
Contribution
It introduces an adaptive, stage-wise no-regret algorithm extending recent advances to Markov games, achieving near-optimal convergence rates to CCE.
Findings
Convergence rate improved to O(log T / T) for CCE.
Dependence on action set size reduced to polylogarithmic.
Algorithm achieves the fastest known convergence to CCE in Markov games.
Abstract
No-regret learning dynamics play a central role in game theory, enabling decentralized convergence to equilibrium for concepts such as Coarse Correlated Equilibrium (CCE) or Correlated Equilibrium (CE). In this work, we improve the convergence rate to CCE in general-sum Markov games, reducing it from the previously best-known rate of to a sharper . This matches the best known convergence rate for CE in terms of , number of iterations, while also improving the dependence on the action set size from polynomial to polylogarithmic-yielding exponential gains in high-dimensional settings. Our approach builds on recent advances in adaptive step-size techniques for no-regret algorithms in normal-form games, and extends them to the Markovian setting via a stage-wise scheme that adjusts learning rates based on real-time feedback. We frame…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Reinforcement Learning in Robotics · Game Theory and Applications
