Semi Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees

Ioannis Panageas; Stratis Skoulakis; Luca Viano; Xiao Wang; Volkan Cevher

arXiv:2306.15543 · cs.GT · June 28, 2023 · 1 citation

TL;DR

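This paper presents a new variant of online gradient descent for congestion games under semi-bandit feedback. It provably converges to Nash Equilibrium and attains sublinear regret, with rates that depend polynomially on the number of players and facilities rather than on the size of the action set, which can be exponentially large.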

Contribution

Introduces a novel online gradient descent method for congestion games that achieves convergence and regret guarantees with polynomial dependence on game parameters.

Findings

01 Proves convergence to Nash Equilibrium for the class of congestion games.

02 Attains sublinear regret in the semi-bandit feedback setting.

03 Convergence rates and running time depend polynomially on the number of players and facilities, not on the size of the action set.

Abstract

In this work, we introduce a new variant of online gradient descent, which provably converges to Nash Equilibria and simultaneously attains sublinear regret for the class of congestion games in the semi-bandit feedback setting. Our proposed method admits convergence rates depending only polynomially on the number of players and the number of facilities, but not on the size of the action set, which can be exponentially large in terms of the number of facilities. Moreover, the running time of our method has polynomial-time dependence on the implicit description of the game. As a result, our work answers an open question from (Du et al., 2022).
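To make the setting in the abstract concrete, here is a minimal Python sketch of one round of a congestion game with semi-bandit feedback. It is an illustration only, not the paper's algorithm: the linear cost function `facility_cost`, the helper names `play_round` and `all_actions`, and the choice to allow every non-empty subset of facilities as an action are all assumptions made for this example.

```python
import itertools


def facility_cost(facility, load):
    # Hypothetical linear cost (an assumption for this sketch):
    # facility f has weight f + 1 and its cost grows with its load.
    return (facility + 1) * load


def play_round(actions):
    """Simulate one round.

    actions: list of frozensets of facility indices, one per player.
    Returns, for each player, a dict mapping each facility the player
    selected to its observed cost. Costs of unselected facilities are
    not revealed, which is what semi-bandit feedback means here.
    """
    loads = {}
    for action in actions:
        for f in action:
            loads[f] = loads.get(f, 0) + 1
    return [{f: facility_cost(f, loads[f]) for f in action} for action in actions]


def all_actions(num_facilities):
    # Assumed worst case: every non-empty subset of facilities is an action,
    # so the action set has 2**num_facilities - 1 elements.
    facilities = range(num_facilities)
    return [
        frozenset(combo)
        for size in range(1, num_facilities + 1)
        for combo in itertools.combinations(facilities, size)
    ]


# Two players who share facility 1.
feedback = play_round([frozenset({0, 1}), frozenset({1, 2})])
num_actions = len(all_actions(4))
```

In this sketch each player sees only the costs of the facilities in their own action, while the number of candidate actions grows exponentially with the number of facilities. That gap is why guarantees that scale with the number of players and facilities, rather than with the number of actions, matter.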

Code & Models

Repositories

lviano/sbgd-ce · Official

Videos

Semi Bandit Dynamics in Congestion Games: Convergence to Nash Equilibrium and No-Regret Guarantees · SlidesLive

Taxonomy

Topics: Advanced Bandit Algorithms Research · Auction Theory and Applications · Game Theory and Applications