Taming the Exponential Action Set: Sublinear Regret and Fast Convergence   to Nash Equilibrium in Online Congestion Games

Jing Dong; Jingyu Wu; Siwei Wang; Baoxiang Wang; Wei Chen

arXiv:2306.13673·cs.GT·June 27, 2023

Taming the Exponential Action Set: Sublinear Regret and Fast Convergence to Nash Equilibrium in Online Congestion Games

Jing Dong, Jingyu Wu, Siwei Wang, Baoxiang Wang, Wei Chen

PDF

Open Access

TL;DR

This paper introduces CongestEXP, a decentralized algorithm for online congestion games that achieves sublinear regret and fast convergence to Nash equilibrium by efficiently handling large action sets.

Contribution

It presents CongestEXP, a novel exponential weights-based method that reduces regret dependence on action set size and guarantees rapid convergence to Nash equilibrium in online congestion games.

Findings

01

Regret bound of O(kF√T) for each player

02

Linear scaling of regret with number of facilities F

03

Almost exponential convergence to Nash equilibrium

Abstract

The congestion game is a powerful model that encompasses a range of engineering systems such as traffic networks and resource allocation. It describes the behavior of a group of agents who share a common set of $F$ facilities and take actions as subsets with $k$ facilities. In this work, we study the online formulation of congestion games, where agents participate in the game repeatedly and observe feedback with randomness. We propose CongestEXP, a decentralized algorithm that applies the classic exponential weights method. By maintaining weights on the facility level, the regret bound of CongestEXP avoids the exponential dependence on the size of possible facility sets, i.e., $(k F) \approx F^{k}$ , and scales only linearly with $F$ . Specifically, we show that CongestEXP attains a regret upper bound of $O (k F T)$ for every individual player, where $T$ is the time horizon. On…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAuction Theory and Applications · Game Theory and Applications · Advanced Bandit Algorithms Research