Cycles and collusion in congestion games under Q-learning
Cesare Carissimo, Jan Nagler, Heinrich Nax

TL;DR
This paper analyzes how Q-learning dynamics in network routing games can lead to cycles or convergence to Nash equilibria, revealing challenges in regulation and collusion in such systems.
Contribution
It provides a comprehensive convergence analysis of Q-learning in generalized Braess paradox games, highlighting phenomena like cycling and the impact of meta-game incentives.
Findings
Q-learning can lead to cycling or convergence in routing games.
Meta-game incentives influence agent parameters and outcomes.
Results have implications for regulation and collusion in network games.
Abstract
We investigate the dynamics of Q-learning in a class of generalized Braess paradox games. These games represent an important class of network routing games where the associated stage-game Nash equilibria do not constitute social optima. We provide a full convergence analysis of Q-learning with varying parameters and learning rates. A wide range of phenomena emerges, broadly either settling into Nash or cycling continuously in ways reminiscent of "Edgeworth cycles" (i.e. jumping suddenly from Nash toward social optimum and then deteriorating gradually back to Nash). Our results reveal an important incentive incompatibility when thinking in terms of a meta-game being played by the designers of the individual Q-learners who set their agents' parameters. Indeed, Nash equilibria of the meta-game are characterized by heterogeneous parameters, and resulting outcomes achieve little to no…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Experimental Behavioral Economics Studies · Evolutionary Game Theory and Cooperation
MethodsSparse Evolutionary Training · Q-Learning
