The Bounds of Algorithmic Collusion; $Q$-learning, Gradient Learning, and the Folk Theorem

Galit Askenazi-Golan; Domenico Mergoni Cecchelli; Edward Plumb; Clemens Possnig

arXiv:2411.12725·cs.GT·March 4, 2026

The Bounds of Algorithmic Collusion; $Q$-learning, Gradient Learning, and the Folk Theorem

Galit Askenazi-Golan, Domenico Mergoni Cecchelli, Edward Plumb, Clemens Possnig

PDF

Open Access

TL;DR

This paper investigates the strategic behavior of learning agents in repeated games, revealing the potential for algorithmic collusion through various learning dynamics and establishing new convergence results for multi-agent Q-learning.

Contribution

It provides a comprehensive analysis of learning dynamics in repeated games, including the first convergence proof for multi-agent Q-learning, and characterizes the set of achievable payoffs, highlighting collusion possibilities.

Findings

01

Wide range of payoff vectors can be achieved by learning dynamics.

02

First convergence result for multi-agent Q-learning in repeated games.

03

Algorithmic collusion can emerge under various learning algorithms.

Abstract

We explore the behaviour emerging from learning agents repeatedly interacting strategically for a wide range of learning dynamics, including $Q$ -learning, projected gradient, replicator and log-barrier dynamics. Going beyond the better understood classes of potential games and zero-sum games, we consider the setting of a general repeated game with finite recall under different forms of monitoring. We obtain a Folk Theorem-style result and characterise the set of payoff vectors that can be obtained by these dynamics, discovering a wide range of possibilities for the emergence of algorithmic collusion. Achieving this requires a novel technical approach, which, to the best of our knowledge, yields the first convergence result for multi-agent $Q$ -learning algorithms in repeated games.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsExperimental Behavioral Economics Studies

MethodsSparse Evolutionary Training