No-Regret Learning in Games is Turing Complete
Gabriel P. Andrade, Rafael Frongillo, Georgios Piliouras

TL;DR
This paper proves that learning dynamics in simple game models can simulate any Turing machine, implying that predicting their long-term behavior is fundamentally undecidable.
Contribution
It demonstrates that the replicator dynamic in matrix games is Turing complete, revealing inherent computational complexity in learning in games.
Findings
Replicator dynamic is Turing complete in matrix games
Undecidability of reachability and equilibrium convergence problems
Implications for the limits of predicting learning outcomes in games
Abstract
Games are natural models for multi-agent machine learning settings, such as generative adversarial networks (GANs). The desirable outcomes from algorithmic interactions in these games are encoded as game theoretic equilibrium concepts, e.g. Nash and coarse correlated equilibria. As directly computing an equilibrium is typically impractical, one often aims to design learning algorithms that iteratively converge to equilibria. A growing body of negative results casts doubt on this goal, from non-convergence to chaotic and even arbitrary behaviour. In this paper we add a strong negative result to this list: learning in games is Turing complete. Specifically, we prove Turing completeness of the replicator dynamic on matrix games, one of the simplest possible settings. Our results imply the undecicability of reachability problems for learning algorithms in games, a special case of which is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGame Theory and Applications · Computability, Logic, AI Algorithms · Mathematical Biology Tumor Growth
