A Structural Threshold in Decision Capacity Governs Collapse in Self-Play Reinforcement Learning

Arahan Kujur

arXiv:2605.16315·cs.LG·May 19, 2026

A Structural Threshold in Decision Capacity Governs Collapse in Self-Play Reinforcement Learning

Arahan Kujur

PDF

TL;DR

This paper identifies a decision capacity threshold that determines whether self-play reinforcement learning agents collapse under perturbations, with the collapse being reversible and dependent on the preservation of certain decision points.

Contribution

It reveals a sharp threshold in decision capacity that governs collapse in self-play RL, supported by experiments across multiple games and algorithms.

Findings

01

Eliminating positive-reach decisions causes agents to collapse into a deterministic fixed point.

02

Preserving even one positive-reach decision prevents collapse.

03

Collapse severity scales with reach-weighted capacity.

Abstract

We show that a threshold in decision capacity determines whether self-play reinforcement learning agents collapse under asymmetric rule perturbations. Across poker variants, matrix games, a dice game, and multiple learning algorithms, eliminating all positive-reach contingent decisions causes rapid convergence to a deterministic exploitation attractor, a fixed point at near-maximal loss. Preserving even a single positive-reach contingent decision point prevents this collapse. A frozen baseline and fixed-opponent control confirm that the mechanism is co-adaptation under constraint, not the perturbation itself. The phenomenon is timing-invariant, fully reversible upon action restoration, and intensifies under function approximation. These results establish a sharp threshold at zero reach-weighted contingent action capacity, with severity scaling continuously via reach-weighted capacity in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.