Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria

Fengming Zhu; Fangzhen Lin

arXiv:2505.07008·cs.GT·October 16, 2025

Constant-Memory Strategies in Stochastic Games: Best Responses and Equilibria

Fengming Zhu, Fangzhen Lin

PDF

Open Access

TL;DR

This paper explores constant-memory strategies in stochastic games, analyzing their best responses and equilibria, and verifies theoretical insights through experiments on classic multi-agent decision-making testbeds.

Contribution

It provides new theoretical results on best responses and Nash equilibria for constant-memory strategies and discusses their computational hardness, supported by empirical experiments.

Findings

01

Best response computations are computationally hard.

02

Nash equilibria for constant-memory strategies are characterized.

03

Experimental results validate theoretical insights on testbeds.

Abstract

Stochastic games have become a prevalent framework for studying long-term multi-agent interactions, especially in the context of multi-agent reinforcement learning. In this work, we comprehensively investigate the concept of constant-memory strategies in stochastic games. We first establish some results on best responses and Nash equilibria for behavioral constant-memory strategies, followed by a discussion on the computational hardness of best responding to mixed constant-memory strategies. Those theoretic insights are later verified on several sequential decision-making testbeds, including the $Iterated Prisoner’s Dilemma$ , the $Iterated Traveler’s Dilemma$ , and the $Pursuit$ domain. This work aims to enhance the understanding of theoretical issues in single-agent planning under multi-agent systems, and uncover the connection between decision models in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsReinforcement Learning in Robotics · Game Theory and Applications · Advanced Bandit Algorithms Research