On the Convergence of Bounded Agents

David Abel; Andr\'e Barreto; Hado van Hasselt; Benjamin Van Roy; Doina; Precup; Satinder Singh

arXiv:2307.11044·cs.LG·July 21, 2023

On the Convergence of Bounded Agents

David Abel, Andr\'e Barreto, Hado van Hasselt, Benjamin Van Roy, Doina, Precup, Satinder Singh

PDF

Open Access

TL;DR

This paper explores new definitions of agent convergence in reinforcement learning, focusing on bounded agents and their internal state stability, providing theoretical insights into when an agent's behavior stabilizes.

Contribution

It introduces two novel, formal definitions of convergence for bounded agents and analyzes their properties and relationships, clarifying a key concept in reinforcement learning theory.

Findings

01

The first definition relates convergence to the minimal number of states needed for future behavior.

02

The second definition ties convergence to the stability of the agent's performance relative to internal state changes.

03

Both definitions align with standard views in typical reinforcement learning settings.

Abstract

When has an agent converged? Standard models of the reinforcement learning problem give rise to a straightforward definition of convergence: An agent converges when its behavior or performance in each environment state stops changing. However, as we shift the focus of our learning problem from the environment's state to the agent's state, the concept of an agent's convergence becomes significantly less clear. In this paper, we propose two complementary accounts of agent convergence in a framing of the reinforcement learning problem that centers around bounded agents. The first view says that a bounded agent has converged when the minimal number of states needed to describe the agent's future behavior cannot decrease. The second view says that a bounded agent has converged just when the agent's performance only changes if the agent's internal state changes. We establish basic properties…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComputability, Logic, AI Algorithms · Evolutionary Algorithms and Applications · Game Theory and Applications

MethodsFocus