Rationality Measurement and Theory for Reinforcement Learning Agents

Kejiang Qian; Amos Storkey; Fengxiang He

arXiv:2602.04737·cs.LG·May 5, 2026

Rationality Measurement and Theory for Reinforcement Learning Agents

Kejiang Qian, Amos Storkey, Fengxiang He

PDF

1 Repo

TL;DR

This paper introduces rationality measures and a theoretical framework for reinforcement learning agents, analyzing their performance and generalisability in dynamic environments.

Contribution

It defines new metrics for rationality in RL, decomposes rational risk gap, and links it to environment shifts and model complexity, supported by experiments.

Findings

01

Rational risk gap decomposes into environment shift and algorithmic generalisability.

02

Regularisers and domain randomisation can improve rationality.

03

Environment shifts can harm RL agent performance.

Abstract

This paper proposes a suite of rationality measures and associated theory for reinforcement learning agents, a property increasingly critical yet rarely explored. We define an action in deployment to be perfectly rational if it maximises the hidden true value function in the steepest direction. The expected value discrepancy of a policy's actions against their rational counterparts, culminating over the trajectory in deployment, is defined to be expected rational risk; an empirical average version in training is also defined. Their difference, termed as rational risk gap, is decomposed into (1) an extrinsic component caused by environment shifts between training and deployment, and (2) an intrinsic one due to the algorithm's generalisability in a dynamic environment. They are upper bounded by, respectively, (1) the $1$ -Wasserstein distance between transition kernels and initial state…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

EVIEHub/Rationality
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.