Resource-rational reinforcement learning and sensorimotor causal states,   and resource-rational maximiners

Sarah Marzen

arXiv:2404.18775·q-bio.NC·March 21, 2025

Resource-rational reinforcement learning and sensorimotor causal states, and resource-rational maximiners

Sarah Marzen

PDF

Open Access

TL;DR

This paper introduces a new computational objective combining reinforcement learning, rate-distortion theory, and causal states to evaluate and benchmark biological and artificial agents' resource-rationality in complex environments.

Contribution

It proposes a novel framework and algorithm for assessing reward-rate optimization and resource-rationality, including the concept of reward-rate manifolds and maximin strategies.

Findings

01

Introduction of reward-rate manifold as a benchmark

02

Proposal of a new algorithm for evaluating resource-rationality

03

Discussion of biological organisms as approximate maximiners

Abstract

We propose a new computational-level objective function for theoretical biology and theoretical neuroscience that combines: reinforcement learning, the study of learning with feedback via rewards; rate-distortion theory, a branch of information theory that deals with compressing signals to retain relevant information; and computational mechanics, the study of minimal sufficient statistics of prediction also known as causal states. We highlight why this proposal is likely only an approximation, but is likely to be an interesting one, and propose a new algorithm for evaluating it to obtain the newly-coined ``reward-rate manifold''. The performance of real and artificial agents in partially observable environments can be newly benchmarked using these reward-rate manifolds. Finally, we describe experiments that can probe whether or not biological organisms are resource-rational…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEEG and Brain-Computer Interfaces