Illuminating the Three Dogmas of Reinforcement Learning under Evolutionary Light

Mani Hamidi; Terrence W. Deacon

arXiv:2507.11482·cs.AI·July 30, 2025

Illuminating the Three Dogmas of Reinforcement Learning under Evolutionary Light

Mani Hamidi, Terrence W. Deacon

PDF

TL;DR

This paper critically examines three foundational beliefs in reinforcement learning, proposing an evolutionary framework to revise these dogmas and explore implications for biological learning and agency.

Contribution

It introduces an evolutionary-inspired framework to reconsider core RL dogmas, linking biological evolution, agency, and multi-objective optimization.

Findings

01

Evolutionary dynamics can operate within individual brains.

02

Enrichment of the adaptation view of learning through evolutionary insights.

03

Analogies from evolution clarify the limits of the reward hypothesis.

Abstract

Three core tenets of reinforcement learning (RL)--concerning the definition of agency, the objective of learning, and the scope of the reward hypothesis--have been highlighted as key targets for conceptual revision, with major implications for theory and application. We propose a framework, inspired by open-ended evolutionary theory, to reconsider these three "dogmas." We revisit each assumption and address related concerns raised alongside them. To make our arguments relevant to RL as a model of biological learning, we first establish that evolutionary dynamics can plausibly operate within living brains over an individual's lifetime, and are not confined to cross-generational processes. We begin by revisiting the second dogma, drawing on evolutionary insights to enrich the "adaptation-rather-than-search" view of learning. We then address the third dogma regarding the limits of the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.