Illuminating the Three Dogmas of Reinforcement Learning under Evolutionary Light
Mani Hamidi, Terrence W. Deacon

TL;DR
This paper critically examines three foundational beliefs in reinforcement learning, proposing an evolutionary framework to revise these dogmas and explore implications for biological learning and agency.
Contribution
It introduces an evolutionary-inspired framework to reconsider core RL dogmas, linking biological evolution, agency, and multi-objective optimization.
Findings
Evolutionary dynamics can operate within individual brains.
Enrichment of the adaptation view of learning through evolutionary insights.
Analogies from evolution clarify the limits of the reward hypothesis.
Abstract
Three core tenets of reinforcement learning (RL)--concerning the definition of agency, the objective of learning, and the scope of the reward hypothesis--have been highlighted as key targets for conceptual revision, with major implications for theory and application. We propose a framework, inspired by open-ended evolutionary theory, to reconsider these three "dogmas." We revisit each assumption and address related concerns raised alongside them. To make our arguments relevant to RL as a model of biological learning, we first establish that evolutionary dynamics can plausibly operate within living brains over an individual's lifetime, and are not confined to cross-generational processes. We begin by revisiting the second dogma, drawing on evolutionary insights to enrich the "adaptation-rather-than-search" view of learning. We then address the third dogma regarding the limits of the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
