Reward is not enough: can we liberate AI from the reinforcement learning   paradigm?

Vacslav Glukhov

arXiv:2202.03192·cs.AI·November 12, 2024

Reward is not enough: can we liberate AI from the reinforcement learning paradigm?

Vacslav Glukhov

PDF

Open Access

TL;DR

This paper argues that reinforcement learning, centered on reward maximization, is insufficient to explain or develop true intelligence, emphasizing the need for broader frameworks in AI development.

Contribution

It challenges the dominance of reward-based paradigms in AI, advocating for alternative approaches to understanding and creating intelligent systems.

Findings

01

Reward maximization does not fully account for intelligence.

02

Reinforcement learning is an incomplete framework for AI.

03

Broader models are necessary for safe and robust AI development.

Abstract

I present arguments against the hypothesis put forward by Silver, Singh, Precup, and Sutton ( https://www.sciencedirect.com/science/article/pii/S0004370221000862 ) : reward maximization is not enough to explain many activities associated with natural and artificial intelligence including knowledge, learning, perception, social intelligence, evolution, language, generalisation and imitation. I show such reductio ad lucrum has its intellectual origins in the political economy of Homo economicus and substantially overlaps with the radical version of behaviourism. I show why the reinforcement learning paradigm, despite its demonstrable usefulness in some practical application, is an incomplete framework for intelligence -- natural and artificial. Complexities of intelligent behaviour are not simply second-order complications on top of reward maximisation. This fact has profound implications…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsComplex Systems and Time Series Analysis