Learning Optimal Behavior Through Reasoning and Experiences

Cosmin Ilut; Rosen Valchev

arXiv:2403.18185·econ.TH·March 28, 2024·3 cites

Learning Optimal Behavior Through Reasoning and Experiences

Cosmin Ilut, Rosen Valchev

PDF

Open Access

TL;DR

This paper introduces a new framework for learning optimal behavior that combines reasoning and experience, accounting for cognitive costs and uncertainty in decision-making.

Contribution

It develops a Bayesian non-parametric model integrating deliberative reasoning and experiential learning under bounded rationality with cognitive frictions.

Findings

01

Agents use Bayesian estimation of action values from reasoning and experience.

02

Subjective uncertainty influences the balance between reasoning and experience.

03

The framework bridges insights from economics and cognitive science on reinforcement learning.

Abstract

We develop a novel framework of bounded rationality under cognitive frictions that studies learning over optimal behavior through both deliberative reasoning and accumulated experiences. Using both types of information, agents engage in Bayesian non-parametric estimation of the unknown action value function. Reasoning signals are produced internally through mental deliberation, subject to a cognitive cost. Experience signals are the observed utility outcomes at previous actions. Agents' subjective estimation uncertainty, which evolves through information accumulation, modulates the two modes of learning in a state- and history-dependent way. We discuss how the model draws on and bridges conceptual, methodological and empirical insights from both economics and the cognitive sciences literature on reinforcement learning.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEvolutionary Algorithms and Applications