Sequential sampling without comparison to boundary through model-free reinforcement learning

Jamal Esmaily, Rani Moran, Yasser Roudi, and Bahador Bahrami

arXiv:2408.06080·cs.NE·August 13, 2024

Open Access

TL;DR

This paper introduces a model-free reinforcement learning approach for perceptual decision-making that eliminates the need for decision boundaries and evidence accumulation, explaining behavioral data without traditional boundary models.

Contribution

It presents a novel reinforcement learning algorithm that learns decision policies directly from evidence, bypassing the boundary concept and unifying learning and decision-making.

Findings

1. Reproduces key features of perceptual decision-making.

2. Accounts for behavior during training and after stabilization.

3. Offers a new perspective on decision process modeling.

Abstract

Although the evidence-integration-to-boundary model has successfully explained a wide range of behavioral and neural data in decision making under uncertainty, how animals learn and optimize the boundary remains unresolved. Here, we propose a model-free reinforcement learning algorithm for perceptual decisions under uncertainty that dispenses entirely with the concepts of decision boundary and evidence accumulation. Our model learns whether to commit to a decision given the available evidence or continue sampling information at a cost. We reproduced the canonical features of perceptual decision-making, such as the dependence of accuracy and reaction time on evidence strength, modulation of the speed-accuracy trade-off by payoff regime, and many others. By unifying learning and decision making within the same framework, this model can account for unstable behavior during training as well as…
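
The commit-or-keep-sampling idea in the abstract can be illustrated with a small sketch. This is not the authors' algorithm: it swaps in plain tabular Q-learning as a generic model-free method, and every name and number here (the Gaussian evidence samples, the bin edges, `wait_cost`, the 0/1 reward, the forced choice at `max_steps`) is a hypothetical choice made for illustration. The state is only the current noisy sample, so the agent neither accumulates evidence nor compares anything to an explicit boundary.

```python
import numpy as np

LEFT, RIGHT, WAIT = 0, 1, 2  # hypothetical action coding


def observe(rng, direction, strength, edges):
    """Draw one noisy evidence sample and return its bin index (the state)."""
    sample = rng.normal(direction * strength, 1.0)
    return int(np.digitize(sample, edges))


def run_trial(q, rng, strength, edges, epsilon=0.1, alpha=0.1,
              gamma=1.0, wait_cost=0.05, max_steps=50, learn=True):
    """Run one trial; return (correct, number of samples taken)."""
    direction = int(rng.choice([-1, 1]))  # hidden correct answer
    state = observe(rng, direction, strength, edges)
    for step in range(1, max_steps + 1):
        # On the final step only LEFT/RIGHT are allowed (forced commitment).
        n_allowed = 3 if step < max_steps else 2
        if rng.random() < epsilon:
            action = int(rng.integers(n_allowed))
        else:
            action = int(np.argmax(q[state, :n_allowed]))

        if action == WAIT:
            # Pay the sampling cost, see a fresh sample, bootstrap from it.
            next_state = observe(rng, direction, strength, edges)
            n_next = 3 if step + 1 < max_steps else 2
            target = -wait_cost + gamma * np.max(q[next_state, :n_next])
            if learn:
                q[state, WAIT] += alpha * (target - q[state, WAIT])
            state = next_state
        else:
            # Commit: reward 1 for a correct choice, 0 otherwise.
            correct = bool((action == RIGHT) == (direction == 1))
            reward = 1.0 if correct else 0.0
            if learn:
                q[state, action] += alpha * (reward - q[state, action])
            return correct, step


def train(n_trials=20000, strengths=(0.2, 0.5, 1.0), seed=0):
    """Learn a Q-table over (evidence bin, action) with mixed strengths."""
    rng = np.random.default_rng(seed)
    edges = np.linspace(-3.0, 3.0, 13)
    q = np.zeros((len(edges) + 1, 3))
    for _ in range(n_trials):
        run_trial(q, rng, float(rng.choice(strengths)), edges)
    return q, edges


def evaluate(q, edges, strength, n_trials=2000, seed=1):
    """Greedy evaluation: mean accuracy and mean number of samples."""
    rng = np.random.default_rng(seed)
    results = [run_trial(q, rng, strength, edges, epsilon=0.0, learn=False)
               for _ in range(n_trials)]
    accuracy = float(np.mean([c for c, _ in results]))
    mean_samples = float(np.mean([s for _, s in results]))
    return accuracy, mean_samples


if __name__ == "__main__":
    q, edges = train()
    for strength in (0.2, 0.5, 1.0):
        print(strength, evaluate(q, edges, strength))
```

Running the script prints accuracy and the mean number of samples for each evidence strength, which gives a rough way to inspect how a learned policy of this kind trades speed against accuracy; changing `wait_cost` or the reward values plays the role of a payoff regime in this toy setting.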

Taxonomy

Topics: Advanced Statistical Process Monitoring · Machine Learning and Algorithms