Cognitive psychology: computing the value of the choices we do not make

Inti A. Brazil

PMC · DOI:10.1038/s44271-024-00064-x·March 2, 2024

Cognitive psychology: computing the value of the choices we do not make

Inti A. Brazil

PDF

Open Access

TL;DR

This paper explores how people evaluate choices they didn't make using reinforcement learning and computational models.

Contribution

The study introduces a new method to analyze unchosen action values using computational modeling.

Findings

01

Researchers found that people update the value of unchosen actions during decision-making.

02

The study used computational models to track how choices and their alternatives are evaluated.

Abstract

Reflecting on choices we did make and those we could have made is very common. In a recent study in Science Advances, researchers used a reinforcement learning paradigm together with computational modeling to study the processes underlying the value update of unchosen actions.

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases1

psychopathic

Figures1

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health Research Topics · Decision-Making and Behavioral Economics · Neural and Behavioral Psychology Studies

Full text

BackyardBest for Alamy

Research on decision-making suggests that we use feedback to assign values to different choice options. We tend to act in ways that are routinely associated with favorable outcomes and avoid behaviors leading to poor outcomes. But how do we learn the value of unchosen options for which we do not obtain feedback, given that they represent hypothetical scenarios that never occurred?

A recent study by Ben-Artzi and colleagues^1^ moves us closer towards obtaining an answer. Participants performed a multi-armed bandit task, in which they were presented with 2 (out of 4 cards) on each trial and had to learn which card to choose in order to obtain the highest monetary gain. By combining a clever task-design with the use of computational models, they were able to estimate the value assigned to cards that were not chosen and elucidate a possible underlying mechanism.

One key finding was that the values of the unchosen options were updated relative to the value of the chosen option, by integrating the history of outcomes for choosing a particular card with that of rejecting the other cards. The findings suggest that choosing an option that leads to a favorable outcome may reinforce the avoidance of alternative choices, and the value of each unchosen option is updated even when this is not required to perform the task.

Such discoveries could inspire studies in (patient) populations associated with disturbances in reinforcement-based decision-making. For example, selective reinforcement learning impairments have been observed in offenders with strong psychopathic tendencies (e.g., callousness, recklessness)^2^, who often fail to avoid choices that previously led to rewards but no longer do. Ben-Artzi et al.’s^1^ results could suggest that, rather than failing to update information pertaining to the choice made, such individuals perhaps fail to detect when alternative choices have become more favorable and, therefore, keep making poor choices.

This study sheds new light on the mechanisms of counterfactual thinking and could also new avenues for research in other subfields of psychology.

Bibliography1

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Ben-Artzi, I., Kessler, Y., Nicenboim, B. & Shahar, N. Computational mechanisms underlying latent value updating of unchosen actions. Sci. Adv.9, eadi 2704 (2023).10.1126/sciadv.adi 2704 PMC 1058894737862419 · doi ↗ · pubmed ↗