Adaptive coordination of working-memory and reinforcement learning in   non-human primates performing a trial-and-error problem solving task

Guillaume Viejo (ISIR); Beno\^it Girard (ISIR); Emmanuel Procyk; Mehdi; Khamassi (ISIR)

arXiv:1711.00698·cs.AI·April 30, 2019

Adaptive coordination of working-memory and reinforcement learning in non-human primates performing a trial-and-error problem solving task

Guillaume Viejo (ISIR), Beno\^it Girard (ISIR), Emmanuel Procyk, Mehdi, Khamassi (ISIR)

PDF

1 Repo

TL;DR

This study investigates how non-human primates combine reinforcement learning and working memory during a trial-and-error problem-solving task, revealing individual differences and potential species-specific coordination dynamics.

Contribution

It introduces a computational model combining RL and WM to explain primate behavior, highlighting inter-individual variability and implications for cross-species comparisons.

Findings

01

Monkeys' behavior is better explained by combined RL and WM models.

02

Different monkeys exhibit distinct coordination dynamics between RL and WM.

03

Pretraining may influence the dominance of certain RL-WM coordination patterns.

Abstract

Accumulating evidence suggest that human behavior in trial-and-error learning tasks based on decisions between discrete actions may involve a combination of reinforcement learning (RL) and working-memory (WM). While the understanding of brain activity at stake in this type of tasks often involve the comparison with non-human primate neurophysiological results, it is not clear whether monkeys use similar combined RL and WM processes to solve these tasks. Here we analyzed the behavior of five monkeys with computational models combining RL and WM. Our model-based analysis approach enables to not only fit trial-by-trial choices but also transient slowdowns in reaction times, indicative of WM use. We found that the behavior of the five monkeys was better explained in terms of a combination of RL and WM despite inter-individual differences. The same coordination dynamics we used in a previous…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

gviejo/Gohal
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsSoftmax · Q-Learning