Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning

Guillaume Viejo; Mehdi Khamassi; Andrea Brovelli; Benoît Girard

doi:10.3389/fnbeh.2015.00225

Article Dans Une Revue Frontiers in Behavioral Neuroscience Année : 2015

Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning

(1, 2) , (1, 2) , (3) , (1, 2)

1
2
3

Guillaume Viejo

Fonction : Auteur correspondant
PersonId : 971628

Connectez-vous pour contacter l'auteur

Institut des Systèmes Intelligents et de Robotique

AMAC

Mehdi Khamassi

Fonction : Auteur
PersonId : 186
IdHAL : mehdi-khamassi
ORCID : 0000-0002-2515-1046
IdRef : 12845072X

Institut des Systèmes Intelligents et de Robotique

AMAC

Andrea Brovelli

Fonction : Auteur
PersonId : 184498
IdHAL : andrea-brovelli
ORCID : 0000-0002-5342-1330
IdRef : 204209064

Institut de Neurosciences de la Timone

Benoît Girard

Fonction : Auteur
PersonId : 1537
IdHAL : benoit-girard
ORCID : 0000-0002-8117-7064
IdRef : 089381092

Institut des Systèmes Intelligents et de Robotique

AMAC

Résumé

Current learning theory provides a comprehensive description of how humans and other animals learn, and places behavioral flexibility and automaticity at heart of adaptive behaviors. However, the computations supporting the interactions between goal-directed and habitual decision-making systems are still poorly understood. Previous functional magnetic resonance imaging (fMRI) results suggest that the brain hosts complementary computations that may differentially support goal-directed and habitual processes in the form of a dynamical interplay rather than a serial recruitment of strategies. To better elucidate the computations underlying flexible behavior, we develop a dual-system computational model that can predict both performance (i.e., participants' choices) and modulations in reaction times during learning of a stimulus–response association task. The habitual system is modeled with a simple Q-Learning algorithm (QL). For the goal-directed system, we propose a new Bayesian Working Memory (BWM) model that searches for information in the history of previous trials in order to minimize Shannon entropy. We propose a model for QL and BWM coordination such that the expensive memory manipulation is under control of, among others, the level of convergence of the habitual learning. We test the ability of QL or BWM alone to explain human behavior, and compare them with the performance of model combinations, to highlight the need for such combinations to explain behavior. Two of the tested combination models are derived from the literature, and the latter being our new proposal. In conclusion, all subjects were better explained by model combinations, and the majority of them are explained by our new coordination proposal.

Mots clés

behavior action selection decision-making working-memory reinforcement learning reaction times multi-objective optimization

Domaines

Neurosciences [q-bio.NC]

Fichier principal

fnbeh-09-00225.pdf (1.08 Mo)

Origine : Publication financée par une institution

Gestionnaire HAL-UPMC : Connectez-vous pour contacter le contributeur

https://hal.sorbonne-universite.fr/hal-01215419

Soumis le : mercredi 14 octobre 2015-10:52:38

Dernière modification le : mardi 16 avril 2024-10:50:15

Archivage à long terme le : vendredi 5 mai 2017-13:32:41

Dates et versions

hal-01215419 , version 1 (14-10-2015)

Licence

Paternité

Identifiants

HAL Id : hal-01215419 , version 1
DOI : 10.3389/fnbeh.2015.00225

Citer

Guillaume Viejo, Mehdi Khamassi, Andrea Brovelli, Benoît Girard. Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning. Frontiers in Behavioral Neuroscience, 2015, 9, pp.225. ⟨10.3389/fnbeh.2015.00225⟩. ⟨hal-01215419⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

UPMC CNRS UNIV-AMU ISIR INT SORBONNE-UNIVERSITE SU-SCIENCES ANR ISIR_AMAC

166 Consultations

155 Téléchargements

Modeling choice and reaction time during arbitrary visuomotor learning through the coordination of adaptive working memory and reinforcement learning

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager