s'authentifier
version française rss feed
HAL : hal-00688928, version 1

Fiche détaillée  Récupérer au format
Frontiers in neuroscience 6, 9 (2012) 1-14
Dopaminergic control of the exploration-exploitation trade-off via the basal ganglia
Mark D. Humphries 1, 2, Mehdi Khamassi ( ) 3, Kevin Gurney 2
(06/02/2012)

We continuously face the dilemma of choosing between actions that gather new information or actions that exploit existing knowledge. This "exploration-exploitation" trade-off depends on the environment: stability favours exploiting knowledge to maximise gains; volatility favours exploring new options and discovering new outcomes. Here we set out to reconcile recent evidence for dopamine's involvement in the exploration-exploitation trade-off with the existing evidence for basal ganglia control of action selection, by testing the hypothesis that the tonic dopamine in the striatum, the basal ganglia's input nucleus, sets the current exploration-exploitation tradeoff. We first advanced the idea of interpreting the basal ganglia output as a probability distribution function for action selection. Using computational models of the full basal ganglia circuit, we showed that, under this interpretation, the actions of dopamine within the striatum change the basal ganglia's output to favour the level of exploration or exploitation encoded in the probability distribution. We also found that our models predict striatal dopamine controls the exploration-exploitation trade-off if we instead read out the probability distribution from the target nuclei of the basal ganglia, where their inhibitory input shapes the cortical input to these nuclei. Finally, by integrating the basal ganglia within a reinforcement learning model, we showed ho dopamine's effect on the exploration-exploitation trade-off could be measurable in a forced two-choice task. These simulations also showed how tonic dopamine can appear to affect learning while only directly altering the trade-off. Thus, our models support the hypothesis that changes in tonic dopamine within the striatum can alter the exploration-exploitation trade-off by modulating the output of the basal ganglia.
1 :  Laboratoire de Neurosciences cognitives
INSERM : U960 – Ecole normale supérieure de Paris - ENS Paris
2 :  Psychology Department
University of Sheffield
3 :  Institut des Systèmes Intelligents et Robotique (ISIR)
CNRS : UMR7222 – Université Pierre et Marie Curie [UPMC] - Paris VI
Group for Neural Theory
Architectures et Modèles pour l'Adaptation et la Cognition
Adaptive Behaviour Research Group
Sciences du Vivant/Neurosciences
reinforcement learning – meta-parameters – decision making – reward – uncertainty
Liste des fichiers attachés à ce document : 
PDF
Humphries2012_dopamineExploration.pdf(1 MB)