Respective Advantages and Disadvantages of Model-based and Model-free Reinforcement Learning in a Robotics Neuro-inspired Cognitive Architecture - Sorbonne Université Accéder directement au contenu
Article Dans Une Revue Procedia Computer Science Année : 2015

Respective Advantages and Disadvantages of Model-based and Model-free Reinforcement Learning in a Robotics Neuro-inspired Cognitive Architecture

Résumé

Combining model-based and model-free reinforcement learning systems in robotic cognitive architectures appears as a promising direction to endow artificial agents with flexibility and decisional autonomy close to mammals. In particular, it could enable robots to build an internal model of the environment, plan within it in response to detected environmental changes, and avoid the cost and time of planning when the stability of the environment is recognized as enabling habit learning. However, previously proposed criteria for the coordination of these two learning systems do not scale up to the large, partial and uncertain models autonomously learned by robots. Here we precisely analyze the performances of these two systems in an asynchronous robotic simulation of a cube-pushing task requiring a permanent trade-off between speed and accuracy. We propose solutions to make learning successful in these conditions. We finally discuss possible criteria for their efficient coordination within robotic cognitive architectures.
Fichier principal
Vignette du fichier
1-s2.0-S1877050915036558-main.pdf (512.91 Ko) Télécharger le fichier
Origine : Publication financée par une institution
Loading...

Dates et versions

hal-01250157 , version 1 (04-01-2016)

Licence

Paternité - Pas d'utilisation commerciale - Pas de modification

Identifiants

Citer

Erwan Renaudo, Benoît Girard, Raja Chatila, Mehdi Khamassi. Respective Advantages and Disadvantages of Model-based and Model-free Reinforcement Learning in a Robotics Neuro-inspired Cognitive Architecture. Procedia Computer Science, 2015, 71, pp.178-184. ⟨10.1016/j.procs.2015.12.194⟩. ⟨hal-01250157⟩
168 Consultations
126 Téléchargements

Altmetric

Partager

Gmail Facebook X LinkedIn More