167 articles – 36 Notices  [english version]
HAL : hal-00703755, version 1

Fiche détaillée  Récupérer au format
Robotica 2012, Guimaraes : Portugal (2012)
Towards fast and adaptive optimal control policies for robots: A direct policy search approach
Didier Marin ( ) 1, Olivier Sigaud 1
(2012)

Optimal control methods are generally too expensive to be applied on-line and in real-time to the control of robots. An alternative method consists in tuning a parametrized reactive controller so that it converges to optimal behavior. In this paper we present such a method based on the "direct Policy Search" paradigm to get a cost-efficient control policy for a simulated two degrees-of-freedom planar arm actuated by six muscles. We learn a parametric controller from demonstration using a few near-optimal trajectories. Then we tune the parameters of this controller using two versions of a Cross-Entropy Policy Search method that we compare. Finally, we show that the resulting controller is 20000 times faster than an optimal control method producing the same trajectories.
1 :  Institut des Systèmes Intelligents et Robotique (ISIR)
CNRS : UMR7222 – Université Pierre et Marie Curie [UPMC] - Paris VI
Informatique/Intelligence artificielle