scholarly journals Compound Heuristic Information Guided Policy Improvement for Robot Motor Skill Acquisition

2020 ◽  
Vol 10 (15) ◽  
pp. 5346
Author(s):  
Jian Fu ◽  
Cong Li ◽  
Xiang Teng ◽  
Fan Luo ◽  
Boqun Li

Discovering the implicit pattern and using it as heuristic information to guide the policy search is one of the core factors to speed up the procedure of robot motor skill acquisition. This paper proposes a compound heuristic information guided reinforcement learning algorithm PI2-CMA-KCCA for policy improvement. Its structure and workflow are similar to a double closed-loop control system. The outer loop realized by Kernel Canonical Correlation Analysis (KCCA) infers the implicit nonlinear heuristic information between the joints of the robot. In addition, the inner loop operated by Covariance Matrix Adaptation (CMA) discovers the hidden linear correlations between the basis functions within the joint of the robot. These patterns which are good for learning the new task can automatically determine the mean and variance of the exploring perturbation for Path Integral Policy Improvement (PI2). Compared with classical PI2, PI2-CMA, and PI2-KCCA, PI2-CMA-KCCA can not only endow the robot with the ability to realize transfer learning of trajectory planning from the demonstration to the new task, but also complete it more efficiently. The classical via-point experiments based on SCARA and Swayer robots have validated that the proposed method has fast learning convergence and can find a solution for the new task.

1997 ◽  
Vol 61 (1) ◽  
pp. 56-65 ◽  
Author(s):  
GW Knight ◽  
PJ Guenzel ◽  
P Feil

2008 ◽  
Author(s):  
Michelle V. Thompson ◽  
Janet L. Utschig ◽  
Mikaela K. Vaughan ◽  
Marc V. Richard ◽  
Benjamin A. Clegg

2013 ◽  
Vol 45 (6) ◽  
pp. 531-549 ◽  
Author(s):  
Ashvin Shah ◽  
Andrew G. Barto ◽  
Andrew H. Fagg

1994 ◽  
Vol 11 (2) ◽  
pp. 214-235 ◽  
Author(s):  
Cheryl Missiuna

Children with developmental coordination disorder (DCD) demonstrate coordination difficulties during the learning of novel motor skills; no previous studies, however, have investigated their ability to learn and then generalize a new movement. This study compared 24 young children with DCD with 24 age-matched control children (AMC) during the early stages of learning a simple aiming task. Children with DCD were found to perform more poorly than their peers on measures of acquired motor skill, and to react and move more slowly at every level of task performance. The effect of age and its relationship to practice of the task was also different within each group. The groups did not differ, however, in their rate of learning, or in the extent to which they were able to generalize the learned movement. Children with DCD sacrificed more speed than the AMC group when aiming at a small target, but the effects of amplitude and directional changes were quite similar for each group. The implications of these findings are discussed.


Sign in / Sign up

Export Citation Format

Share Document