An Adaptive Policy Evaluation Network Based on Recursive Least Squares Temporal Difference With Gradient Correction
Keyword(s):
Kernel Recursive Least-Squares Temporal Difference Algorithms with Sparsification and Regularization
2016 ◽
Vol 2016
◽
pp. 1-11
◽
2016 ◽
Vol 27
(4)
◽
pp. 771-782
◽
2017 ◽
Vol 31
(S2)
◽
pp. 1013-1028
◽
Keyword(s):