Two Time-Scale Stochastic Approximation with Controlled Markov Noise and Off-Policy Temporal-Difference Learning
2018 ◽
Vol 43
(1)
◽
pp. 130-151
◽
Keyword(s):
2001 ◽
Vol 13
(10)
◽
pp. 2221-2237
◽
2008 ◽
Vol 7
(4)
◽
pp. 432-442
◽