A Two-Timescale Simulation-Based Gradient Algorithm for Weighted Cost Markov Decision Processes
Keyword(s):
2006 ◽
Vol 45
(5)
◽
pp. 1633-1656
◽
Keyword(s):
2007 ◽
Vol 7
(1)
◽
pp. 59-92
◽
2001 ◽
Vol 31
(6)
◽
pp. 609-622
◽
Keyword(s):
2019 ◽
Vol 36
(06)
◽
pp. 1940009