The complexity of Policy Iteration is exponential for discounted Markov Decision Processes
2008 ◽
Vol 339
(1)
◽
pp. 691-704
◽
Keyword(s):
1986 ◽
Vol 13
(4)
◽
pp. 411-420
◽
The policy iteration algorithm for average reward Markov decision processes with general state space
1997 ◽
Vol 42
(12)
◽
pp. 1663-1680
◽
2010 ◽
Vol 21
(8)
◽
pp. 1270-1280
◽
2019 ◽
Vol 105
◽
pp. 287-304
◽
Keyword(s):
2003 ◽
Vol 28
(1)
◽
pp. 194-200
◽
Keyword(s):
2016 ◽
Vol 133
(10)
◽
pp. 28-33
◽