A new policy iteration scheme for Markov decision processes using Schweitzer's formula
Keyword(s):
One Step
◽
Given a family of Markov chains with a single recurrent class, we present a potential application of Schweitzer's exact formula relating the steady-state probability and fundamental matrices of any two chains in the family. We propose a new policy iteration scheme for Markov decision processes where in contrast to policy iteration, the new criterion for selecting an action ensures the maximal one-step average cost improvement. Its computational complexity and storage requirement are analysed.
1994 ◽
Vol 31
(01)
◽
pp. 268-273
◽
2008 ◽
Vol 339
(1)
◽
pp. 691-704
◽
Keyword(s):
1986 ◽
Vol 13
(4)
◽
pp. 411-420
◽