Blackwell optimality in the class of all policies in Markov decision chains with a Borel state space and unbounded rewards

1999 ◽  
Vol 50 (3) ◽  
pp. 421-448 ◽  
Author(s):  
Arie Hordijk ◽  
Alexander A. Yushkevich
1991 ◽  
Vol 28 (1) ◽  
pp. 29-46 ◽  
Author(s):  
Onésimo Hernández-Lerma ◽  
RaÚl Montes-De-Oca ◽  
Rolando Cavazos-Cadena

1991 ◽  
Vol 5 (4) ◽  
pp. 463-475 ◽  
Author(s):  
Linn I. Sennott

A Markov decision chain with countable state space incurs two types of costs: an operating cost and a holding cost. The objective is to minimize the expected discounted operating cost, subject to a constraint on the expected discounted holding cost. The existence of an optimal randomized simple policy is proved. This is a policy that randomizes between two stationary policies, that differ in at most one state. Several examples from the control of discrete time queueing systems are discussed.


Sign in / Sign up

Export Citation Format

Share Document