optimality inequality Latest Research Papers

In this paper we study discrete-time Markov decision processes with Borel state and action spaces. The criterion is to minimize average expected costs, and the costs may have neither upper nor lower bounds. We first provide two average optimality inequalities of opposing directions and give conditions for the existence of solutions to them. Then, using the two inequalities, we ensure the existence of an average optimal (deterministic) stationary policy under additional continuity-compactness assumptions. Our conditions are slightly weaker than those in the previous literature. Also, some new sufficient conditions for the existence of an average optimal stationary policy are imposed on the primitive data of the model. Moreover, our approach is slightly different from the well-known ‘optimality inequality approach’ widely used in Markov decision processes. Finally, we illustrate our results in two examples.

Download Full-text

Average optimality for Markov decision processes in borel spaces: a new condition and approach

Journal of Applied Probability ◽

10.1239/jap/1152413725 ◽

2006 ◽

Vol 43 (2) ◽

pp. 318-334 ◽

Cited By ~ 21

Author(s):

Xianping Guo ◽

Quanxin Zhu

Keyword(s):

Markov Decision Processes ◽

Discrete Time ◽

Existence Of Solutions ◽

Sufficient Conditions ◽

Decision Processes ◽

Stationary Policy ◽

Markov Decision ◽

Optimal Stationary Policy ◽

Optimality Inequality ◽

Action Spaces

In this paper we study discrete-time Markov decision processes with Borel state and action spaces. The criterion is to minimize average expected costs, and the costs may have neither upper nor lower bounds. We first provide two average optimality inequalities of opposing directions and give conditions for the existence of solutions to them. Then, using the two inequalities, we ensure the existence of an average optimal (deterministic) stationary policy under additional continuity-compactness assumptions. Our conditions are slightly weaker than those in the previous literature. Also, some new sufficient conditions for the existence of an average optimal stationary policy are imposed on the primitive data of the model. Moreover, our approach is slightly different from the well-known ‘optimality inequality approach’ widely used in Markov decision processes. Finally, we illustrate our results in two examples.

Download Full-text

On weak conditions and optimality inequality solutions in risk-sensitive controlled Markov processes with average criterion

Proceedings of the 41st IEEE Conference on Decision and Control, 2002. ◽

10.1109/cdc.2002.1184709 ◽

2003 ◽

Author(s):

A. Brau-Rojas ◽

E. Fernandez-Gaucherand

Keyword(s):

Markov Processes ◽

Risk Sensitive ◽

Average Criterion ◽

Optimality Inequality ◽

Controlled Markov Processes

Download Full-text

The Average Cost Optimality Equation and Critical Number Policies

Probability in the Engineering and Informational Sciences ◽

10.1017/s0269964800002783 ◽

1993 ◽

Vol 7 (1) ◽

pp. 47-67 ◽

Cited By ~ 13

Author(s):

Linn I. Sennott

Keyword(s):

Average Cost ◽

Critical Number ◽

Stationary Policy ◽

Optimality Equation ◽

Markov Decision ◽

Optimal Stationary Policy ◽

Optimality Inequality ◽

Positive Recurrent ◽

Average Cost Optimality Equation ◽

Cost Optimality

We consider a Markov decision chain with countable state space, finite action sets, and nonnegative costs. Conditions for the average cost optimality inequality to be an equality are derived. This extends work of Cavazos-Cadena [8]. It is shown that an optimal stationary policy must satisfy the optimality equation at all positive recurrent states. Structural results on the chain induced by an optimal stationary policy are derived. The results are employed in two examples to prove that any optimal stationary policy must be of critical number form.

Download Full-text

optimality inequality
Recently Published Documents

TOTAL DOCUMENTS

H-INDEX

Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies

Average optimality inequality for continuous-time Markov decision processes in Polish spaces

Average optimality for Markov decision processes in borel spaces: a new condition and approach

Average optimality for Markov decision processes in borel spaces: a new condition and approach

On weak conditions and optimality inequality solutions in risk-sensitive controlled Markov processes with average criterion

The Average Cost Optimality Equation and Critical Number Policies

Export Citation Format

optimality inequalityRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Average Cost Optimality Inequality for Markov Decision Processes with Borel Spaces and Universally Measurable Policies

Average optimality inequality for continuous-time Markov decision processes in Polish spaces

Average optimality for Markov decision processes in borel spaces: a new condition and approach

Average optimality for Markov decision processes in borel spaces: a new condition and approach

On weak conditions and optimality inequality solutions in risk-sensitive controlled Markov processes with average criterion

The Average Cost Optimality Equation and Critical Number Policies

optimality inequality
Recently Published Documents