average reward optimality
Recently Published Documents

TOTAL DOCUMENTS

(FIVE YEARS 0)

H-INDEX

(FIVE YEARS 0)

Latest Documents Most Cited Documents Contributed Authors Related Sources Related Keywords

Policy Iteration for Continuous-Time Average Reward Markov Decision Processes in Polish Spaces

Abstract and Applied Analysis ◽

10.1155/2009/103723 ◽

2009 ◽

Vol 2009 ◽

pp. 1-17 ◽

Cited By ~ 2

Author(s):

Quanxin Zhu ◽

Xinsong Yang ◽

Chuangxia Huang

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Policy Iteration ◽

Decision Processes ◽

Iteration Algorithm ◽

Average Reward ◽

Stationary Policy ◽

Optimality Equation ◽

Markov Decision ◽

Average Reward Optimality

We study thepolicy iteration algorithm(PIA) for continuous-time jump Markov decision processes in general state and action spaces. The corresponding transition rates are allowed to beunbounded, and the reward rates may haveneither upper nor lower bounds. The criterion that we are concerned with isexpected average reward. We propose a set of conditions under which we first establish the average reward optimality equation and present the PIA. Then under twoslightlydifferent sets of conditions we show that the PIA yields the optimal (maximum) reward, an average optimal stationary policy, and a solution to the average reward optimality equation.

Download Full-text

average reward optimalityRecently Published Documents

TOTAL DOCUMENTS

H-INDEX

Policy Iteration for Continuous-Time Average Reward Markov Decision Processes in Polish Spaces

average reward optimality
Recently Published Documents