Risk-sensitive semi-Markov decision processes with general utilities and multiple criteria

Yonghui Huang; Zhaotong Lian; Xianping Guo

doi:10.1017/apr.2018.36

Risk-sensitive semi-Markov decision processes with general utilities and multiple criteria

Advances in Applied Probability ◽

10.1017/apr.2018.36 ◽

2018 ◽

Vol 50 (3) ◽

pp. 783-804

Author(s):

Yonghui Huang ◽

Zhaotong Lian ◽

Xianping Guo

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Finite Horizon ◽

Performance Criteria ◽

Occupation Measure ◽

Constrained Problems ◽

Constrained Problem ◽

Risk Sensitive ◽

Special Cases ◽

Markov Decision

Abstract In this paper we investigate risk-sensitive semi-Markov decision processes with a Borel state space, unbounded cost rates, and general utility functions. The performance criteria are several expected utilities of the total cost in a finite horizon. Our analysis is based on a type of finite-horizon occupation measure. We express the distribution of the finite-horizon cost in terms of the occupation measure for each policy, wherein the discount is not needed. For unconstrained and constrained problems, we establish the existence and computation of optimal policies. In particular, we develop a linear program and its dual program for the constrained problem and, moreover, establish the strong duality between the two programs. Finally, we provide two special cases of our results, one of which concerns the discrete-time model, and the other the chance-constrained problem.

Download Full-text

Continuous-time Markov decision processes with risk-sensitive finite-horizon cost criterion

Mathematical Methods of Operations Research ◽

10.1007/s00186-016-0550-4 ◽

2016 ◽

Vol 84 (3) ◽

pp. 461-487 ◽

Cited By ~ 11

Author(s):

Qingda Wei

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Finite Horizon ◽

Cost Criterion ◽

Risk Sensitive ◽

Markov Decision

Download Full-text

Risk-sensitive finite-horizon piecewise deterministic Markov decision processes

Operations Research Letters ◽

10.1016/j.orl.2019.05.001 ◽

2020 ◽

Vol 48 (1) ◽

pp. 96-103 ◽

Cited By ~ 1

Author(s):

Yonghui Huang ◽

Zhaotong Lian ◽

Xianping Guo

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Finite Horizon ◽

Risk Sensitive ◽

Markov Decision

Download Full-text

Finite horizon risk-sensitive continuous-time Markov decision processes with unbounded transition and cost rates

4OR ◽

10.1007/s10288-019-0398-6 ◽

2019 ◽

Vol 17 (4) ◽

pp. 427-442 ◽

Cited By ~ 3

Author(s):

Xin Guo ◽

Qiuli Liu ◽

Yi Zhang

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Finite Horizon ◽

Risk Sensitive ◽

Markov Decision

Download Full-text

Risk-sensitive average continuous-time Markov decision processes with unbounded transition and cost rates

Journal of Applied Probability ◽

10.1017/jpr.2020.105 ◽

2021 ◽

Vol 58 (2) ◽

pp. 523-550

Author(s):

Xin Guo ◽

Yonghui Huang

Keyword(s):

Markov Decision Processes ◽

Continuous Time ◽

Decision Processes ◽

Finite Horizon ◽

Dynamic Programming Principle ◽

Iteration Algorithm ◽

Stationary Policy ◽

Risk Sensitive ◽

Finite State ◽

Markov Decision

AbstractThis paper considers risk-sensitive average optimization for denumerable continuous-time Markov decision processes (CTMDPs), in which the transition and cost rates are allowed to be unbounded, and the policies can be randomized history dependent. We first derive the multiplicative dynamic programming principle and some new facts for risk-sensitive finite-horizon CTMDPs. Then, we establish the existence and uniqueness of a solution to the risk-sensitive average optimality equation (RS-AOE) through the results for risk-sensitive finite-horizon CTMDPs developed here, and also prove the existence of an optimal stationary policy via the RS-AOE. Furthermore, for the case of finite actions available at each state, we construct a sequence of models of finite-state CTMDPs with optimal stationary policies which can be obtained by a policy iteration algorithm in a finite number of iterations, and prove that an average optimal policy for the case of infinitely countable states can be approximated by those of the finite-state models. Finally, we illustrate the conditions and the iteration algorithm with an example.

Download Full-text

Finite-horizon piecewise deterministic Markov decision processes with unbounded transition rates

Stochastics ◽

10.1080/17442508.2018.1518450 ◽

2018 ◽

Vol 91 (1) ◽

pp. 67-95 ◽

Cited By ~ 2

Author(s):

Yonghui Huang ◽

Xianping Guo

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Finite Horizon ◽

Transition Rates ◽

Unbounded Transition Rates ◽

Markov Decision

Download Full-text

Markov decision processes with restricted observations: Finite horizon case

Naval Research Logistics (NRL) ◽

10.1002/(sici)1520-6750(199708)44:5<439::aid-nav3>3.0.co;2-5 ◽

1997 ◽

Vol 44 (5) ◽

pp. 439-456 ◽

Cited By ~ 4

Author(s):

Yasemin Serin ◽

Zeynep Muge Avsar

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Finite Horizon ◽

Markov Decision ◽

Horizon Case

Download Full-text

An Approximate Stochastic Annealing algorithm for finite horizon Markov decision processes

49th IEEE Conference on Decision and Control (CDC) ◽

10.1109/cdc.2010.5717689 ◽

2010 ◽

Cited By ~ 5

Author(s):

Jiaqiao Hu ◽

Hyeong Soo Chang

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Finite Horizon ◽

Markov Decision ◽

Annealing Algorithm

Download Full-text

Risk‐Sensitive Markov Decision Processes with Combined Metrics of Mean and Variance

Production and Operations Management ◽

10.1111/poms.13252 ◽

2020 ◽

Author(s):

Li Xia

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Risk Sensitive ◽

Markov Decision ◽

Mean And Variance

Download Full-text

A Corrected And Improved Computational Scheme For Finite Horizon Partially Observable Markov Decision Processes

INFOR Information Systems and Operational Research ◽

10.1080/03155986.1991.11732169 ◽

1991 ◽

Vol 29 (3) ◽

pp. 206-212

Author(s):

Sraban Mukherjee ◽

Kiran Seth

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Finite Horizon ◽

Computational Scheme ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text

Time-Inconsistent Risk-Sensitive Equilibrium for Countable-Stated Markov Decision Processes

Applied Mathematics & Optimization ◽

10.1007/s00245-020-09690-3 ◽

2020 ◽

Author(s):

Hongwei Mei

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Risk Sensitive ◽

Markov Decision ◽

Time Inconsistent

Download Full-text