Policy gradient stochastic approximation algorithms for adaptive control of constrained time varying markov decision processes

42nd IEEE International Conference on Decision and Control (IEEE Cat. No.03CH37475) ◽

10.1109/cdc.2003.1273053 ◽

2004 ◽

Cited By ~ 10

Author(s):

F.J. Vazquez Abad ◽

V. Krishnamurthy

Keyword(s):

Adaptive Control ◽

Approximation Algorithms ◽

Markov Decision Processes ◽

Stochastic Approximation ◽

Decision Processes ◽

Time Varying ◽

Policy Gradient ◽

Markov Decision

Download Full-text

A unified approach to adaptive control of average reward Markov decision processes

OR Spectrum ◽

10.1007/bf01740510 ◽

1988 ◽

Vol 10 (3) ◽

pp. 161-166 ◽

Cited By ~ 5

Author(s):

G. Hübner

Keyword(s):

Adaptive Control ◽

Markov Decision Processes ◽

Decision Processes ◽

Average Reward ◽

Unified Approach ◽

Markov Decision

Download Full-text

Deterministic policy gradient algorithms for semi‐Markov decision processes

International Journal of Intelligent Systems ◽

10.1002/int.22709 ◽

2021 ◽

Author(s):

Ashkan Haji Hosseinloo ◽

Munther A. Dahleh

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Gradient Algorithms ◽

Policy Gradient ◽

Markov Decision

Download Full-text

Scalable grid‐based approximation algorithms for partially observable Markov decision processes

Concurrency and Computation Practice and Experience ◽

10.1002/cpe.6743 ◽

2021 ◽

Author(s):

Can Kavaklioglu ◽

Mucahit Cevik

Keyword(s):

Approximation Algorithms ◽

Markov Decision Processes ◽

Decision Processes ◽

Markov Decision ◽

Partially Observable Markov ◽

Partially Observable ◽

Grid Based

Download Full-text

Stochastic Approximation for Risk-aware Markov Decision Processes

IEEE Transactions on Automatic Control ◽

10.1109/tac.2020.2989702 ◽

2020 ◽

pp. 1-1

Author(s):

Wenjie Huang ◽

William B. Haskell

Keyword(s):

Markov Decision Processes ◽

Stochastic Approximation ◽

Decision Processes ◽

Markov Decision

Download Full-text

Policy gradient in Lipschitz Markov Decision Processes

Machine Learning ◽

10.1007/s10994-015-5484-1 ◽

2015 ◽

Vol 100 (2-3) ◽

pp. 255-283 ◽

Cited By ~ 5

Author(s):

Matteo Pirotta ◽

Marcello Restelli ◽

Luca Bascetta

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Policy Gradient ◽

Markov Decision

Download Full-text

Time-varying Markov decision processes with state-action-dependent discount factors and unbounded costs

Kybernetika ◽

10.14736/kyb-2019-1-0166 ◽

2019 ◽

pp. 166-182

Author(s):

Beatris A. Escobedo-Trujillo ◽

Carmen G. Higuera-Chan

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Time Varying ◽

State Action ◽

Discount Factors ◽

Markov Decision

Download Full-text

Adaptive control of M/M/1 queues—continuous-time Markov decision process approach

Journal of Applied Probability ◽

10.1017/s0021900200023512 ◽

1983 ◽

Vol 20 (02) ◽

pp. 368-379

Author(s):

Lam Yeh ◽

L. C. Thomas

Keyword(s):

Adaptive Control ◽

Markov Decision Process ◽

Markov Decision Processes ◽

Optimal Policy ◽

Continuous Time ◽

Decision Process ◽

Process Approach ◽

Decision Processes ◽

Markov Decision ◽

Discounted Costs

By considering continuous-time Markov decision processes where decisions can be made at any time, we show in the case of M/M/1 queues with discounted costs that there exists a monotone optimal policy among all the regular policies.

Download Full-text

A Simultaneous Perturbation Stochastic Approximation-Based Actor–Critic Algorithm for Markov Decision Processes

IEEE Transactions on Automatic Control ◽

10.1109/tac.2004.825622 ◽

2004 ◽

Vol 49 (4) ◽

pp. 592-598 ◽

Cited By ~ 23

Author(s):

S. Bhatnagar ◽

S. Kumar

Keyword(s):

Markov Decision Processes ◽

Stochastic Approximation ◽

Decision Processes ◽

Simultaneous Perturbation Stochastic Approximation ◽

Markov Decision

Download Full-text

On the adaptive control of a class of partially observed Markov decision processes

2009 American Control Conference ◽

10.1109/acc.2009.5159826 ◽

2009 ◽

Author(s):

Shun-Pin Hsu ◽

Dong-Ming Chuang ◽

Ari Arapostathis

Keyword(s):

Adaptive Control ◽

Markov Decision Processes ◽

Decision Processes ◽

Partially Observed ◽

Markov Decision

Download Full-text

Adaptive control of M/M/1 queues—continuous-time Markov decision process approach

Journal of Applied Probability ◽

10.2307/3213809 ◽

1983 ◽

Vol 20 (2) ◽

pp. 368-379 ◽

Cited By ~ 6

Author(s):

Lam Yeh ◽

L. C. Thomas

Keyword(s):

Adaptive Control ◽

Markov Decision Process ◽

Markov Decision Processes ◽

Optimal Policy ◽

Continuous Time ◽

Decision Process ◽

Process Approach ◽

Decision Processes ◽

Markov Decision ◽

Discounted Costs

By considering continuous-time Markov decision processes where decisions can be made at any time, we show in the case of M/M/1 queues with discounted costs that there exists a monotone optimal policy among all the regular policies.

Download Full-text