Policy gradient stochastic approximation algorithms for adaptive control of constrained time varying markov decision processes

Author(s):  
F.J. Vazquez Abad ◽  
V. Krishnamurthy
2015 ◽  
Vol 100 (2-3) ◽  
pp. 255-283 ◽  
Author(s):  
Matteo Pirotta ◽  
Marcello Restelli ◽  
Luca Bascetta

1983 ◽  
Vol 20 (02) ◽  
pp. 368-379
Author(s):  
Lam Yeh ◽  
L. C. Thomas

By considering continuous-time Markov decision processes where decisions can be made at any time, we show in the case of M/M/1 queues with discounted costs that there exists a monotone optimal policy among all the regular policies.


1983 ◽  
Vol 20 (2) ◽  
pp. 368-379 ◽  
Author(s):  
Lam Yeh ◽  
L. C. Thomas

By considering continuous-time Markov decision processes where decisions can be made at any time, we show in the case of M/M/1 queues with discounted costs that there exists a monotone optimal policy among all the regular policies.


Sign in / Sign up

Export Citation Format

Share Document