Identification and Adaptive Control of Markov Chains

SIAM Journal on Control and Optimization ◽

10.1137/0320035 ◽

1982 ◽

Vol 20 (4) ◽

pp. 470-489 ◽

Cited By ~ 32

Author(s):

Vivek Borkar ◽

Pravin Varaiya

Keyword(s):

Adaptive Control ◽

Markov Chains

Download Full-text

On the arcsine law in the adaptive control of Markov chains

Probability Theory and Applications ◽

10.1515/9783112314227-044 ◽

1987 ◽

pp. 331-336

Keyword(s):

Adaptive Control ◽

Markov Chains

Download Full-text

Adaptive control of Markov chains with local updates

Systems & Control Letters ◽

10.1016/0167-6911(90)90015-m ◽

1990 ◽

Vol 14 (3) ◽

pp. 209-218 ◽

Cited By ~ 5

Author(s):

Ahmad Jalali ◽

Michael Ferguson

Keyword(s):

Adaptive Control ◽

Markov Chains

Download Full-text

Computationally efficient adaptive control algorithms for Markov chains

10.1109/cdc.1989.70344 ◽

2003 ◽

Cited By ~ 7

Author(s):

A. Jalali ◽

M. Ferguson

Keyword(s):

Adaptive Control ◽

Markov Chains ◽

Control Algorithms ◽

Computationally Efficient

Download Full-text

An optimization-oriented approach to the adaptive control of Markov chains

IEEE Transactions on Automatic Control ◽

10.1109/tac.1987.1104709 ◽

1987 ◽

Vol 32 (9) ◽

pp. 754-762 ◽

Cited By ~ 10

Author(s):

R. Milito ◽

J. Cruz

Keyword(s):

Adaptive Control ◽

Markov Chains ◽

Oriented Approach

Download Full-text

On the Milito-Cruz adaptive control scheme for Markov chains

Journal of Optimization Theory and Applications ◽

10.1007/bf00940719 ◽

1993 ◽

Vol 77 (2) ◽

pp. 387-397 ◽

Cited By ~ 1

Author(s):

V. S. Borkar

Keyword(s):

Adaptive Control ◽

Markov Chains ◽

Control Scheme ◽

Adaptive Control Scheme

Download Full-text

Adaptive control of Markov chains, I: Finite parameter set

IEEE Transactions on Automatic Control ◽

10.1109/tac.1979.1102191 ◽

1979 ◽

Vol 24 (6) ◽

pp. 953-957 ◽

Cited By ~ 80

Author(s):

V. Borkar ◽

P. Varaiya

Keyword(s):

Adaptive Control ◽

Markov Chains

Download Full-text

ADAPTIVE CONTROL OF MARKOV CHAINS: A SURVEY

Theory and Application of Digital Control ◽

10.1016/b978-0-08-027618-2.50020-6 ◽

1982 ◽

pp. 89-93 ◽

Cited By ~ 1

Author(s):

P. Varaiya

Keyword(s):

Adaptive Control ◽

Markov Chains

Download Full-text

Adaptive Control of Finite Markov Chains Employing Stochastic Approximation Method

IFAC Proceedings Volumes ◽

10.1016/s1474-6670(17)59825-0 ◽

1986 ◽

Vol 19 (5) ◽

pp. 365-369

Author(s):

A.V. Nazin ◽

A.S. Poznyak

Keyword(s):

Adaptive Control ◽

Markov Chains ◽

Stochastic Approximation ◽

Approximation Method ◽

Finite Markov Chains

Download Full-text

A methodology for the adaptive control of Markov chains under partial state information

[1992] Proceedings of the 31st IEEE Conference on Decision and Control ◽

10.1109/cdc.1992.371318 ◽

2005 ◽

Cited By ~ 3

Author(s):

E. Fernandez-Gaucherand ◽

A. Arapostathis ◽

S.I. Marcus

Keyword(s):

Adaptive Control ◽

Markov Chains ◽

State Information ◽

Partial State

Download Full-text

Minimizing the learning loss in adaptive control of Markov chains under the weak accessibility condition

Journal of Applied Probability ◽

10.1017/s0021900200042698 ◽

1991 ◽

Vol 28 (04) ◽

pp. 779-790 ◽

Cited By ~ 1

Author(s):

Rajeev Agrawal

Keyword(s):

Adaptive Control ◽

Maximum Likelihood ◽

Markov Chains ◽

Upper Bound ◽

Unknown Parameter ◽

Learning Loss ◽

Probability Of Error ◽

Control Scheme ◽

Rate Of Decay ◽

Randomized Control

We consider the adaptive control of Markov chains under the weak accessibility condition with a view to minimizing the learning loss. A certainty equivalence control with a forcing scheme is constructed. We use a stationary randomized control scheme for forcing and compute a maximum likelihood estimate of the unknown parameter from the resulting observations. We obtain an exponential upper bound on the rate of decay of the probability of error. This allows us to choose the rate of forcing appropriately, whereby we achieve a o(f(n) log n) learning loss for any function as .

Download Full-text