Logarithmic Regret for Episodic Continuous-Time Linear-Quadratic Reinforcement Learning Over a Finite-Time Horizon

SSRN Electronic Journal ◽

10.2139/ssrn.3848428 ◽

2021 ◽

Author(s):

Matteo Basei ◽

Xin Guo ◽

Anran Hu ◽

Yufei Zhang

Keyword(s):

Reinforcement Learning ◽

Continuous Time ◽

Finite Time ◽

Time Horizon ◽

Linear Quadratic ◽

Finite Time Horizon ◽

Time Linear

Download Full-text

Multi-criteria dynamic optimization of mean-field stochastic linear-quadratic cooperative difference games in the finite time horizon

2020 IEEE 16th International Conference on Control & Automation (ICCA) ◽

10.1109/icca51439.2020.9264371 ◽

2020 ◽

Author(s):

Chenchen Peng ◽

Weihai Zhang

Keyword(s):

Dynamic Optimization ◽

Finite Time ◽

Time Horizon ◽

Mean Field ◽

Linear Quadratic ◽

Finite Time Horizon

Download Full-text

Discrete-Time, Linear Periodic Time-Varying System Norm Estimation Using Finite Time Horizon Transfer Operators

Automatika ◽

10.1080/00051144.2010.11828388 ◽

2010 ◽

Vol 51 (4) ◽

pp. 325-332 ◽

Cited By ~ 1

Author(s):

Przemysław Orłowski

Keyword(s):

Discrete Time ◽

Finite Time ◽

Time Horizon ◽

Time Varying ◽

Transfer Operators ◽

Finite Time Horizon ◽

Time Linear

Download Full-text

Output Feedback Reinforcement Learning Control for the Continuous-Time Linear Quadratic Regulator Problem

2018 Annual American Control Conference (ACC) ◽

10.23919/acc.2018.8431290 ◽

2018 ◽

Cited By ~ 1

Author(s):

Syed Ali Asad Rizvi ◽

Zongli Lin

Keyword(s):

Reinforcement Learning ◽

Output Feedback ◽

Continuous Time ◽

Linear Quadratic Regulator ◽

Learning Control ◽

Linear Quadratic ◽

Regulator Problem ◽

Time Linear

Download Full-text

Continuous time vs. backward induction a new approach to modelling reputation in the finite time horizon context

Journal of Economic Dynamics and Control ◽

10.1016/0165-1889(94)00837-8 ◽

1995 ◽

Vol 19 (8) ◽

pp. 1449-1469 ◽

Cited By ~ 1

Author(s):

John R. Conlon

Keyword(s):

Continuous Time ◽

Finite Time ◽

Time Horizon ◽

Backward Induction ◽

New Approach ◽

Finite Time Horizon

Download Full-text

Optimal preview control for a linear continuous-time stochastic control system in finite-time horizon

International Journal of Systems Science ◽

10.1080/00207721.2016.1160456 ◽

2016 ◽

Vol 48 (1) ◽

pp. 129-137 ◽

Cited By ~ 33

Author(s):

Jiang Wu ◽

Fucheng Liao ◽

Masayoshi Tomizuka

Keyword(s):

Control System ◽

Stochastic Control ◽

Continuous Time ◽

Finite Time ◽

Time Horizon ◽

Preview Control ◽

Stochastic Control System ◽

Finite Time Horizon ◽

Continuous Time Stochastic Control

Download Full-text

Numerical Versus Analytic Calculation of Optima and Equilibria in Fish Wars Model with Finite Time Horizon

SSRN Electronic Journal ◽

10.2139/ssrn.2778639 ◽

2016 ◽

Author(s):

Agnieszka Wiszniewska-Matyszkiel ◽

Rajani Singh

Keyword(s):

Finite Time ◽

Time Horizon ◽

Analytic Calculation ◽

Finite Time Horizon

Download Full-text

Robust Policy Iteration for Continuous-Time Linear Quadratic Regulation

IEEE Transactions on Automatic Control ◽

10.1109/tac.2021.3085510 ◽

2021 ◽

pp. 1-1

Author(s):

Bo Pang ◽

Tao Bian ◽

Zhong-Ping Jiang

Keyword(s):

Continuous Time ◽

Policy Iteration ◽

Linear Quadratic ◽

Linear Quadratic Regulation ◽

Time Linear

Download Full-text

A Constrained Markovian Diffusion Model for Controlling the Pollution Accumulation

Mathematics ◽

10.3390/math9131466 ◽

2021 ◽

Vol 9 (13) ◽

pp. 1466

Author(s):

Beatris Adriana Escobedo-Trujillo ◽

José Daniel López-Barrientos ◽

Javier Garrido-Meléndez

Keyword(s):

Dynamic Programming ◽

Dirichlet Problem ◽

Stochastic Control ◽

Finite Time ◽

Time Horizon ◽

Closed Loop ◽

Programming Techniques ◽

Pollution Accumulation ◽

Finite Time Horizon ◽

The Cost

This work presents a study of a finite-time horizon stochastic control problem with restrictions on both the reward and the cost functions. To this end, it uses standard dynamic programming techniques, and an extension of the classic Lagrange multipliers approach. The coefficients considered here are supposed to be unbounded, and the obtained strategies are of non-stationary closed-loop type. The driving thread of the paper is a sequence of examples on a pollution accumulation model, which is used for the purpose of showing three algorithms for the purpose of replicating the results. There, the reader can find a result on the interchangeability of limits in a Dirichlet problem.

Download Full-text

Finite-dimensional approximation of state-delay systems: Hankel operator approach in finite time Horizon

IFAC Proceedings Volumes ◽

10.1016/s1474-6670(17)36959-8 ◽

2000 ◽

Vol 33 (23) ◽

pp. 291-296

Author(s):

Tomonori Izumi ◽

Akira Kojima ◽

Shintaro Ishijima

Keyword(s):

Finite Time ◽

Time Horizon ◽

Hankel Operator ◽

Delay Systems ◽

Operator Approach ◽

State Delay ◽

Finite Dimensional Approximation ◽

Finite Dimensional ◽

Dimensional Approximation ◽

Finite Time Horizon

Download Full-text

A Finite-Time-Horizon Model of Suicide When a Person's Income is at Risk: A Research Note

Australian Economic Papers ◽

10.1111/1467-8454.12038 ◽

2015 ◽

Vol 54 (1) ◽

pp. 43-51

Author(s):

Tomoya Suzuki

Keyword(s):

At Risk ◽

Finite Time ◽

Time Horizon ◽

Research Note ◽

Finite Time Horizon

Download Full-text