Logarithmic Regret for Episodic Continuous-Time Linear-Quadratic Reinforcement Learning Over a Finite-Time Horizon
1995, Vol 19 (8), pp. 1449-1469
2016, Vol 48 (1), pp. 129-137