Online regret bounds for Markov decision processes with deterministic transitions
2010 ◽
Vol 411
(29-30)
◽
pp. 2684-2695
◽
2008 ◽
pp. 123-137
◽
2012 ◽
Vol 38
(5)
◽
pp. 673-687
◽
1992 ◽
Vol 43
(11)
◽
pp. 1095-1102
Keyword(s):