Multi-agent temporal-difference learning with linear function approximation: Weak convergence under time-varying network topologies
2008 ◽
Vol 7
(4)
◽
pp. 432-442
◽