On Monte Carlo Tree Search and Reinforcement Learning

2017, Vol. 60, pp. 881-936
Author(s): Tom Vodopivec, Spyridon Samothrakis, Branko Šter

Fuelled by successes in Computer Go, Monte Carlo tree search (MCTS) has achieved widespread adoption within the games community. Its links to traditional reinforcement learning (RL) methods have been outlined in the past; however, the use of RL techniques within tree search has not been thoroughly studied yet. In this paper we re-examine in depth this close relation between the two fields; our goal is to improve the cross-awareness between the two communities. We show that a straightforward adaptation of RL semantics within tree search can lead to a wealth of new algorithms, for which the traditional MCTS is only one of the variants. We confirm that planning methods inspired by RL in conjunction with online search demonstrate encouraging results on several classic board games and in arcade video game competitions, where our algorithm recently ranked first. Our study promotes a unified view of learning, planning, and search.
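
The abstract leaves the adaptation implicit, so here is a minimal sketch of the core idea under my own assumptions: keep UCT-style selection, but replace the Monte Carlo averaging backup with a TD-style bootstrapping update along the simulated path. The `Node` layout, the learning rate `alpha`, and the `tail_estimate` argument are illustrative choices, not the paper's exact algorithm.

```python
import math


class Node:
    """Search-tree node with an incrementally updated value estimate."""
    def __init__(self):
        self.children = {}   # action -> Node
        self.visits = 0
        self.value = 0.0


def ucb_select(node, c=1.4):
    """Standard UCT selection over the node's expanded children."""
    return max(node.children, key=lambda a:
               node.children[a].value
               + c * math.sqrt(math.log(node.visits + 1)
                               / (node.children[a].visits + 1)))


def td_backup(path, rewards, tail_estimate, alpha=0.1, gamma=1.0):
    """TD(0)-style backup along one simulated path: each node bootstraps
    from its successor's current estimate instead of waiting for the
    final Monte Carlo return."""
    next_value = tail_estimate                    # value beyond the tree
    for node, reward in zip(reversed(path), reversed(rewards)):
        target = reward + gamma * next_value      # bootstrapped TD target
        node.visits += 1
        node.value += alpha * (target - node.value)
        next_value = node.value                   # freshly updated estimate
```

Note that with `alpha = 1.0 / node.visits` and the full simulation return as the target, this update collapses to the usual MCTS running average; in that sense traditional MCTS is one point in a family of RL-style tree searches.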

2020, Vol. 11 (40), pp. 10959-10972
Author(s): Xiaoxue Wang, Yujie Qian, Hanyu Gao, Connor W. Coley, Yiming Mo, ...

A new MCTS variant, guided by a reinforcement learning value network and a solvent prediction model, proposes shorter synthesis routes that use greener solvents.
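
The summary is terse; one plausible reading is that states in the search tree are scored by a learned value network rather than (or blended with) random rollouts. The sketch below is a guess at that mechanism; `value_net.predict`, `rollout_fn`, and `lam` are assumed names, not the authors' API.

```python
def evaluate_leaf(state, value_net, rollout_fn=None, lam=1.0):
    """Score a leaf state with a learned value network, optionally
    blended with a rollout return. lam=1.0 uses the network alone."""
    v = value_net.predict(state)      # learned estimate, assumed in [0, 1]
    if rollout_fn is None or lam >= 1.0:
        return v
    r = rollout_fn(state)             # simulated return, assumed in [0, 1]
    return lam * v + (1.0 - lam) * r
```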


Author(s): Hendrik Baier, Michael Kaisers

This paper addresses the challenge of online generalization in tree search. We propose Multiple Estimator Monte Carlo Tree Search (ME-MCTS), with a two-fold contribution. First, we introduce a formalization of online generalization that can represent existing techniques such as "history heuristics", "RAVE", or "OMA": contextual action-value estimators, or abstractors, that generalize across specific contexts. Second, we incorporate recent advances in estimator averaging that enable guiding the search by combining the online action-value estimates of any number of such abstractors or similar estimators. Unlike previous work, which usually proposed a single abstractor for either the selection or the rollout phase of MCTS simulations, our approach focuses on combining multiple estimators and applies them to all move choices in a simulation. As the MCTS tree itself is just another value estimator (unbiased, but without abstraction), this blurs the traditional distinction between action choices inside and outside of the tree. Experiments with three abstractors in four board games show significant improvements of ME-MCTS over MCTS with a single abstractor, both for MCTS with random rollouts and for MCTS with static evaluation functions. Although our experiments used deterministic, fully observable games, ME-MCTS naturally extends to more challenging settings.
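
The abstract does not give the averaging formula, so the following is a rough sketch under stated assumptions: each estimator (the tree node, a RAVE table, a history table, ...) exposes `value(action)` and `visits(action)`, and estimates are combined with a simple visit-count-weighted average as a stand-in for the paper's estimator-averaging scheme.

```python
import math


def combined_value(action, estimators):
    """Visit-count-weighted average of several online action-value
    estimates (e.g. tree node, RAVE table, history heuristic)."""
    num = den = 0.0
    for est in estimators:
        n = est.visits(action)
        if n > 0:
            num += n * est.value(action)
            den += n
    return num / den if den > 0 else 0.0


def select_move(actions, estimators, total_visits, c=1.4):
    """Use the combined estimate for every move choice, inside the tree
    and during rollouts alike, plus a UCB-style exploration bonus."""
    def score(a):
        n = sum(e.visits(a) for e in estimators)
        return (combined_value(a, estimators)
                + c * math.sqrt(math.log(total_visits + 1) / (n + 1)))
    return max(actions, key=score)
```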


2013, Vol. 48, pp. 841-883
Author(s): A. Guez, D. Silver, P. Dayan

Bayesian planning is a formally elegant approach to learning optimal behaviour under model uncertainty, trading off exploration and exploitation in an ideal way. Unfortunately, planning optimally in the face of uncertainty is notoriously taxing, since the search space is enormous. In this paper we introduce a tractable, sample-based method for approximate Bayes-optimal planning which exploits Monte Carlo tree search. Our approach avoids expensive applications of Bayes' rule within the search tree by sampling models from current beliefs, and performs this sampling lazily. This enables it to outperform previous Bayesian model-based reinforcement learning algorithms by a significant margin on several well-known benchmark problems. As we show, our approach can even work in problems with an infinite state space that lie qualitatively out of reach of almost all previous work in Bayesian exploration.
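
The way the method sidesteps Bayes' rule inside the tree is root sampling: each simulation draws a single model from the current posterior at the root and then runs entirely within that fixed model. A minimal sketch, with `posterior.sample()`, `model.step()`, and `policy` as assumed interfaces (and eager rather than lazy sampling):

```python
def simulate_once(root_state, posterior, policy, gamma=0.95, horizon=50):
    """One root-sampling simulation: sample a model from the posterior
    once, then simulate through that fixed model. No belief update is
    ever performed inside the search tree."""
    model = posterior.sample()       # lazy variants defer sampling parts
                                     # of the model until they are needed
    state, ret, discount = root_state, 0.0, 1.0
    for _ in range(horizon):
        action = policy(state)
        state, reward, done = model.step(state, action)
        ret += discount * reward
        discount *= gamma
        if done:
            break
    return ret                       # backed up into the tree as usual
```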

