Design of a Reinforcement Learning-Based Lane Keeping Planning Agent for Automated Vehicles

Bálint Kővári; Ferenc Hegedüs; Tamás Bécsi

doi:10.3390/app10207171

Applied Sciences; doi:10.3390/app10207171; 2020; Vol 10 (20); pp. 7171

Author(s): Bálint Kővári; Ferenc Hegedüs; Tamás Bécsi

Keyword(s): Reinforcement Learning; Real Time; Autonomous Vehicles; High Performance; Search Algorithm; Tree Search; Monte Carlo Tree Search; Learning Agents; Combined Solution; Lane Keeping

Reinforcement learning-based approaches are widely studied in the literature for solving control tasks for Connected and Autonomous Vehicles. This paper deals with the lateral control of a dynamic nonlinear vehicle model performing the task of lane-keeping. In this area, the appropriate formulation of the goals and environment information is crucial, and the research outlines the importance of lookahead information, which makes it possible to accomplish maneuvers with complex trajectories. Another critical aspect is the real-time nature of the problem. On the one hand, optimization- or search-based methods, such as the presented Monte Carlo Tree Search method, can solve the problem at the cost of high numerical complexity. On the other hand, single Reinforcement Learning agents struggle to learn these tasks with high performance, though they have the advantage that, after the training process, they can operate in real time. Two planning agent structures are proposed in the paper to resolve this duality, where the machine learning agents aid the tree search algorithm. As a result, the combined solution provides high performance and low computational needs.
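
For readers unfamiliar with the tree search side of this trade-off, the following is a minimal sketch of the UCB1 child-selection rule commonly used inside Monte Carlo Tree Search. It is a generic illustration, not the planning agents proposed in the paper: the function names, the action labels, and the exploration constant `c` are all assumptions made for this example.

```python
import math


def ucb1_score(value_sum, visits, parent_visits, c=math.sqrt(2)):
    """UCB1 score of one child node (generic rule, not the paper's method).

    Unvisited children get an infinite score so they are tried first.
    """
    if visits == 0:
        return math.inf
    mean_value = value_sum / visits
    exploration = c * math.sqrt(math.log(parent_visits) / visits)
    return mean_value + exploration


def select_child(children, parent_visits):
    """Pick the action whose child has the highest UCB1 score.

    `children` maps a (hypothetical) action label to (value_sum, visits).
    """
    return max(children, key=lambda a: ucb1_score(*children[a], parent_visits))


# Hypothetical steering actions with accumulated statistics.
children = {"left": (3.0, 5), "straight": (4.0, 5), "right": (0.0, 0)}
print(select_child(children, parent_visits=10))
```

Every decision repeats this selection over many simulated rollouts, which is where the numerical cost of pure tree search comes from. A trained agent could, for example, supply value estimates or prune the action set, but how the paper combines the two is not specified in the abstract.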

Enhanced Reinforcement Learning Method Combining One-Hot Encoding-Based Vectors for CNN-Based Alternative High-Level Decisions

Applied Sciences; doi:10.3390/app11031291; 2021; Vol 11 (3); pp. 1291

Author(s): Bonwoo Gu; Yunsick Sung

Keyword(s): Reinforcement Learning; Search Algorithm; Classification Criteria; Tree Search; Learning Method; Board Game; Ancient China; Monte Carlo Tree Search; High Level; Tree Search Algorithm

Gomoku is a two-player board game that originated in ancient China. Gomoku players have been developed with various artificial intelligence techniques, such as genetic algorithms and tree search algorithms. Alpha-Gomoku, a Gomoku AI built with Alpha-Go’s algorithm, defines all possible situations on the Gomoku board using Monte-Carlo tree search (MCTS), and minimizes the probability of learning other correct answers in the duplicated Gomoku board situation. However, in the tree search algorithm, the accuracy drops because the classification criteria are set manually. In this paper, we propose an improved reinforcement learning-based high-level decision approach using convolutional neural networks (CNN). The proposed algorithm expresses each state as One-Hot Encoding based vectors and determines the state of the Gomoku board by combining similar states of One-Hot Encoding based vectors. In addition, for cases where the position chosen by the CNN is already occupied or cannot be played, we suggest a method for selecting an alternative. We verify the proposed Gomoku AI method in GuPyEngine, a Python-based 3D simulation platform.
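
The two ideas named in this abstract, a one-hot board representation and a fallback when the preferred cell is unavailable, can be illustrated with a small sketch. This is not the authors' algorithm: the cell codes, the function names, and the rule of falling back to the best-scoring empty cell are assumptions chosen for the example.

```python
# Assumed cell codes; the abstract does not specify an encoding.
EMPTY, BLACK, WHITE = 0, 1, 2


def one_hot_board(board):
    """Turn a 2D board of cell codes into a 3-channel one-hot nested list."""
    return [
        [[1 if cell == code else 0 for code in (EMPTY, BLACK, WHITE)] for cell in row]
        for row in board
    ]


def pick_legal_move(scores, board):
    """Return (row, col) of the best-scoring empty cell.

    `scores` stands in for a hypothetical per-cell network output. If the
    top-scoring cell is occupied, the next best empty cell is chosen instead.
    """
    best = None
    for r, row in enumerate(board):
        for c, cell in enumerate(row):
            if cell != EMPTY:
                continue  # occupied cells are never candidates
            if best is None or scores[r][c] > scores[best[0]][best[1]]:
                best = (r, c)
    if best is None:
        raise ValueError("board is full")
    return best


board = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
scores = [[0.0, 0.0, 0.5], [0.0, 0.9, 0.0], [0.0, 0.0, 0.0]]
print(one_hot_board(board)[1][1])
print(pick_legal_move(scores, board))
```

Here the highest score sits on the occupied centre cell, so the sketch falls back to the best remaining empty cell.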

A reinforcement learning application of a guided Monte Carlo Tree Search algorithm for beam orientation selection in radiation therapy

Machine Learning: Science and Technology; doi:10.1088/2632-2153/abe528; 2021

Author(s): Azar Sadeghnejad Barkousaraie; Gyanendra Bohara; Steve B Jiang; Dan Nguyen

Keyword(s): Monte Carlo; Radiation Therapy; Reinforcement Learning; Search Algorithm; Tree Search; Orientation Selection; Monte Carlo Tree Search; Beam Orientation; Tree Search Algorithm

Deep learning inspired routing in ICN using Monte Carlo Tree Search algorithm

Journal of Parallel and Distributed Computing; doi:10.1016/j.jpdc.2020.12.014; 2021

Author(s): Nitul Dutta; Shobhit K. Patel; Vadim Samusenkov; Vigneswaran D.

Keyword(s): Monte Carlo; Deep Learning; Search Algorithm; Tree Search; Monte Carlo Tree Search; Tree Search Algorithm

An Efficiency Enhancing Methodology for Multiple Autonomous Vehicles in an Urban Network Adopting Deep Reinforcement Learning

Applied Sciences; doi:10.3390/app11041514; 2021; Vol 11 (4); pp. 1514; Cited By ~ 2

Author(s): Quang-Duy Tran; Sang-Hoon Bae

Keyword(s): Reinforcement Learning; Traffic Congestion; Autonomous Vehicles; Penetration Rate; Autonomous Vehicle; Effective Means; Urban Network; Learning Agents; Policy Optimization; The Impact

To reduce the impact of congestion, it is necessary to improve our overall understanding of the influence of the autonomous vehicle. Recently, deep reinforcement learning has become an effective means of solving complex control tasks. Accordingly, we present an advanced deep reinforcement learning approach that investigates how the leading autonomous vehicles affect the urban network under a mixed-traffic environment. We also suggest a set of hyperparameters for achieving better performance. Firstly, we feed a set of hyperparameters into our deep reinforcement learning agents. Secondly, we investigate the leading autonomous vehicle experiment in the urban network with different autonomous vehicle penetration rates. Thirdly, the advantage of leading autonomous vehicles is evaluated using entire manual vehicle and leading manual vehicle experiments. Finally, proximal policy optimization with a clipped objective is compared to proximal policy optimization with an adaptive Kullback–Leibler penalty to verify the superiority of the proposed hyperparameters. We demonstrate that fully automated traffic increased the average speed by a factor of 1.27 compared with the entire manual vehicle experiment. Our proposed method becomes significantly more effective at a higher autonomous vehicle penetration rate. Furthermore, the leading autonomous vehicles could help to mitigate traffic congestion.
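
The two proximal policy optimization variants compared in this abstract differ only in their surrogate objective. As a reminder, their standard forms are written out below. The abstract defines no notation, so the symbols are assumptions taken from the usual presentation of the method: $r_t(\theta)$ is the probability ratio between the new and old policies, $\hat{A}_t$ an advantage estimate, $\epsilon$ the clipping range, and $\beta$ the penalty coefficient.

```latex
r_t(\theta) = \frac{\pi_\theta(a_t \mid s_t)}{\pi_{\theta_\text{old}}(a_t \mid s_t)}

L^{\text{CLIP}}(\theta) = \hat{\mathbb{E}}_t\left[
  \min\left( r_t(\theta)\,\hat{A}_t,\;
  \operatorname{clip}\big(r_t(\theta),\, 1-\epsilon,\, 1+\epsilon\big)\,\hat{A}_t \right)
\right]

L^{\text{KLPEN}}(\theta) = \hat{\mathbb{E}}_t\left[
  r_t(\theta)\,\hat{A}_t
  - \beta\,\mathrm{KL}\big[\pi_{\theta_\text{old}}(\cdot \mid s_t),\; \pi_\theta(\cdot \mid s_t)\big]
\right]
```

In the adaptive variant, $\beta$ is raised when the measured KL divergence overshoots a target value and lowered when it undershoots. The clipped variant needs no such tuning loop, only a choice of $\epsilon$.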

Monte Carlo Tree Search for Bayesian Reinforcement Learning

2012 11th International Conference on Machine Learning and Applications; doi:10.1109/icmla.2012.30; 2012; Cited By ~ 2

Author(s): Ngo Anh Vien; Wolfgang Ertel

Keyword(s): Monte Carlo; Reinforcement Learning; Tree Search; Monte Carlo Tree Search; Bayesian Reinforcement Learning

Real-Time Monte Carlo Tree Search in Ms Pac-Man

IEEE Transactions on Computational Intelligence and AI in Games; doi:10.1109/tciaig.2013.2291577; 2014; Vol 6 (3); pp. 245-257; Cited By ~ 24

Author(s): Tom Pepels; Mark H. M. Winands; Marc Lanctot

Keyword(s): Monte Carlo; Real Time; Tree Search; Monte Carlo Tree Search

Development of rehabilitation system (RehabGame) through Monte-Carlo tree search algorithm using kinect and Myo sensor interface

2017 Computing Conference; doi:10.1109/sai.2017.8252217; 2017; Cited By ~ 3

Author(s): Shabnam Sadeghi Esfahlani; George Wilson

Keyword(s): Monte Carlo; Search Algorithm; Tree Search; Sensor Interface; Monte Carlo Tree Search; Rehabilitation System; Tree Search Algorithm

Adjustment of Difficulty Level on Wobble Board-Based Game Using Monte Carlo Tree Search Algorithm

2018 5th International Conference on Data and Software Engineering (ICoDSE); doi:10.1109/icodse.2018.8705843; 2018

Author(s): Adi Purnama; Saiful Akbar; Dody Dharma

Keyword(s): Monte Carlo; Search Algorithm; Difficulty Level; Tree Search; Monte Carlo Tree Search; Tree Search Algorithm

Towards efficient discovery of green synthetic pathways with Monte Carlo tree search and reinforcement learning

Chemical Science; doi:10.1039/d0sc04184j; 2020; Vol 11 (40); pp. 10959-10972

Author(s): Xiaoxue Wang; Yujie Qian; Hanyu Gao; Connor W. Coley; Yiming Mo; ...

Keyword(s): Monte Carlo; Reinforcement Learning; Prediction Model; Tree Search; Monte Carlo Tree Search; Value Network; Synthesis Routes

A new MCTS variant with a reinforcement learning value network and solvent prediction model proposes shorter synthesis routes with greener solvents.

A modified Monte-Carlo Tree Search Algorithm for Two-sided Assembly Line Balancing Problem

IFAC-PapersOnLine; doi:10.1016/j.ifacol.2019.11.483; 2019; Vol 52 (13); pp. 1920-1924

Author(s): Chuanxun Wu; Xiaofeng Hu; Yahui Zhang; Pengfei Wang

Keyword(s): Monte Carlo; Assembly Line; Search Algorithm; Assembly Line Balancing; Line Balancing; Tree Search; Monte Carlo Tree Search; Assembly Line Balancing Problem; Tree Search Algorithm
