A Collision Avoidance Method Based on Deep Reinforcement Learning

Robotics ◽  
2021 ◽  
Vol 10 (2) ◽  
pp. 73
Author(s):  
Shumin Feng ◽  
Bijo Sebastian ◽  
Pinhas Ben-Tzvi

This paper investigates the usefulness of deep reinforcement learning for solving collision avoidance problems in an unknown environment, especially in compact spaces such as a narrow corridor. The research aims to determine whether a deep reinforcement learning-based collision avoidance method is superior to traditional methods, such as potential field-based methods and the dynamic window approach. In addition, the proposed obstacle avoidance method was developed as one of the capabilities enabling each robot in a novel robotic system, namely the Self-reconfigurable and Transformable Omni-Directional Robotic Modules (STORM), to navigate intelligently and safely in an unknown environment. A well-conceived hardware and software architecture, with features that enable further expansion and parallel development, designed for the ongoing STORM projects is also presented in this work. A virtual STORM module with skid-steer kinematics was simulated in Gazebo to reduce the gap between the simulations and the real-world implementations. Moreover, comparisons among multiple training runs of the neural networks with different parameters related to balancing exploitation and exploration during the training process, as well as tests and experiments conducted in both simulation and the real world, are presented in detail. Directions for future research are also provided.
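As a rough illustration of the class of method this abstract describes (a hypothetical sketch, not the authors' STORM implementation), the snippet below shows a DQN-style network mapping a 1D range scan plus goal features to discrete motion commands, with epsilon-greedy action selection reflecting the exploration/exploitation trade-off the paper compares across training runs. All names, shapes, and action counts are assumptions.

```python
import torch
import torch.nn as nn

class CollisionAvoidanceDQN(nn.Module):
    """Toy Q-network: laser scan + goal features -> Q-value per motion command."""
    def __init__(self, n_scan_points: int = 360, n_actions: int = 5):
        super().__init__()
        # Input: laser ranges plus 2 goal features (distance, relative angle).
        self.net = nn.Sequential(
            nn.Linear(n_scan_points + 2, 256),
            nn.ReLU(),
            nn.Linear(256, 128),
            nn.ReLU(),
            nn.Linear(128, n_actions),  # one Q-value per discrete command
        )

    def forward(self, scan: torch.Tensor, goal: torch.Tensor) -> torch.Tensor:
        return self.net(torch.cat([scan, goal], dim=-1))

def select_action(q_net, scan, goal, epsilon: float, n_actions: int = 5) -> int:
    # Epsilon-greedy: explore with probability epsilon, otherwise exploit.
    if torch.rand(1).item() < epsilon:
        return int(torch.randint(0, n_actions, (1,)).item())
    with torch.no_grad():
        return int(q_net(scan, goal).argmax(dim=-1).item())
```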

2021 ◽  
Vol 70 ◽  
Author(s):  
Jess Whittlestone ◽  
Kai Arulkumaran ◽  
Matthew Crosby

Deep Reinforcement Learning (DRL) is an avenue of research in Artificial Intelligence (AI) that has received increasing attention within the research community in recent years, and is beginning to show potential for real-world application. DRL is one of the most promising routes towards developing more autonomous AI systems that interact with and take actions in complex real-world environments, and can more flexibly solve a range of problems for which we may not be able to precisely specify a correct ‘answer’. This could have substantial implications for people’s lives: for example by speeding up automation in various sectors, changing the nature and potential harms of online influence, or introducing new safety risks in physical infrastructure. In this paper, we review recent progress in DRL, discuss how this may introduce novel and pressing issues for society, ethics, and governance, and highlight important avenues for future research to better understand DRL’s societal implications. This article appears in the special track on AI and Society.


2021 ◽  
Vol 2021 ◽  
pp. 1-10
Author(s):  
M. Funk Drechsler ◽  
T. A. Fiorentin ◽  
H. Göllinger

The use of actor-critic algorithms can improve the controllers currently implemented in automotive applications. This method combines reinforcement learning (RL) and neural networks, making it possible to control nonlinear systems with real-time capability. Actor-critic algorithms have already been applied successfully in different controllers, including autonomous driving, antilock braking systems (ABS), and electronic stability control (ESC). However, in current research, virtual environments are used for the training process instead of real plants for obtaining the datasets. This limitation stems from the trial-and-error nature of the training process, which poses considerable risks if the controller acts directly on the real plant. Therefore, the present research proposes and evaluates an open-loop training process, which permits data acquisition without control interaction and an open-loop training of the neural networks. The performance of the trained controllers is evaluated by a design of experiments (DOE) to understand how it is affected by the generated dataset. The results demonstrate a successful application of the open-loop training architecture. The controller is able to maintain the slip ratio at adequate levels during maneuvers on different surfaces, including surfaces not used during the training process. The actor neural network is also able to identify the different surfaces and adjust the acceleration profile according to the characteristics of each one.
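To make the ideas concrete, here is a heavily hedged sketch (assumptions only, not the paper's architecture) of the slip ratio the controller regulates and of an offline, deterministic-policy-gradient-style actor update computed from a pre-recorded batch of states, i.e. without the controller ever acting on the real plant.

```python
import torch
import torch.nn as nn

# Assumed state: [slip ratio, wheel speed, vehicle speed]; action: one drive/brake command.
actor = nn.Sequential(nn.Linear(3, 64), nn.Tanh(), nn.Linear(64, 1), nn.Tanh())
critic = nn.Sequential(nn.Linear(4, 64), nn.ReLU(), nn.Linear(64, 1))
actor_opt = torch.optim.Adam(actor.parameters(), lr=1e-3)

def slip_ratio(vehicle_speed: float, wheel_speed: float, wheel_radius: float) -> float:
    # lambda = (v - omega * r) / v during braking; guarded near standstill.
    if vehicle_speed <= 1e-3:
        return 0.0
    return (vehicle_speed - wheel_speed * wheel_radius) / vehicle_speed

def offline_actor_step(states: torch.Tensor) -> float:
    # One actor update computed purely from logged data ("open-loop" training):
    # raise the critic's value of the actor's action at the recorded states.
    loss = -critic(torch.cat([states, actor(states)], dim=-1)).mean()
    actor_opt.zero_grad()
    loss.backward()
    actor_opt.step()
    return loss.item()
```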


2021 ◽  
Vol 11 (9) ◽  
pp. 3948
Author(s):  
Aye Aye Maw ◽  
Maxim Tyan ◽  
Tuan Anh Nguyen ◽  
Jae-Woo Lee

Path planning algorithms are of paramount importance in guidance and collision avoidance systems, providing trustworthiness and safety for the operation of autonomous unmanned aerial vehicles (UAVs). Previous works presented different approaches that mostly focus on shortest-path discovery without sufficient consideration of local planning and collision avoidance. In this paper, we propose a hybrid path planning algorithm that uses an anytime graph-based algorithm for global planning and deep reinforcement learning for local planning, applied to a real-time mission planning system for an autonomous UAV. In particular, we aim to achieve a highly autonomous UAV mission planning system that adapts to real-world environments containing both static and moving obstacles, providing collision avoidance capabilities. To achieve adaptive behavior for real-world problems, a simulator is required that can imitate real environments for learning. For this reason, the simulator must be sufficiently flexible to allow the UAV to learn about the environment and adapt to real-world conditions. In our scheme, the UAV first learns about the environment via the simulator, and only then is it deployed in the real world. The proposed system is divided into two main parts: optimal flight path generation and collision avoidance. A hybrid path planning approach is developed by combining a graph-based path planning algorithm with a learning-based algorithm for local planning, allowing the UAV to avoid collisions in real time. The global path planning problem is solved in the first stage using a novel anytime incremental search algorithm called improved Anytime Dynamic A* (iADA*). A reinforcement learning method is used to carry out local planning between waypoints and avoid any obstacles within the environment. The developed hybrid path planning system was investigated and validated in an AirSim environment. A number of simulations and experiments were performed using the AirSim platform to demonstrate the effectiveness of the proposed system for an autonomous UAV. This study helps expand the existing research area of designing efficient and safe path planning algorithms for UAVs.
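The two-layer structure the abstract describes can be sketched schematically as follows (illustrative assumptions only: the stand-in planners below are placeholders, not iADA* or the paper's learned local policy). A global planner supplies waypoints once, and a local policy handles each leg in real time.

```python
from typing import Callable, List, Tuple

Waypoint = Tuple[float, float, float]  # (x, y, altitude), an assumed representation

def fly_mission(global_plan: Callable[[Waypoint, Waypoint], List[Waypoint]],
                local_policy: Callable[[Waypoint, Waypoint], None],
                start: Waypoint, goal: Waypoint) -> None:
    # Stage 1: global path generation (the paper uses the anytime
    # incremental search iADA*; any planner with this signature works here).
    waypoints = global_plan(start, goal)
    # Stage 2: a learned local planner flies each leg, reacting to
    # static and moving obstacles between consecutive waypoints.
    for wp_from, wp_to in zip(waypoints[:-1], waypoints[1:]):
        local_policy(wp_from, wp_to)

if __name__ == "__main__":
    straight_line = lambda s, g: [s, g]                     # stand-in global planner
    report_leg = lambda a, b: print(f"flying {a} -> {b}")   # stand-in local policy
    fly_mission(straight_line, report_leg, (0.0, 0.0, 10.0), (50.0, 20.0, 10.0))
```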


Aerospace ◽  
2021 ◽  
Vol 8 (9) ◽  
pp. 258
Author(s):  
Daichi Wada ◽  
Sergio A. Araujo-Estrada ◽  
Shane Windsor

Nonlinear flight controllers for fixed-wing unmanned aerial vehicles (UAVs) can potentially be developed using deep reinforcement learning. However, there is often a reality gap between the simulation models used to train these controllers and the real world. This study experimentally investigated the application of deep reinforcement learning to the pitch control of a UAV in wind tunnel tests, with a particular focus on the effect of time delays on flight controller performance. Multiple neural networks were trained in simulation with different assumed time delays and then wind tunnel tested. The neural networks trained with shorter delays tended to be susceptible to delay in the real tests and to produce fluctuating behaviour. The neural networks trained with longer delays behaved more conservatively and did not produce oscillations, but suffered steady-state errors under some conditions due to unmodeled frictional effects. These results highlight the importance of performing physical experiments to validate controller performance and show how the training approach used with reinforcement learning needs to be robust to reality gaps between simulation and the real world.
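One simple way the assumed time delay could be injected into a training simulation (a sketch under assumptions, not the authors' setup) is to queue commands for a fixed number of control steps before they reach the simulated actuator:

```python
from collections import deque

class DelayedActuator:
    """Emits each command only after a fixed number of control steps."""
    def __init__(self, delay_steps: int, neutral_command: float = 0.0):
        # Pre-fill the queue so the first `delay_steps` outputs are neutral.
        self.buffer = deque([neutral_command] * delay_steps)

    def step(self, command: float) -> float:
        # Push the newest command and emit the one issued `delay_steps` ago.
        self.buffer.append(command)
        return self.buffer.popleft()

# Example: with a 3-step delay, the controller's output only takes effect
# three control periods later, as in the longer-delay training cases.
actuator = DelayedActuator(delay_steps=3)
for t, cmd in enumerate([0.1, 0.2, 0.3, 0.4, 0.5]):
    applied = actuator.step(cmd)
    print(t, cmd, applied)
```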


2021 ◽  
Vol 22 (2) ◽  
pp. 12-18 ◽  
Author(s):  
Hua Wei ◽  
Guanjie Zheng ◽  
Vikash Gayah ◽  
Zhenhui Li

Traffic signal control is an important and challenging real-world problem that has recently received a large amount of interest from both the transportation and computer science communities. In this survey, we focus on recent advances in using reinforcement learning (RL) techniques to solve the traffic signal control problem. We classify the known approaches based on the RL techniques they use and provide a review of existing models with an analysis of their advantages and disadvantages. Moreover, we give an overview of the simulation environments and experimental settings that have been developed to evaluate traffic signal control methods. Finally, we explore future directions in the area of RL-based traffic signal control methods. We hope this survey can provide insights to researchers dealing with real-world applications in intelligent transportation systems.
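For readers new to the formulation surveyed here, the toy sketch below shows the most basic variant: a tabular Q-learning agent picking the next signal phase from a discretized queue-length state. The methods reviewed in the survey use richer state definitions and deep networks; all names, constants, and the reward choice below are assumptions for illustration.

```python
import random
from collections import defaultdict

ALPHA, GAMMA, EPSILON = 0.1, 0.95, 0.1
N_PHASES = 4  # e.g. NS-through, NS-left, EW-through, EW-left

q_table = defaultdict(lambda: [0.0] * N_PHASES)

def choose_phase(state) -> int:
    # Epsilon-greedy selection over the available signal phases.
    if random.random() < EPSILON:
        return random.randrange(N_PHASES)
    values = q_table[state]
    return values.index(max(values))

def update(state, phase, reward, next_state) -> None:
    # Standard Q-learning target; the reward is typically the negative
    # queue length or vehicle delay observed over the last control interval.
    best_next = max(q_table[next_state])
    q_table[state][phase] += ALPHA * (reward + GAMMA * best_next - q_table[state][phase])
```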


2021 ◽  
Author(s):  
Gabriel Dulac-Arnold ◽  
Nir Levine ◽  
Daniel J. Mankowitz ◽  
Jerry Li ◽  
Cosmin Paduraru ◽  
...  
