Dynamic attention network for multi-UAV reinforcement learning

2021 ◽  
Author(s):  
Dongsheng Xu ◽  
Shang Wu
Author(s):  
Yi Zhou ◽  
Xiaoyong Ma ◽  
Shuting Hu ◽  
Danyang Zhou ◽  
Nan Cheng ◽  
...  

Author(s):  
Abegaz Mohammed Seid ◽  
Gordon Owusu Boateng ◽  
Stephen Anokye ◽  
Thomas Kwantwi ◽  
Guolin Sun ◽  
...  

Author(s):  
Omar Sami Oubbati ◽  
Mohammed Atiquzzaman ◽  
Abderrahmane Lakas ◽  
Abdullah Baz ◽  
Hosam Alhakami ◽  
...  

Sensors ◽  
2020 ◽  
Vol 20 (16) ◽  
pp. 4546
Author(s):  
Weiwei Zhao ◽  
Hairong Chu ◽  
Xikui Miao ◽  
Lihong Guo ◽  
Honghai Shen ◽  
...  

Multiple unmanned aerial vehicle (UAV) collaboration has great potential. To increase the intelligence and environmental adaptability of multi-UAV control, we study the application of deep reinforcement learning algorithms in the field of multi-UAV cooperative control. Aiming at the problem of a non-stationary environment caused by the change of learning agent strategy in reinforcement learning in a multi-agent environment, the paper presents an improved multiagent reinforcement learning algorithm—the multiagent joint proximal policy optimization (MAJPPO) algorithm with the centralized learning and decentralized execution. This algorithm uses the moving window averaging method to make each agent obtain a centralized state value function, so that the agents can achieve better collaboration. The improved algorithm enhances the collaboration and increases the sum of reward values obtained by the multiagent system. To evaluate the performance of the algorithm, we use the MAJPPO algorithm to complete the task of multi-UAV formation and the crossing of multiple-obstacle environments. To simplify the control complexity of the UAV, we use the six-degree of freedom and 12-state equations of the dynamics model of the UAV with an attitude control loop. The experimental results show that the MAJPPO algorithm has better performance and better environmental adaptability.


IEEE Access ◽  
2019 ◽  
Vol 7 ◽  
pp. 146264-146272 ◽  
Author(s):  
Han Qie ◽  
Dianxi Shi ◽  
Tianlong Shen ◽  
Xinhai Xu ◽  
Yuan Li ◽  
...  

2019 ◽  
Vol 9 (22) ◽  
pp. 4964 ◽  
Author(s):  
Yue ◽  
Guan ◽  
Wang

In this paper, the important topic of cooperative searches for multi-dynamic targets in unknown sea areas by unmanned aerial vehicles (UAVs) is studied based on a reinforcement learning (RL) algorithm. A novel multi-UAV sea area search map is established, in which models of the environment, UAV dynamics, target dynamics, and sensor detection are involved. Then, the search map is updated and extended using the concept of the territory awareness information map. Finally, according to the search efficiency function, a reward and punishment function is designed, and an RL method is used to generate a multi-UAV cooperative search path online. The simulation results show that the proposed algorithm could effectively perform the search task in the sea area with no prior information.


Sign in / Sign up

Export Citation Format

Share Document