Dynamic attention network for multi-UAV reinforcement learning

QoE-driven Adaptive Deployment Strategy of Multi-UAV Networks Based on Hybrid Deep Reinforcement Learning

IEEE Internet of Things Journal ◽

10.1109/jiot.2021.3066368 ◽

2021 ◽

pp. 1-1

Author(s):

Yi Zhou ◽

Xiaoyong Ma ◽

Shuting Hu ◽

Danyang Zhou ◽

Nan Cheng ◽

...

Keyword(s):

Reinforcement Learning ◽

Deployment Strategy ◽

Multi Uav

Download Full-text

Collaborative Computation Offloading and Resource Allocation in Multi-UAV Assisted IoT Networks: A Deep Reinforcement Learning Approach

IEEE Internet of Things Journal ◽

10.1109/jiot.2021.3063188 ◽

2021 ◽

pp. 1-1

Author(s):

Abegaz Mohammed Seid ◽

Gordon Owusu Boateng ◽

Stephen Anokye ◽

Thomas Kwantwi ◽

Guolin Sun ◽

...

Keyword(s):

Resource Allocation ◽

Reinforcement Learning ◽

Computation Offloading ◽

Learning Approach ◽

Multi Uav

Download Full-text

Multi-UAV Assisted Offloading Optimization: A Game Combined Reinforcement Learning Approach

IEEE Communications Letters ◽

10.1109/lcomm.2021.3078469 ◽

2021 ◽

pp. 1-1

Author(s):

Ang Gao ◽

Qi Wang ◽

Kaiyue Chen ◽

Wei Liang

Keyword(s):

Reinforcement Learning ◽

Learning Approach ◽

Multi Uav

Download Full-text

A Reinforcement Learning-based Decentralized Method of Avoiding Multi-UAV Collision in 3-D Airspace

Proceedings of the 2019 3rd International Conference on Computer Science and Artificial Intelligence ◽

10.1145/3374587.3374599 ◽

2019 ◽

Author(s):

Jian Sun ◽

Yingzhou Zhang

Keyword(s):

Reinforcement Learning ◽

Multi Uav

Download Full-text

A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems

Communications in Computer and Information Science - Artificial Intelligence Algorithms and Applications ◽

10.1007/978-981-15-5577-0_51 ◽

2020 ◽

pp. 636-650 ◽

Cited By ~ 1

Author(s):

Bo Peng ◽

Jiahai Wang ◽

Zizhen Zhang

Keyword(s):

Reinforcement Learning ◽

Vehicle Routing ◽

Learning Algorithm ◽

Vehicle Routing Problems ◽

Routing Problems ◽

Attention Model ◽

Dynamic Attention ◽

Reinforcement Learning Algorithm

Download Full-text

Multi-UAV-enabled AoI-aware WPCN: A Multi-agent Reinforcement Learning Strategy

IEEE INFOCOM 2021 - IEEE Conference on Computer Communications Workshops (INFOCOM WKSHPS) ◽

10.1109/infocomwkshps51825.2021.9484496 ◽

2021 ◽

Author(s):

Omar Sami Oubbati ◽

Mohammed Atiquzzaman ◽

Abderrahmane Lakas ◽

Abdullah Baz ◽

Hosam Alhakami ◽

...

Keyword(s):

Reinforcement Learning ◽

Learning Strategy ◽

Multi Agent ◽

Multi Uav

Download Full-text

Research on the Multiagent Joint Proximal Policy Optimization Algorithm Controlling Cooperative Fixed-Wing UAV Obstacle Avoidance

Sensors ◽

10.3390/s20164546 ◽

2020 ◽

Vol 20 (16) ◽

pp. 4546

Author(s):

Weiwei Zhao ◽

Hairong Chu ◽

Xikui Miao ◽

Lihong Guo ◽

Honghai Shen ◽

...

Keyword(s):

Reinforcement Learning ◽

Attitude Control ◽

Cooperative Control ◽

Learning Algorithm ◽

State Equations ◽

Learning Agent ◽

Environmental Adaptability ◽

Decentralized Execution ◽

Policy Optimization ◽

Multi Uav

Multiple unmanned aerial vehicle (UAV) collaboration has great potential. To increase the intelligence and environmental adaptability of multi-UAV control, we study the application of deep reinforcement learning algorithms in the field of multi-UAV cooperative control. Aiming at the problem of a non-stationary environment caused by the change of learning agent strategy in reinforcement learning in a multi-agent environment, the paper presents an improved multiagent reinforcement learning algorithm—the multiagent joint proximal policy optimization (MAJPPO) algorithm with the centralized learning and decentralized execution. This algorithm uses the moving window averaging method to make each agent obtain a centralized state value function, so that the agents can achieve better collaboration. The improved algorithm enhances the collaboration and increases the sum of reward values obtained by the multiagent system. To evaluate the performance of the algorithm, we use the MAJPPO algorithm to complete the task of multi-UAV formation and the crossing of multiple-obstacle environments. To simplify the control complexity of the UAV, we use the six-degree of freedom and 12-state equations of the dynamics model of the UAV with an attitude control loop. The experimental results show that the MAJPPO algorithm has better performance and better environmental adaptability.

Download Full-text

Joint Optimization of Multi-UAV Target Assignment and Path Planning Based on Multi-Agent Reinforcement Learning

IEEE Access ◽

10.1109/access.2019.2943253 ◽

2019 ◽

Vol 7 ◽

pp. 146264-146272 ◽

Cited By ~ 10

Author(s):

Han Qie ◽

Dianxi Shi ◽

Tianlong Shen ◽

Xinhai Xu ◽

Yuan Li ◽

...

Keyword(s):

Reinforcement Learning ◽

Path Planning ◽

Joint Optimization ◽

Target Assignment ◽

Multi Agent ◽

Multi Uav

Download Full-text

Multi-UAV Target-Finding in Simulated Indoor Environments using Deep Reinforcement Learning

2020 IEEE Aerospace Conference ◽

10.1109/aero47225.2020.9172262 ◽

2020 ◽

Author(s):

Ory Walker ◽

Fernando Vanegas ◽

Felipe Gonzalez ◽

Sven Koenig

Keyword(s):

Reinforcement Learning ◽

Indoor Environments ◽

Multi Uav

Download Full-text

A Novel Searching Method Using Reinforcement Learning Scheme for Multi-UAVs in Unknown Environments

Applied Sciences ◽

10.3390/app9224964 ◽

2019 ◽

Vol 9 (22) ◽

pp. 4964 ◽

Cited By ~ 4

Author(s):

Yue ◽

Guan ◽

Wang

Keyword(s):

Reinforcement Learning ◽

Prior Information ◽

Search Task ◽

Efficiency Function ◽

Awareness Information ◽

Simulation Results ◽

Searching Method ◽

Sea Area ◽

Sensor Detection ◽

Multi Uav

In this paper, the important topic of cooperative searches for multi-dynamic targets in unknown sea areas by unmanned aerial vehicles (UAVs) is studied based on a reinforcement learning (RL) algorithm. A novel multi-UAV sea area search map is established, in which models of the environment, UAV dynamics, target dynamics, and sensor detection are involved. Then, the search map is updated and extended using the concept of the territory awareness information map. Finally, according to the search efficiency function, a reward and punishment function is designed, and an RL method is used to generate a multi-UAV cooperative search path online. The simulation results show that the proposed algorithm could effectively perform the search task in the sea area with no prior information.

Download Full-text