Decentralized Multi-Agent Control of a Manipulator in Continuous Task Learning

Asad Ali Shahid; Jorge Said Vidal Sesin; Damjan Pecioski; Francesco Braghin; Dario Piga; Loris Roveda

doi:10.3390/app112110227

Decentralized Multi-Agent Control of a Manipulator in Continuous Task Learning

Applied Sciences ◽

10.3390/app112110227 ◽

2021 ◽

Vol 11 (21) ◽

pp. 10227

Author(s):

Asad Ali Shahid ◽

Jorge Said Vidal Sesin ◽

Damjan Pecioski ◽

Francesco Braghin ◽

Dario Piga ◽

...

Keyword(s):

Learning Process ◽

Degrees Of Freedom ◽

Action Learning ◽

Single Agent ◽

Computational Effort ◽

Control Action ◽

Multiple Agents ◽

Task Learning ◽

Multi Agent ◽

Agent Control

Many real-world tasks require multiple agents to work together. When talking about multiple agents in robotics, it is usually referenced to multiple manipulators in collaboration to solve a given task, where each one is controlled by a single agent. However, due to the increasing development of modular and re-configurable robots, it is also important to investigate the possibility of implementing multi-agent controllers that learn how to manage the manipulator’s degrees of freedom (DoF) in separated clusters for the execution of a given application (e.g., being able to face faults or, partially, new kinematics configurations). Within this context, this paper focuses on the decentralization of the robot control action learning and (re)execution considering a generic multi-DoF manipulator. Indeed, the proposed framework employs a multi-agent paradigm and investigates how such a framework impacts the control action learning process. Multiple variations of the multi-agent framework have been proposed and tested in this research, comparing the achieved performance w.r.t. a centralized (i.e., single-agent) control action learning framework, previously proposed by some of the authors. As a case study, a manipulation task (i.e., grasping and lifting) of an unknown object (to the robot controller) has been considered for validation, employing a Franka EMIKA panda robot. The MuJoCo environment has been employed to implement and test the proposed multi-agent framework. The achieved results show that the proposed decentralized approach is capable of accelerating the learning process at the beginning with respect to the single-agent framework while also reducing the computational effort. In fact, when decentralizing the controller, it is shown that the number of variables involved in the action space can be efficiently separated into several groups and several agents. This simplifies the original complex problem into multiple ones, efficiently improving the task learning process.

Download Full-text

Multi-Agent Pathfinding with Continuous Time

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/6 ◽

2019 ◽

Cited By ~ 6

Author(s):

Anton Andreychuk ◽

Konstantin Yakovlev ◽

Dor Atzmon ◽

Roni Stern

Keyword(s):

Path Planning ◽

Continuous Time ◽

Single Agent ◽

Optimal Solutions ◽

Prior Work ◽

Multiple Agents ◽

Multi Agent ◽

Pros And Cons ◽

Planning Algorithms

Multi-Agent Pathfinding (MAPF) is the problem of finding paths for multiple agents such that every agent reaches its goal and the agents do not collide. Most prior work on MAPF were on grids, assumed agents' actions have uniform duration, and that time is discretized into timesteps. In this work, we propose a MAPF algorithm that do not assume any of these assumptions, is complete, and provides provably optimal solutions. This algorithm is based on a novel combination of Safe Interval Path Planning (SIPP), a continuous time single agent planning algorithms, and Conflict-Based Search (CBS). We analyze this algorithm, discuss its pros and cons, and evaluate it experimentally on several standard benchmarks.

Download Full-text

Collective stabilisation of multi-agent dynamical systems with single agent control

IET Control Theory and Applications ◽

10.1049/iet-cta.2014.0667 ◽

2015 ◽

Vol 9 (6) ◽

pp. 929-934 ◽

Cited By ~ 2

Author(s):

Myunggon Yoon

Keyword(s):

Dynamical Systems ◽

Single Agent ◽

Multi Agent ◽

Agent Control

Download Full-text

Multi-agent deep reinforcement learning: a survey

Artificial Intelligence Review ◽

10.1007/s10462-021-09996-w ◽

2021 ◽

Author(s):

Sven Gronauer ◽

Klaus Diepold

Keyword(s):

Reinforcement Learning ◽

Real World ◽

Single Agent ◽

Research Area ◽

Learning Gains ◽

Multiple Agents ◽

Agent Behavior ◽

Multi Agent ◽

Training Schemes ◽

Future Work

AbstractThe advances in reinforcement learning have recorded sublime success in various domains. Although the multi-agent domain has been overshadowed by its single-agent counterpart during this progress, multi-agent reinforcement learning gains rapid traction, and the latest accomplishments address problems with real-world complexity. This article provides an overview of the current developments in the field of multi-agent deep reinforcement learning. We focus primarily on literature from recent years that combines deep reinforcement learning methods with a multi-agent scenario. To survey the works that constitute the contemporary landscape, the main contents are divided into three parts. First, we analyze the structure of training schemes that are applied to train multiple agents. Second, we consider the emergent patterns of agent behavior in cooperative, competitive and mixed scenarios. Third, we systematically enumerate challenges that exclusively arise in the multi-agent domain and review methods that are leveraged to cope with these challenges. To conclude this survey, we discuss advances, identify trends, and outline possible directions for future work in this research area.

Download Full-text

Single agent control for multi-agent dynamical consensus systems

IET Control Theory and Applications ◽

10.1049/iet-cta.2011.0236 ◽

2012 ◽

Vol 6 (10) ◽

pp. 1478-1485 ◽

Cited By ~ 11

Author(s):

M.-G. Yoon

Keyword(s):

Single Agent ◽

Multi Agent ◽

Agent Control

Download Full-text

Multi-agent Control of Customer^|^apos;s Loads for Influence of Renewable Energy on the Power System

IEEJ Transactions on Electronics Information and Systems ◽

10.1541/ieejeiss.134.1515 ◽

2014 ◽

Vol 134 (10) ◽

pp. 1515-1523

Author(s):

Akihiro Ogawa ◽

Kazunari Maki ◽

Kiyoshi Hata ◽

Yasunori Takeuchi ◽

Fumio Ishikawa

Keyword(s):

Renewable Energy ◽

Power System ◽

Multi Agent ◽

Agent Control

Download Full-text

Multi-Agent Reinforcement Learning: A Review of Challenges and Applications

Applied Sciences ◽

10.3390/app11114948 ◽

2021 ◽

Vol 11 (11) ◽

pp. 4948

Author(s):

Lorenzo Canese ◽

Gian Carlo Cardarilli ◽

Luca Di Di Nunzio ◽

Rocco Fazzolari ◽

Daniele Giardino ◽

...

Keyword(s):

Reinforcement Learning ◽

Mathematical Models ◽

Learning Algorithms ◽

Single Agent ◽

Critical Issues ◽

Multi Agent ◽

Pros And Cons ◽

Application Fields

In this review, we present an analysis of the most used multi-agent reinforcement learning algorithms. Starting with the single-agent reinforcement learning algorithms, we focus on the most critical issues that must be taken into account in their extension to multi-agent scenarios. The analyzed algorithms were grouped according to their features. We present a detailed taxonomy of the main multi-agent approaches proposed in the literature, focusing on their related mathematical models. For each algorithm, we describe the possible application fields, while pointing out its pros and cons. The described multi-agent algorithms are compared in terms of the most important characteristics for multi-agent reinforcement learning applications—namely, nonstationarity, scalability, and observability. We also describe the most common benchmark environments used to evaluate the performances of the considered methods.

Download Full-text

Proposed framework for Integration of Optimal Current Injection and Multi-Agent Control of the LV Distribution network

2020 6th IEEE International Energy Conference (ENERGYCon) ◽

10.1109/energycon48941.2020.9236499 ◽

2020 ◽

Author(s):

George. H. Gabriels ◽

Mobolaji O. Windapo ◽

David T. O. Oyedokun ◽

Simona Ruggeri

Keyword(s):

Distribution Network ◽

Current Injection ◽

Optimal Current ◽

Multi Agent ◽

Agent Control

Download Full-text

On-Demand Channel Bonding in Heterogeneous WLANs: A Multi-Agent Deep Reinforcement Learning Approach

Sensors ◽

10.3390/s20102789 ◽

2020 ◽

Vol 20 (10) ◽

pp. 2789 ◽

Cited By ~ 1

Author(s):

Hang Qi ◽

Hao Huang ◽

Zhiqun Hu ◽

Xiangming Wen ◽

Zhaoming Lu

Keyword(s):

Reinforcement Learning ◽

Transmission Rate ◽

Single Agent ◽

Time Of Day ◽

Action Space ◽

Traffic Load ◽

Traffic Demand ◽

Channel Bonding ◽

On Demand ◽

Multi Agent

In order to meet the ever-increasing traffic demand of Wireless Local Area Networks (WLANs), channel bonding is introduced in IEEE 802.11 standards. Although channel bonding effectively increases the transmission rate, the wider channel reduces the number of non-overlapping channels and is more susceptible to interference. Meanwhile, the traffic load differs from one access point (AP) to another and changes significantly depending on the time of day. Therefore, the primary channel and channel bonding bandwidth should be carefully selected to meet traffic demand and guarantee the performance gain. In this paper, we proposed an On-Demand Channel Bonding (O-DCB) algorithm based on Deep Reinforcement Learning (DRL) for heterogeneous WLANs to reduce transmission delay, where the APs have different channel bonding capabilities. In this problem, the state space is continuous and the action space is discrete. However, the size of action space increases exponentially with the number of APs by using single-agent DRL, which severely affects the learning rate. To accelerate learning, Multi-Agent Deep Deterministic Policy Gradient (MADDPG) is used to train O-DCB. Real traffic traces collected from a campus WLAN are used to train and test O-DCB. Simulation results reveal that the proposed algorithm has good convergence and lower delay than other algorithms.

Download Full-text