Interval-Based Markov Decision Processes for Regulating Interactions Between Two Agents in Multi-agent Systems

Online learning for Markov decision processes applied to multi-agent systems

2017 IEEE 56th Annual Conference on Decision and Control (CDC) ◽

10.1109/cdc.2017.8263879 ◽

2017 ◽

Author(s):

Mahmoud El Chamie ◽

Behcet Acikmese ◽

Mehran Mesbahi

Keyword(s):

Online Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Multi Agent Systems ◽

Agent Systems ◽

Markov Decision ◽

Multi Agent

Download Full-text

Solving Transition Independent Decentralized Markov Decision Processes

Journal of Artificial Intelligence Research ◽

10.1613/jair.1497 ◽

2004 ◽

Vol 22 ◽

pp. 423-455 ◽

Cited By ~ 51

Author(s):

R. Becker ◽

S. Zilberstein ◽

V. Lesser ◽

C. V. Goldman

Keyword(s):

Markov Decision Processes ◽

Optimal Algorithm ◽

Decision Processes ◽

Specific Class ◽

Multi Agent Systems ◽

Sequential Decision ◽

Anytime Algorithm ◽

Reward Function ◽

Markov Decision ◽

Multi Agent

Formal treatment of collaborative multi-agent systems has been lagging behind the rapid progress in sequential decision making by individual agents. Recent work in the area of decentralized Markov Decision Processes (MDPs) has contributed to closing this gap, but the computational complexity of these models remains a serious obstacle. To overcome this complexity barrier, we identify a specific class of decentralized MDPs in which the agents' transitions are independent. The class consists of independent collaborating agents that are tied together through a structured global reward function that depends on all of their histories of states and actions. We present a novel algorithm for solving this class of problems and examine its properties, both as an optimal algorithm and as an anytime algorithm. To our best knowledge, this is the first algorithm to optimally solve a non-trivial subclass of decentralized MDPs. It lays the foundation for further work in this area on both exact and approximate algorithms.

Download Full-text

Using Intelligent Multi-Agent Systems to Model and Foster Self-Regulated Learning: A Theoretically-Based Approach Using Markov Decision Process

2013 IEEE 27th International Conference on Advanced Information Networking and Applications (AINA) ◽

10.1109/aina.2013.70 ◽

2013 ◽

Cited By ~ 1

Author(s):

B. Khosravifar ◽

F. Bouchet ◽

R. Feyzi-Behnagh ◽

R. Azevedo ◽

J. M. Harley

Keyword(s):

Markov Decision Process ◽

Decision Process ◽

Multi Agent Systems ◽

Agent Systems ◽

Self Regulated Learning ◽

Regulated Learning ◽

Markov Decision ◽

Multi Agent

Download Full-text

Value Function Transfer for Deep Multi-Agent Reinforcement Learning Based on N-Step Returns

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/65 ◽

2019 ◽

Cited By ~ 2

Author(s):

Yong Liu ◽

Yujing Hu ◽

Yang Gao ◽

Yingfeng Chen ◽

Changjie Fan

Keyword(s):

Reinforcement Learning ◽

Knowledge Transfer ◽

Value Function ◽

Single Agent ◽

Multi Agent Systems ◽

Agent Systems ◽

Markov Decision ◽

Dimensional State Space ◽

Multi Agent ◽

Function Transfer

Many real-world problems, such as robot control and soccer game, are naturally modeled as sparse-interaction multi-agent systems. Reutilizing single-agent knowledge in multi-agent systems with sparse interactions can greatly accelerate the multi-agent learning process. Previous works rely on bisimulation metric to define Markov decision process (MDP) similarity for controlling knowledge transfer. However, bisimulation metric is costly to compute and is not suitable for high-dimensional state space problems. In this work, we propose more scalable transfer learning methods based on a novel MDP similarity concept. We start by defining the MDP similarity based on the N-step return (NSR) values of an MDP. Then, we propose two knowledge transfer methods based on deep neural networks called direct value function transfer and NSR-based value function transfer. We conduct experiments in image-based grid world, multi-agent particle environment (MPE) and Ms. Pac-Man game. The results indicate that the proposed methods can significantly accelerate multi-agent reinforcement learning and meanwhile get better asymptotic performance.

Download Full-text

A Novel Heterogeneous Swarm Reinforcement Learning Method for Sequential Decision Making Problems

Machine Learning and Knowledge Extraction ◽

10.3390/make1020035 ◽

2019 ◽

Vol 1 (2) ◽

pp. 590-610

Author(s):

Zohreh Akbari ◽

Rainer Unland

Keyword(s):

Decision Making ◽

Reinforcement Learning ◽

Single Agent ◽

Sequential Decision Making ◽

Multi Agent Systems ◽

Sequential Decision ◽

Agent Systems ◽

Novel Approach ◽

Markov Decision ◽

Multi Agent

Sequential Decision Making Problems (SDMPs) that can be modeled as Markov Decision Processes can be solved using methods that combine Dynamic Programming (DP) and Reinforcement Learning (RL). Depending on the problem scenarios and the available Decision Makers (DMs), such RL algorithms may be designed for single-agent systems or multi-agent systems that either consist of agents with individual goals and decision making capabilities, which are influenced by other agent’s decisions, or behave as a swarm of agents that collaboratively learn a single objective. Many studies have been conducted in this area; however, when concentrating on available swarm RL algorithms, one obtains a clear view of the areas that still require attention. Most of the studies in this area focus on homogeneous swarms and so far, systems introduced as Heterogeneous Swarms (HetSs) merely include very few, i.e., two or three sub-swarms of homogeneous agents, which either, according to their capabilities, deal with a specific sub-problem of the general problem or exhibit different behaviors in order to reduce the risk of bias. This study introduces a novel approach that allows agents, which are originally designed to solve different problems and hence have higher degrees of heterogeneity, to behave as a swarm when addressing identical sub-problems. In fact, the affinity between two agents, which measures the compatibility of agents to work together towards solving a specific sub-problem, is used in designing a Heterogeneous Swarm RL (HetSRL) algorithm that allows HetSs to solve the intended SDMPs.

Download Full-text

CHQ: a multi-agent reinforcement learning scheme for partially observable markov decision processes

Proceedings. IEEE/WIC/ACM International Conference on Intelligent Agent Technology, 2004. (IAT 2004). ◽

10.1109/iat.2004.1342918 ◽

2004 ◽

Author(s):

H. Osada ◽

S. Fujita

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Learning Scheme ◽

Markov Decision ◽

Multi Agent ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text

CHQ: A Multi-Agent Reinforcement Learning Scheme for Partially Observable Markov Decision Processes

IEICE Transactions on Information and Systems ◽

10.1093/ietisy/e88-d.5.1004 ◽

2005 ◽

Vol E88-D (5) ◽

pp. 1004-1011 ◽

Cited By ~ 3

Author(s):

H. OSADA

Keyword(s):

Reinforcement Learning ◽

Markov Decision Processes ◽

Decision Processes ◽

Learning Scheme ◽

Markov Decision ◽

Multi Agent ◽

Partially Observable Markov ◽

Partially Observable

Download Full-text

Communication in multi-agent Markov decision processes

Proceedings Fourth International Conference on MultiAgent Systems ◽

10.1109/icmas.2000.858528 ◽

2002 ◽

Cited By ~ 3

Author(s):

Ping Xuan ◽

V. Lesser ◽

S. Zilberstein

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Markov Decision ◽

Multi Agent

Download Full-text

A robust crew pairing based on Multi-agent Markov Decision Processes

2014 Second World Conference on Complex Systems (WCCS) ◽

10.1109/icocs.2014.7060940 ◽

2014 ◽

Cited By ~ 3

Author(s):

Oussama Aoun ◽

Abdellatif El Afia

Keyword(s):

Markov Decision Processes ◽

Decision Processes ◽

Markov Decision ◽

Crew Pairing ◽

Multi Agent

Download Full-text

A Value-based Trust Assessment Model for Multi-agent Systems

Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence ◽

10.24963/ijcai.2019/28 ◽

2019 ◽

Author(s):

Kinzang Chhogyal ◽

Abhaya Nayak ◽

Aditya Ghose ◽

Hoa K. Dam

Keyword(s):

Simple Approach ◽

Assessment Model ◽

Multi Agent Systems ◽

Agent Systems ◽

Two Agents ◽

A Value ◽

Multi Agent

An agent's assessment of its trust in another agent is commonly taken to be a measure of the reliability/predictability of the latter's actions. It is based on the trustor's past observations of the behaviour of the trustee and requires no knowledge of the inner-workings of the trustee. However, in situations that are new or unfamiliar, past observations are of little help in assessing trust. In such cases, knowledge about the trustee can help. A particular type of knowledge is that of values - things that are important to the trustor and the trustee. In this paper, based on the premise that the more values two agents share, the more they should trust one another, we propose a simple approach to trust assessment between agents based on values, taking into account if agents trust cautiously or boldly, and if they depend on others in carrying out a task.

Download Full-text