scholarly journals Cooperative Information Sharing to Improve Distributed Learning in Multi-Agent Systems

2005 ◽  
Vol 24 ◽  
pp. 407-463 ◽  
Author(s):  
P. S. Dutta ◽  
N. R. Jennings ◽  
L. Moreau

Effective coordination of agents' actions in partially-observable domains is a major challenge of multi-agent systems research. To address this, many researchers have developed techniques that allow the agents to make decisions based on estimates of the states and actions of other agents that are typically learnt using some form of machine learning algorithm. Nevertheless, many of these approaches fail to provide an actual means by which the necessary information is made available so that the estimates can be learnt. To this end, we argue that cooperative communication of state information between agents is one such mechanism. However, in a dynamically changing environment, the accuracy and timeliness of this communicated information determine the fidelity of the learned estimates and the usefulness of the actions taken based on these. Given this, we propose a novel information-sharing protocol, post-task-completion sharing, for the distribution of state information. We then show, through a formal analysis, the improvement in the quality of estimates produced using our strategy over the widely used protocol of sharing information between nearest neighbours. Moreover, communication heuristics designed around our information-sharing principle are subjected to empirical evaluation along with other benchmark strategies (including Littman's Q-routing and Stone's TPOT-RL) in a simulated call-routing application. These studies, conducted across a range of environmental settings, show that, compared to the different benchmarks used, our strategy generates an improvement of up to 60% in the call connection rate; of more than 1000% in the ability to connect long-distance calls; and incurs as low as 0.25 of the message overhead.

2018 ◽  
Vol 41 (7) ◽  
pp. 1957-1964 ◽  
Author(s):  
Ming-Can Fan ◽  
Miaomiao Wang

This paper investigates the leaderless and leader-following consensus problem for a class of second-order multi-agent systems subject to input saturation, that is, the control input is required to be a priori bounded. Moreover, the control coefficients are assumed to be unavailable, which cannot be lower or upper bounded by any known constants. Distributed consensus protocols are proposed based only on agents’ own velocity state information and relative position state information among neighbouring agents and the leader. By virtue of the adaptive control technique, algebraic graph theory and Barbalat’s lemma, it is proved that the states of the multi-agent systems can achieve consensus under the assumption that the interconnection topology is undirected and connected. Finally, two simulation examples are provided to illustrate the effectiveness of the theoretical results.


2020 ◽  
pp. 160-183
Author(s):  
Steven Walczak

The development of multiple agent systems faces many challenges, including agent coordination and collaboration on tasks. Minsky's The Society of Mind provides a conceptual view for addressing these multi-agent system problems. A new classification ontology is introduced for comparing multi-agent systems. Next, a new framework called the Society of Agents is developed from Minsky's conceptual foundation. A Society of Agents framework-based problem-solving and a Game Society is developed and applied to the domain of single player logic puzzles and two player games. The Game Society solved 100% of presented Sudoku and Kakuro problems and never lost a tic-tac-toe game. The advantage of the Society of Agents approach is the efficient re-utilization of agents across multiple independent game domain problems and a centralized problem-solving architecture with efficient cross-agent information sharing.


Studia Logica ◽  
2019 ◽  
Vol 108 (1) ◽  
pp. 129-158 ◽  
Author(s):  
Réka Markovich

Abstract Hohfeld’s analysis (Fundamental Legal Conceptions as Applied in Judicial Reasoning, 1913, 1917) on the different types of rights and duties is highly influential in analytical legal theory, and it is considered as a fundamental theory in AI&Law and normative multi-agent systems. Yet a century later, the formalization of this theory remains, in various ways, unresolved. In this paper I provide a formal analysis of how the working of a system containing Hohfeldian rights and duties can be delineated. This formalization starts from using the same tools as the classical ones by Kanger and Lindahl used, but instead of focusing on the algebraic features of rights and duties, it aims at providing a comprehensive analysis of what these rights and duties actually are and how they behave and at saying something substantial on Power too—maintaining all along the Hohfeldian intentions that these rights and duties are sui generis and inherently relational.


2002 ◽  
Vol 17 (3) ◽  
pp. 295-301
Author(s):  
MARK DINVERNO ◽  
MICHAEL LUCK ◽  
UKMAS 2001 Contributors

Author(s):  
Yuan Lin ◽  
Nicole Abaid

Current models for multi-agent systems almost exclusively employ sensory modalities such as vision where agents passively receive information from the environment. Active sensing, defined as acquiring environmental information using self-generated signals, allows widespread sharing of sensory information among agents and thus gives rise to more complex interactions within engineered multi-agent systems using radar or sonar, for example. In nature, bat swarms are animal groups that successfully employ active sensing with each individual broadcasting echolocation pulses in the environment and responding to echoes. Bats flying in groups may cope with the dense sound environment through their behavior; one hypothesized strategy is the cessation of echolocation pulses in the presence of peers and “eavesdropping”, which has been demonstrated in controlled laboratory settings. In this work, we build a self-propelled-particle model with each agent avoiding obstacles in three dimensions by emitting echolocation pulses of a unique frequency. We implement a bat-inspired rule of eavesdropping to take advantage of information sharing via active sensing while reducing the energy expenditure of the group. Through a simulation study, we show that agents indeed capitalize on peers’ pulses and echoes for obstacle avoidance and we find a maximum of this effect for a set of model parameters which relate to the domain size.


Author(s):  
Sahar Yazdani ◽  
Mohammad Haeri

This paper studies the leader–follower flocking of multi-agent systems for the linear second-order dynamics, subject to the external disturbance problem. It is assumed that the dynamic of the leader is Lipschitz-type. Also, the velocity is the output of the system, and full-state information is not available for feedback. A distributed full-order observer is employed to estimate every agent's states and external disturbance. A control protocol for each agent is designed based on the measurement of its output/velocity and relative velocity of its neighbors. Under the proposed protocol, the velocity convergence of whole agents to the velocity of the virtual leader is guaranteed as well as the connectivity of network and collision avoidance among agents are ensured. Finally, a simulation example is provided to show the effectiveness of the results.


Sign in / Sign up

Export Citation Format

Share Document