Training and inferring neural network function with multi-agent reinforcement learning

2019 ◽  
Author(s):  
Matthew Chalk ◽  
Gasper Tkacik ◽  
Olivier Marre

Abstract
A central goal in systems neuroscience is to understand the functions performed by neural circuits. Previous top-down models addressed this question by comparing the behaviour of an ideal model circuit, optimised to perform a given function, with neural recordings. However, this requires guessing in advance what function is being performed, which may not be possible for many neural systems. To address this, we propose a new framework for optimising a recurrent network using multi-agent reinforcement learning (RL). In this framework, a reward function quantifies how desirable each state of the network is for performing a given function. Each neuron is treated as an ‘agent’, which optimises its responses so as to drive the network towards rewarded states. Three applications follow from this. First, one can use multi-agent RL algorithms to optimise a recurrent neural network to perform diverse functions (e.g. efficient sensory coding or motor control). Second, one could use inverse RL to infer the function of a recorded neural network from data. Third, the theory predicts how neural networks should adapt their dynamics to maintain the same function when the external environment or network structure changes. This could lead to theoretical predictions about how neural network dynamics adapt to deal with cell death and/or varying sensory stimulus statistics.
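The ‘neuron as agent’ idea can be made concrete with a minimal sketch (not the authors' implementation): two binary model neurons act as independent Q-learners that share one hypothetical reward favouring co-active network states, so each neuron learns responses that drive the network towards rewarded states. All parameters and the reward function here are illustrative assumptions.

```python
import random

random.seed(0)

N, ALPHA, EPS, EPISODES = 2, 0.2, 0.3, 5000

def reward(state):
    # Hypothetical reward: 1 when every neuron is active, else 0
    return 1.0 if all(state) else 0.0

# One Q-table per neuron-agent: Q[i][(network_state, own_action)] -> value
Q = [{} for _ in range(N)]

def act(i, state, eps):
    # epsilon-greedy choice over the neuron's two possible responses {0, 1}
    if random.random() < eps:
        return random.randint(0, 1)
    q0 = Q[i].get((state, 0), 0.0)
    q1 = Q[i].get((state, 1), 0.0)
    return 1 if q1 > q0 else 0

state = (0,) * N
for _ in range(EPISODES):
    actions = tuple(act(i, state, EPS) for i in range(N))
    r = reward(actions)                    # one shared reward drives all agents
    for i in range(N):
        key = (state, actions[i])
        Q[i][key] = Q[i].get(key, 0.0) + ALPHA * (r - Q[i].get(key, 0.0))
    state = actions                        # each action becomes the neuron's next activity

# Each neuron has discovered at least one rewarded response
print(any(v > 0 for q in Q for v in q.values()))  # → True
```

Because the reward depends on the joint state, no neuron can maximise it alone; coordination emerges only through the shared reward signal, which is the core of the multi-agent framing.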

PLoS ONE ◽  
2021 ◽  
Vol 16 (4) ◽  
pp. e0248940
Author(s):  
Matthew Chalk ◽  
Gasper Tkacik ◽  
Olivier Marre

A central goal in systems neuroscience is to understand the functions performed by neural circuits. Previous top-down models addressed this question by comparing the behaviour of an ideal model circuit, optimised to perform a given function, with neural recordings. However, this requires guessing in advance what function is being performed, which may not be possible for many neural systems. To address this, we propose an inverse reinforcement learning (RL) framework for inferring the function performed by a neural network from data. We assume that the responses of each neuron in a network are optimised so as to drive the network towards ‘rewarded’ states that are desirable for performing a given function. We then show how one can use inverse RL to infer the reward function optimised by the network from observing its responses. This inferred reward function can be used to predict how the neural network should adapt its dynamics to perform the same function when the external environment or network structure changes. This could lead to theoretical predictions about how neural network dynamics adapt to deal with cell death and/or varying sensory stimulus statistics.
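The inverse-RL step can be illustrated in the simplest maximum-entropy setting, where the optimal state distribution is Boltzmann in the reward, p(s) ∝ exp(r(s)/λ), so the reward is recoverable from observed state frequencies up to an additive constant via r(s) = λ log p(s) + const. This is a toy sketch of that one relationship, not the paper's full method; the states, rewards, and temperature below are invented for illustration.

```python
import math
import random
from collections import Counter

random.seed(1)

states = ["s0", "s1", "s2", "s3"]
true_r = {"s0": 0.0, "s1": 1.0, "s2": 2.0, "s3": 0.5}   # hypothetical reward
temp = 1.0                                               # entropy weight (lambda)

# Forward model (max-entropy optimum): p(s) proportional to exp(r(s)/temp)
z = sum(math.exp(true_r[s] / temp) for s in states)
probs = [math.exp(true_r[s] / temp) / z for s in states]

# "Record" the network's states
samples = random.choices(states, weights=probs, k=200_000)
counts = Counter(samples)

# Inverse step: r_hat(s) = temp * log p_hat(s), recovered up to a constant offset
r_hat = {s: temp * math.log(counts[s] / len(samples)) for s in states}
offset = true_r["s0"] - r_hat["s0"]
recovered = {s: r_hat[s] + offset for s in states}
for s in states:
    print(s, round(recovered[s], 2))
```

With enough observed states, the recovered values match the true rewards to within sampling noise, which is what licenses the abstract's claim that the reward function can be inferred from responses alone.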


Author(s):  
Thomas Recchia ◽  
Jae Chung ◽  
Kishore Pochiraju

As robotic systems become more prevalent, it is highly desirable for them to operate in dynamic environments. A common approach is to use reinforcement learning to allow an agent controlling the robot to learn and adapt its behavior based on a reward function. This paper presents a novel multi-agent system that cooperates to control a single robot battle tank in a melee battle scenario, with no prior knowledge of its opponents’ strategies. The agents learn through reinforcement learning, and are loosely coupled by their reward functions. Each agent controls a different aspect of the robot’s behavior. In addition, the problem of delayed reward is addressed through a time-averaged reward applied to several sequential actions at once. This system was evaluated in a simulated melee combat scenario and was shown to improve its performance over time. This was accomplished by each agent learning to pick specific battle strategies for each different opponent it faced.
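One plausible reading of the time-averaged reward scheme can be sketched as follows: buffer the most recent actions, and when a (delayed) reward finally arrives, split it evenly across every buffered state-action pair in a single update. This is an illustrative sketch, not the paper's implementation; the state and action names are invented.

```python
from collections import deque

ALPHA, WINDOW = 0.5, 3
Q = {}                            # Q[(state, action)] -> value
buffer = deque(maxlen=WINDOW)     # last WINDOW (state, action) pairs

def step(state, action, reward=None):
    """Record an action; when a delayed reward arrives, apply its
    time-average to all buffered actions at once."""
    buffer.append((state, action))
    if reward is not None:
        avg = reward / len(buffer)            # time-averaged credit per action
        for s, a in buffer:
            Q[(s, a)] = Q.get((s, a), 0.0) + ALPHA * (avg - Q.get((s, a), 0.0))
        buffer.clear()

# Usage: three sequential actions, reward only after the last one
step("near_wall", "turn")
step("open_field", "advance")
step("enemy_sighted", "fire", reward=9.0)     # 9.0 / 3 = 3.0 credited to each
print(Q[("near_wall", "turn")])               # → 1.5  (0.0 + 0.5 * (3.0 - 0.0))
```

Spreading the averaged reward over the window lets early actions such as the turn share credit for the eventual hit, which is the delayed-reward problem the abstract describes.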


2020 ◽  
pp. 1-13
Author(s):  
L.V. Qiangguo

Multi-agent reinforcement learning in football simulation can be built up from single-agent reinforcement learning. However, compared with the single-agent case, the learning space of a multi-agent system grows dramatically with the number of agents, and the learning difficulty grows with it. Using a BP (backpropagation) neural network as the model structure, this research combines it with a PID controller to control the model's operation. To improve calculation accuracy, and thereby the control effect, the output predicted by the prediction model is used in place of the actual measured value. In addition, taking the football robot as the object, this research studies the multi-agent reinforcement learning problem and its application to the football robot. The content covers single-agent reinforcement learning, multi-agent system reinforcement learning, and, building on these, ball hunting, role assignment, and action selection in football-robot decision strategies. The simulation results show that the proposed method is effective.
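The control loop described above, a PID controller fed by a predicted rather than measured output, can be sketched minimally as below. The first-order `predict` function is a stand-in assumption for the BP-network prediction model in the abstract; the gains and plant are illustrative, not from the paper.

```python
class PID:
    def __init__(self, kp, ki, kd, dt):
        self.kp, self.ki, self.kd, self.dt = kp, ki, kd, dt
        self.integral = 0.0
        self.prev_error = 0.0

    def control(self, setpoint, predicted_output):
        # The error term uses the model's predicted output, not a measurement
        error = setpoint - predicted_output
        self.integral += error * self.dt
        derivative = (error - self.prev_error) / self.dt
        self.prev_error = error
        return self.kp * error + self.ki * self.integral + self.kd * derivative

def predict(state, u, dt):
    # Stand-in for the BP-network prediction model: a first-order plant x' = u - x
    return state + dt * (u - state)

pid = PID(kp=2.0, ki=0.5, kd=0.1, dt=0.1)
state, setpoint, u = 0.0, 1.0, 0.0
for _ in range(1000):
    state = predict(state, u, 0.1)        # prediction replaces the measured value
    u = pid.control(setpoint, state)
print(round(state, 2))
```

Driving the PID from a model prediction avoids waiting for (or trusting) a noisy measurement, which is the rationale the abstract gives for substituting the predicted output.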

