Model-Free Algorithms for Containment Control of Saturated Discrete-Time Multiagent Systems via Q-Learning Method

This paper reconsiders the Q-learning method for the quadratic optimal control problem of discrete-time linear systems. The theoretical results prove that the quadratic optimal controller cannot be solved directly because the data sets are linearly correlated. The following corollaries are drawn: (1) the correlation of the data is the key factor in whether quadratic optimal control laws can be calculated by the Q-learning method; (2) the control laws for linear systems cannot be derived directly by the existing Q-learning method; (3) for nonlinear systems, the data independence of the current method is in doubt. Therefore, it is necessary to discuss the probability of the controllers established by the existing Q-learning method. To solve this problem, an improved model-free Q-learning quadratic optimal control method for discrete-time linear systems, based on ridge regression, is proposed. With this method, the computation process can be implemented correctly and an effective controller can be obtained. The simulation results show that the proposed method not only overcomes the problem caused by the data correlation but also derives proper control laws for discrete-time linear systems.
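The data-correlation issue described in the abstract can be illustrated with a small sketch. It is not the authors' algorithm. It assumes a scalar state `x`, a fixed feedback input `u = -k * x` with no exploration noise, and quadratic features `[x^2, 2xu, u^2]` for a Q-function; the gain `k`, the penalty `lam`, and the helper `ridge_solve` are hypothetical names chosen for illustration. Under these assumptions the feature columns are linearly dependent, so ordinary least squares has no unique solution, while a ridge-regularized solve still returns a well-defined estimate.

```python
import numpy as np

def ridge_solve(Phi, y, lam=1e-3):
    """Solve min ||Phi w - y||^2 + lam * ||w||^2 via the regularized normal equations."""
    n = Phi.shape[1]
    return np.linalg.solve(Phi.T @ Phi + lam * np.eye(n), Phi.T @ y)

# Hypothetical data: the input is a fixed linear function of the state.
rng = np.random.default_rng(0)
x = rng.standard_normal(50)
k = 0.5                      # assumed feedback gain
u = -k * x

# Quadratic features of (x, u); with u = -k x every column is a multiple of x^2.
Phi = np.column_stack([x**2, 2 * x * u, u**2])
y = x**2 + u**2              # assumed stage cost used as the regression target

rank = np.linalg.matrix_rank(Phi)   # rank-deficient: fewer than 3 independent columns
w = ridge_solve(Phi, y)             # still solvable thanks to the lam * I term
residual = np.linalg.norm(Phi @ w - y)
print("rank:", rank, "weights:", w, "residual:", residual)
```

The ridge estimate fits the data closely but is only one of infinitely many weight vectors that do so; the penalty selects a small-norm one rather than recovering a unique kernel.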

Distributed Fault-Tolerant Containment Control Protocols for the Discrete-Time Multiagent Systems via Reinforcement Learning Method

IEEE Transactions on Neural Networks and Learning Systems ◽ 10.1109/tnnls.2021.3121403 ◽ 2021 ◽ pp. 1-13

Author(s): Tieshan Li ◽ Weiwei Bai ◽ Qi Liu ◽ Yue Long ◽ C. L. Philip Chen

Keyword(s): Reinforcement Learning ◽ Multiagent Systems ◽ Discrete Time ◽ Fault Tolerant ◽ Learning Method ◽ Containment Control ◽ Control Protocols

Energy Optimization of Solar Micro-Grid Using Multi Agent Reinforcement Learning

Applied Mechanics and Materials ◽ 10.4028/www.scientific.net/amm.787.843 ◽ 2015 ◽ Vol 787 ◽ pp. 843-847

Author(s): Leo Raju ◽ R.S. Milton ◽ S. Sakthiyanandan

Keyword(s): Reinforcement Learning ◽ Energy Savings ◽ Learning Method ◽ Solar Pv ◽ Q Learning ◽ Pv Systems ◽ Model Free ◽ Individual Unit ◽ Multi Agent ◽ Micro Grid

In this paper, two solar photovoltaic (PV) systems are considered: one in the department, with a capacity of 100 kW, and the other in the hostel, with a capacity of 200 kW. Each has a battery and a load. The capital cost and energy savings of conventional methods are compared, and it is shown that dependency on grid energy is reduced in a solar micro-grid element operating in a distributed environment. In the smart grid framework, grid energy consumption is further reduced by optimal scheduling of the battery using reinforcement learning. Individual unit optimization is done by a model-free reinforcement learning method called Q-Learning, and it is compared with distributed operation of the solar micro-grid using a multi-agent reinforcement learning method called Joint Q-Learning. The energy planning is designed according to the predicted solar PV energy production and the observed load patterns of the department and the hostel. A simulation model was developed in Python.
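For readers unfamiliar with the single-unit method named in the abstract, the standard tabular Q-learning update can be sketched as follows. This is a generic illustration, not the paper's simulation model: the five discretized battery levels, the three actions, the reward value, and the function name `q_update` are all assumptions made here.

```python
import numpy as np

def q_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step: move Q[s, a] toward r + gamma * max over a' of Q[s_next, a']."""
    td_target = r + gamma * np.max(Q[s_next])
    Q[s, a] += alpha * (td_target - Q[s, a])
    return Q

# Hypothetical toy setup: 5 battery levels, 3 actions (0 = charge, 1 = idle, 2 = discharge).
Q = np.zeros((5, 3))

# Assumed transition: discharging at level 3 avoids one unit of grid energy (reward +1)
# and leaves the battery at level 2.
Q = q_update(Q, s=3, a=2, r=1.0, s_next=2, alpha=0.5)
print(Q[3, 2])
```

The update is model-free in the sense that it only needs observed transitions `(s, a, r, s_next)`, not a model of the battery or the load.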

Optimal tracking control for discrete-time systems by model-free off-policy Q-learning approach

2017 11th Asian Control Conference (ASCC) ◽ 10.1109/ascc.2017.8287094 ◽ 2017

Author(s): Jinna Li ◽ Decheng Yuan ◽ Zhengtao Ding

Keyword(s): Discrete Time ◽ Tracking Control ◽ Learning Approach ◽ Q Learning ◽ Optimal Tracking ◽ Optimal Tracking Control ◽ Model Free ◽ Discrete Time Systems ◽ Time Systems

Model‐free adaptive formation control for unknown multiinput‐multioutput nonlinear heterogeneous discrete‐time multiagent systems with bounded disturbance

International Journal of Robust and Nonlinear Control ◽ 10.1002/rnc.5097 ◽ 2020 ◽ Vol 30 (15) ◽ pp. 6330-6350 ◽ Cited By ~ 1

Author(s): Shuangshuang Xiong ◽ Zhongsheng Hou ◽ Shangtai Jin

Keyword(s): Multiagent Systems ◽ Discrete Time ◽ Formation Control ◽ Model Free ◽ Bounded Disturbance

Model-Free Event-Triggered Consensus Algorithm for Multiagent Systems Using Reinforcement Learning Method

IEEE Transactions on Systems Man and Cybernetics Systems ◽ 10.1109/tsmc.2021.3120008 ◽ 2021 ◽ pp. 1-10

Author(s): Mingkang Long ◽ Housheng Su ◽ Zhigang Zeng

Keyword(s): Reinforcement Learning ◽ Multiagent Systems ◽ Consensus Algorithm ◽ Learning Method ◽ Model Free ◽ Event Triggered

Model-free H∞ control design for unknown linear discrete-time systems via Q-learning with LMI

Automatica ◽ 10.1016/j.automatica.2010.05.002 ◽ 2010 ◽ Vol 46 (8) ◽ pp. 1320-1326 ◽ Cited By ~ 41

Author(s): J.-H. Kim ◽ F.L. Lewis

Keyword(s): Discrete Time ◽ Control Design ◽ Q Learning ◽ Model Free ◽ Discrete Time Systems ◽ Time Systems

Model-Free Distributed Consensus Control Based on Actor–Critic Framework for Discrete-Time Nonlinear Multiagent Systems

IEEE Transactions on Systems Man and Cybernetics Systems ◽ 10.1109/tsmc.2018.2883801 ◽ 2020 ◽ Vol 50 (11) ◽ pp. 4123-4134 ◽ Cited By ~ 2

Author(s): Wei Wang ◽ Xin Chen ◽ Hao Fu ◽ Min Wu

Keyword(s): Multiagent Systems ◽ Discrete Time ◽ Distributed Consensus ◽ Consensus Control ◽ Model Free
