Decentralized Reinforcement Learning for the Online Optimization of Distributed Systems

Reinforcement learning (RL) is a machine learning paradigm, like supervised or unsupervised learning, which learns the best actions an agent needs to perform to maximize its rewards in a particular environment. Research into RL has been proven to have made a real contribution to the protection of cyberphysical distributed systems. In this paper, the authors propose an analytic framework constituted of five security fields and eight industrial areas. This framework allows structuring a systematic review of the research in artificial intelligence that contributes to cybersecurity. In this contribution, the framework is used to analyse the trends and future fields of interest for the RL-based research in information system security.

Download Full-text

Optimal control design based on reinforcement learning for a class of nonlinear distributed systems

2013 10th IEEE International Conference on Control and Automation (ICCA) ◽

10.1109/icca.2013.6565092 ◽

2013 ◽

Author(s):

Zhen He ◽

YanBin Liu

Keyword(s):

Optimal Control ◽

Reinforcement Learning ◽

Distributed Systems ◽

Control Design

Download Full-text

Improving reliability in resource management through adaptive reinforcement learning for distributed systems

Journal of Parallel and Distributed Computing ◽

10.1016/j.jpdc.2014.10.001 ◽

2015 ◽

Vol 75 ◽

pp. 93-100 ◽

Cited By ~ 14

Author(s):

Masnida Hussin ◽

Nor Asilah Wati Abdul Hamid ◽

Khairul Azhar Kasmiran

Keyword(s):

Reinforcement Learning ◽

Distributed Systems ◽

Resource Management

Download Full-text

New scheduling approach using reinforcement learning for heterogeneous distributed systems

Journal of Parallel and Distributed Computing ◽

10.1016/j.jpdc.2017.05.001 ◽

2018 ◽

Vol 117 ◽

pp. 292-302 ◽

Cited By ~ 37

Author(s):

Alexandru Iulian Orhean ◽

Florin Pop ◽

Ioan Raicu

Keyword(s):

Reinforcement Learning ◽

Distributed Systems ◽

Heterogeneous Distributed Systems

Download Full-text

Reinforcement Learning for Online Optimization of Banner Format and Delivery

Advances in Multimedia and Interactive Technologies - Online Multimedia Advertising ◽

10.4018/978-1-60960-189-8.ch002 ◽

2011 ◽

pp. 13-31

Author(s):

Benoit Baccot ◽

Romulus Grigoras ◽

Vincent Charvillat

Keyword(s):

Reinforcement Learning ◽

Ground Truth ◽

Test Site ◽

Online Optimization ◽

Optimal Advertising

Results, showing the power and the efficiency of the two models to solve our problems, are also given. By comparing to a “ground truth” acquired by observing user browsing session on a test site, we conclude that our models are able to determine optimal advertising policies concerning banner formats and delivery.

Download Full-text

An Artificial Intelligence Approach for Online Optimization of Flexible Manufacturing Systems

Applied Mechanics and Materials ◽

10.4028/www.scientific.net/amm.882.96 ◽

2018 ◽

Vol 882 ◽

pp. 96-108 ◽

Cited By ~ 3

Author(s):

Jupiter Bakakeu ◽

Schirin Tolksdorf ◽

Jochen Bauer ◽

Hans-Henning Klos ◽

Jörn Peschke ◽

...

Keyword(s):

Reinforcement Learning ◽

Flexible Manufacturing ◽

Manufacturing Systems ◽

Learning Algorithm ◽

Electricity Consumption ◽

Control Policy ◽

Online Optimization ◽

Energy Prices ◽

Sequential Decision ◽

Time Step

This paper addresses the problem of efficiently operating a flexible manufacturing machine in an electricity micro-grid featuring a high volatility of electricity prices. The problem of finding the optimal control policy is formulated as a sequential decision making problem under uncertainty where, at every time step the uncertainty comes from the lack of knowledge about fu-ture electricity consumption and future weather dependent energy prices. We propose to address this problem using deep reinforcement learning. To this purpose, we designed a deep learning architecture to forecast the load profile of future manufacturing schedule from past production time series. Combined with the forecast of future energy prices, the reinforcement-learning algorithm is trained to perform an online optimization of the production ma-chine in order to reduce the long-term energy costs. The concept is empirical-ly validated on a flexible production machine, where the machine speed can be optimized during the production.

Download Full-text