Resilient Computing with Reinforcement Learning on a Dynamical System: Case Study in Sorting

<p>In medicinal chemistry programs it is key to design and make compounds that are efficacious and safe. This is a long, complex and difficult multi-parameter optimization process, often including several properties with orthogonal trends. New methods for the automated design of compounds against profiles of multiple properties are thus of great value. Here we present a fragment-based reinforcement learning approach based on an actor-critic model, for the generation of novel molecules with optimal properties. The actor and the critic are both modelled with bidirectional long short-term memory (LSTM) networks. The AI method learns how to generate new compounds with desired properties by starting from an initial set of lead molecules and then improve these by replacing some of their fragments. A balanced binary tree based on the similarity of fragments is used in the generative process to bias the output towards structurally similar molecules. The method is demonstrated by a case study showing that 93% of the generated molecules are chemically valid, and a third satisfy the targeted objectives, while there were none in the initial set.</p>

Download Full-text

Enhancing Energy Trading Between Different Islanded Microgrids A Reinforcement Learning Algorithm Case Study in Northern Kordofan State

2020 International Conference on Computer, Control, Electrical, and Electronics Engineering (ICCCEEE) ◽

10.1109/iccceee49695.2021.9429584 ◽

2021 ◽

Author(s):

Moayad ELamin ◽

Fay Elhassan ◽

Mahmoud A. Manzoul

Keyword(s):

Reinforcement Learning ◽

Learning Algorithm ◽

Energy Trading ◽

Reinforcement Learning Algorithm

Download Full-text

Dynamic Data-Driven Spatiotemporal System Behavior Prediction With Simulations and Sensor Measurement Data

Volume 2B: 44th Design Automation Conference ◽

10.1115/detc2018-86043 ◽

2018 ◽

Author(s):

Xiangxue Zhao ◽

Shapour Azarm ◽

Balakumar Balachandran

Keyword(s):

Dynamical System ◽

Real Time ◽

Measurement Data ◽

Data Driven ◽

Sensor Data ◽

Model Parameters ◽

High Fidelity ◽

System Behavior ◽

Sensor Measurement

Online prediction of dynamical system behavior based on a combination of simulation data and sensor measurement data has numerous applications. Examples include predicting safe flight configurations, forecasting storms and wildfire spread, estimating railway track and pipeline health conditions. In such applications, high-fidelity simulations may be used to accurately predict a system’s dynamical behavior offline (“non-real time”). However, due to the computational expense, these simulations have limited usage for online (“real-time”) prediction of a system’s behavior. To remedy this, one possible approach is to allocate a significant portion of the computational effort to obtain data through offline simulations. The obtained offline data can then be combined with online sensor measurements for online estimation of the system’s behavior with comparable accuracy as the off-line, high-fidelity simulation. The main contribution of this paper is in the construction of a fast data-driven spatiotemporal prediction framework that can be used to estimate general parametric dynamical system behavior. This is achieved through three steps. First, high-order singular value decomposition is applied to map high-dimensional offline simulation datasets into a subspace. Second, Gaussian processes are constructed to approximate model parameters in the subspace. Finally, reduced-order particle filtering is used to assimilate sparsely located sensor data to further improve the prediction. The effectiveness of the proposed approach is demonstrated through a case study. In this case study, aeroelastic response data obtained for an aircraft through simulations is integrated with measurement data obtained from a few sparsely located sensors. Through this case study, the authors show that along with dynamic enhancement of the state estimates, one can also realize a reduction in uncertainty of the estimates.

Download Full-text

A Case Study in Hybrid Multi-threading and Hierarchical Reinforcement Learning Approach for Cooperative Multi-agent Systems

2015 Fourteenth Mexican International Conference on Artificial Intelligence (MICAI) ◽

10.1109/micai.2015.20 ◽

2015 ◽

Author(s):

Hiram Ponce ◽

Ricardo Padilla ◽

Alan Davalos ◽

Alvaro Herrasti ◽

Cynthia Pichardo ◽

...

Keyword(s):

Reinforcement Learning ◽

Learning Approach ◽

Multi Agent Systems ◽

Agent Systems ◽

Hierarchical Reinforcement Learning ◽

Multi Agent

Download Full-text

Adaptive Multi-objective Reinforcement Learning for Pareto Frontier Approximation: A Case Study of Resource Allocation Network in Massive MIMO

10.23919/eusipco54536.2021.9615934 ◽

2021 ◽

Author(s):

Ruiqing Chen ◽

Fanglei Sun ◽

Liang Chen ◽

Kai Li ◽

Liantao Wu ◽

...

Keyword(s):

Resource Allocation ◽

Reinforcement Learning ◽

Massive Mimo ◽

Pareto Frontier ◽

Multi Objective

Download Full-text

A Reinforcement Learning - Great-Deluge Hyper-Heuristic for Examination Timetabling

International Journal of Applied Metaheuristic Computing ◽

10.4018/jamc.2010102603 ◽

2010 ◽

Vol 1 (1) ◽

pp. 39-59 ◽

Cited By ~ 70

Author(s):

Ender Özcan ◽

Mustafa Misir ◽

Gabriela Ochoa ◽

Edmund K. Burke

Keyword(s):

Reinforcement Learning ◽

Complete Solution ◽

Examination Timetabling ◽

Low Level ◽

Termination Criteria ◽

Candidate Solution ◽

Wide Range ◽

Finite Set ◽

Different Characteristics

Hyper-heuristics can be identified as methodologies that search the space generated by a finite set of low level heuristics for solving search problems. An iterative hyper-heuristic framework can be thought of as requiring a single candidate solution and multiple perturbation low level heuristics. An initially generated complete solution goes through two successive processes (heuristic selection and move acceptance) until a set of termination criteria is satisfied. A motivating goal of hyper-heuristic research is to create automated techniques that are applicable to a wide range of problems with different characteristics. Some previous studies show that different combinations of heuristic selection and move acceptance as hyper-heuristic components might yield different performances. This study investigates whether learning heuristic selection can improve the performance of a great deluge based hyper-heuristic using an examination timetabling problem as a case study.

Download Full-text

A Robust Reinforcement Learning System Using Concept of Sliding Mode Control for Unknown Nonlinear Dynamical System

Robust Control, Theory and Applications ◽

10.5772/14378 ◽

2011 ◽

Cited By ~ 1

Author(s):

Masanao Obayashi ◽

Norihiro Nakahara ◽

Katsumi Yamada ◽

Takashi Kuremoto ◽

Kunikazu Kobayashi ◽

...

Keyword(s):

Dynamical System ◽

Reinforcement Learning ◽

Sliding Mode Control ◽

Sliding Mode ◽

Nonlinear Dynamical System ◽

Learning System ◽

Nonlinear Dynamical ◽

Mode Control

Download Full-text

Accelerate Personalized IoT Service Provision by Cloud-Aided Edge Reinforcement Learning: A Case Study on Smart Lighting

Service-Oriented Computing - Lecture Notes in Computer Science ◽

10.1007/978-3-030-65310-1_6 ◽

2020 ◽

pp. 69-84

Author(s):

Jun Na ◽

Handuo Zhang ◽

Xin Deng ◽

Bin Zhang ◽

Ziyi Ye

Keyword(s):

Reinforcement Learning ◽

Service Provision ◽

Smart Lighting

Download Full-text

Multi-Context Generation in Virtual Reality Environments Using Deep Reinforcement Learning

Volume 9: 40th Computers and Information in Engineering Conference (CIE) ◽

10.1115/detc2020-22624 ◽

2020 ◽

Author(s):

James Cunningham ◽

Christian Lopez ◽

Omar Ashour ◽

Conrad S. Tucker

Keyword(s):

Virtual Reality ◽

Reinforcement Learning ◽

Virtual Environments ◽

Probability Distributions ◽

Automatic Generation ◽

Grocery Store ◽

Training Data ◽

Learning Approaches ◽

Common Concept

Abstract In this work, a Deep Reinforcement Learning (RL) approach is proposed for Procedural Content Generation (PCG) that seeks to automate the generation of multiple related virtual reality (VR) environments for enhanced personalized learning. This allows for the user to be exposed to multiple virtual scenarios that demonstrate a consistent theme, which is especially valuable in an educational context. RL approaches to PCG offer the advantage of not requiring training data, as opposed to other PCG approaches that employ supervised learning approaches. This work advances the state of the art in RL-based PCG by demonstrating the ability to generate a diversity of contexts in order to teach the same underlying concept. A case study is presented that demonstrates the feasibility of the proposed RL-based PCG method using examples of probability distributions in both manufacturing facility and grocery store virtual environments. The method demonstrated in this paper has the potential to enable the automatic generation of a variety of virtual environments that are connected by a common concept or theme.

Download Full-text