Evaluation of Reinforcement Learning for Optimal Control of Building Active and Passive Thermal Storage Inventory

2006 ◽  
Vol 129 (2) ◽  
pp. 215-225 ◽  
Author(s):  
Simeng Liu ◽  
Gregor P. Henze

This paper describes an investigation of machine learning for supervisory control of active and passive thermal storage capacity in buildings. Previous studies show that the utilization of active or passive thermal storage, or both, can yield significant peak cooling load reduction and associated electrical demand and operational cost savings. In this study, a model-free learning control is investigated for the operation of electrically driven chilled water systems in heavy-mass commercial buildings. The reinforcement learning controller learns to operate the building and cooling plant based on the reinforcement feedback (monetary cost of each action, in this study) it receives for past control actions. The learning agent interacts with its environment by commanding the global zone temperature setpoints and thermal energy storage charging/discharging rate. The controller extracts information about the environment based solely on the reinforcement signal; the controller does not contain a predictive or system model. Over time and by exploring the environment, the reinforcement learning controller establishes a statistical summary of plant operation, which is continuously updated as operation continues. The present analysis shows that learning control is a feasible methodology to find a near-optimal control strategy for exploiting the active and passive building thermal storage capacity, and also shows that the learning performance is affected by the dimensionality of the action and state space, the learning rate and several other factors. It is found that it takes a long time to learn control strategies for tasks associated with large state and action spaces.
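The model-free scheme the abstract describes can be sketched as a minimal tabular Q-learning loop: the agent picks a zone setpoint and a thermal-energy-storage (TES) command each hour and receives the negative monetary cost as its reinforcement. The cost model, tariff, and discretization below are invented placeholders for illustration, not the authors' building or plant model.

```python
import random

SETPOINTS = [22.0, 24.0, 26.0]            # candidate zone setpoints (deg C)
TES_RATES = [-1.0, 0.0, 1.0]              # discharge / idle / charge
ACTIONS = [(s, r) for s in SETPOINTS for r in TES_RATES]
HOURS = 24                                # state = hour of day (coarse)
ALPHA, GAMMA, EPS = 0.1, 0.95, 0.1        # learning rate, discount, exploration

def energy_cost(hour, setpoint, tes_rate):
    """Assumed cost model: on-peak power is expensive; TES discharge
    offsets the cooling load, charging adds load plus a cycling penalty."""
    price = 0.30 if 12 <= hour < 18 else 0.10       # $/kWh, assumed tariff
    cooling_load = max(0.0, 30.0 - setpoint)        # crude load proxy (kW)
    net_load = max(cooling_load + 5.0 * tes_rate, 0.0)
    return price * net_load + 0.10 * abs(tes_rate)

random.seed(0)
Q = {(h, a): 0.0 for h in range(HOURS) for a in range(len(ACTIONS))}
for episode in range(5000):               # repeated "days" of operation
    for h in range(HOURS):
        if random.random() < EPS:         # explore
            a = random.randrange(len(ACTIONS))
        else:                             # exploit the current estimate
            a = max(range(len(ACTIONS)), key=lambda i: Q[(h, i)])
        cost = energy_cost(h, *ACTIONS[a])
        nxt = (h + 1) % HOURS
        best_next = max(Q[(nxt, i)] for i in range(len(ACTIONS)))
        # Q-learning update with reinforcement = -cost
        Q[(h, a)] += ALPHA * (-cost + GAMMA * best_next - Q[(h, a)])

# Greedy policy after learning: high setpoint plus TES discharge.
policy = {h: ACTIONS[max(range(len(ACTIONS)), key=lambda i: Q[(h, i)])]
          for h in range(HOURS)}
```

Because this toy omits the storage level from the state, the learned policy is trivially to discharge whenever pricing makes it worthwhile; the paper's state space is much richer, which is exactly the dimensionality issue the abstract notes.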



Author(s):  
Ernst Moritz Hahn ◽  
Mateo Perez ◽  
Sven Schewe ◽  
Fabio Somenzi ◽  
Ashutosh Trivedi ◽  
...  

We study reinforcement learning for the optimal control of Branching Markov Decision Processes (BMDPs), a natural extension of (multitype) Branching Markov Chains (BMCs). The state of a (discrete-time) BMC is a collection of entities of various types that, while spawning other entities, generate a payoff. In comparison with BMCs, where the evolution of each entity of the same type follows the same probabilistic pattern, BMDPs allow an external controller to pick from a range of options. This permits us to study the best/worst behaviour of the system. We generalise model-free reinforcement learning techniques to compute an optimal control strategy of an unknown BMDP in the limit. We present results of an implementation that demonstrate the practicality of the approach.
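The entity-spawning dynamics described above can be illustrated with a toy branching MDP: each entity yields a payoff and then spawns offspring according to the action chosen for its type. The single type, actions, payoffs, and offspring distributions below are invented, and the paper's model-free learning is replaced by a plain Monte Carlo comparison of two fixed strategies.

```python
import random

RULES = {  # type -> action -> (payoff, [(children, probability), ...])
    "A": {
        "grow":   (1.0, [([], 0.6), (["A", "A"], 0.4)]),   # subcritical branching
        "settle": (2.0, [([], 1.0)]),                      # stop spawning
    },
}

def episode(strategy, max_steps=60):
    """Total payoff of one run starting from a single type-A entity."""
    entities, total = ["A"], 0.0
    for _ in range(max_steps):
        if not entities:
            break
        nxt = []
        for e in entities:
            payoff, offspring = RULES[e][strategy[e]]
            total += payoff
            r, acc = random.random(), 0.0
            for children, p in offspring:      # sample offspring
                acc += p
                if r < acc:
                    nxt.extend(children)
                    break
        entities = nxt
    return total

random.seed(1)
runs = 20000
mean_grow = sum(episode({"A": "grow"}) for _ in range(runs)) / runs
mean_settle = sum(episode({"A": "settle"}) for _ in range(runs)) / runs
print(round(mean_grow, 2), round(mean_settle, 2))
```

A learner receiving these payoffs would, in the limit, prefer "grow" here, since the expected total progeny payoff 1/(1 - 0.8) = 5 exceeds the one-shot payoff of 2.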


2013 ◽  
Vol 671-674 ◽  
pp. 2515-2519
Author(s):  
Xue Mei Wang ◽  
Zhen Hai Wang ◽  
Xing Long Wu

This project studies an optimal control model for ice-storage systems that is theoretically close to true optimal control and also applicable to actual engineering. Using the energy simulation software EnergyPlus and a simplified solution method for optimal control, the researchers analyze and compare the annual operating costs of the ice-storage air-conditioning system of a project in Beijing under different control strategies. They obtained the power bills of the office building's air-conditioning system under chiller-priority and optimal control throughout the cooling season. The analysis and comparison show that, after the implementation of optimal control, the annual power-bill savings result mainly from non-design conditions, especially in the transitional seasons.
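The cost comparison at the heart of this study can be sketched as a back-of-the-envelope time-of-use calculation: chiller-priority control versus a load-shifting strategy under a two-tier tariff. The tariff, load profile, COPs, and storage size below are invented placeholders, not the Beijing project's data or EnergyPlus results.

```python
PEAK_HOURS = range(10, 22)                  # assumed on-peak window
PRICE = {h: 1.2 if h in PEAK_HOURS else 0.4 for h in range(24)}  # yuan/kWh
LOAD = {h: 500.0 if 8 <= h < 20 else 0.0 for h in range(24)}     # kWh cooling
COP_CHILLER, COP_ICEMAKING = 5.0, 3.5       # ice making is less efficient

def chiller_priority_cost():
    """Chiller meets the cooling load directly whenever it occurs."""
    return sum(PRICE[h] * LOAD[h] / COP_CHILLER for h in range(24))

def load_shift_cost(ice_capacity=4000.0, make_rate=1000.0):
    """Make ice overnight, melt it to cover as much on-peak load as possible."""
    ice, cost = 0.0, 0.0
    for h in range(24):
        if h < 8 and ice < ice_capacity:            # night ice-making window
            make = min(ice_capacity - ice, make_rate)
            ice += make
            cost += PRICE[h] * make / COP_ICEMAKING
        load = LOAD[h]
        if h in PEAK_HOURS and ice > 0.0:
            melt = min(ice, load)                   # serve on-peak load from ice
            ice -= melt
            load -= melt
        cost += PRICE[h] * load / COP_CHILLER       # remainder on the chiller
    return cost

print(round(chiller_priority_cost(), 1), round(load_shift_cost(), 1))
```

Even with the less efficient ice-making COP, shifting the purchase of cooling energy to off-peak hours cuts the daily bill substantially in this toy tariff, which is the mechanism behind the savings the study quantifies.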


2014 ◽  
Vol 2014 ◽  
pp. 1-9 ◽  
Author(s):  
Shuo Zhang ◽  
Chengning Zhang ◽  
Guangwei Han ◽  
Qinghui Wang

A dual-motor coupling-propulsion electric bus (DMCPEB) is modeled, and its optimal control strategy is studied in this paper. The dynamic features of energy loss needed for each subsystem are modeled. A dynamic programming (DP) technique is applied to find the optimal control strategy, including the upshift threshold, downshift threshold, and power split ratio between the main motor and auxiliary motor. Improved control rules are extracted from the DP-based control solution, forming near-optimal control strategies. Simulation results demonstrate that a significant reduction in the running energy loss of the dual-motor coupling-propulsion system (DMCPS) is realized without increasing the frequency of mode switches.
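The DP idea in this abstract can be sketched minimally: choose the drive mode (main motor alone vs. dual-motor coupled) at each step of a speed profile to minimize total energy loss, with a penalty on mode switches. The speed trace, loss curves, and penalty value below are invented for illustration, not the paper's vehicle model.

```python
SPEED = [10, 20, 35, 50, 55, 40, 25, 15]      # km/h, toy drive cycle
MODES = ("single", "dual")
SWITCH_PENALTY = 0.5                           # kJ per mode change, assumed

def loss(mode, v):
    """Assumed loss curves: single-motor mode is efficient at low speed,
    the coupled dual-motor mode at high speed."""
    return 0.002 * v**2 if mode == "single" else 1.5 + 0.0005 * v**2

def optimal_modes():
    n = len(SPEED)
    # cost_to_go[m]: best total loss from this step onward, entering in mode m
    cost_to_go = {m: 0.0 for m in MODES}
    choice = [{} for _ in range(n)]
    for t in reversed(range(n)):               # backward DP over the cycle
        new = {}
        for prev in MODES:
            best = None
            for m in MODES:
                c = (loss(m, SPEED[t])
                     + (SWITCH_PENALTY if m != prev else 0.0)
                     + cost_to_go[m])
                if best is None or c < best[0]:
                    best = (c, m)
            new[prev] = best[0]
            choice[t][prev] = best[1]
        cost_to_go = new
    # roll the optimal decisions forward from an initial single-motor mode
    modes, m = [], "single"
    for t in range(n):
        m = choice[t][m]
        modes.append(m)
    return modes

print(optimal_modes())
```

The DP naturally produces threshold-like behaviour, switching to the coupled mode above roughly 32 km/h here; extracting such thresholds as rules is what turns the DP solution into the near-optimal strategies the paper describes.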


Author(s):  
Philip Odonkor ◽  
Kemper Lewis

In the wake of increasing proliferation of renewable energy and distributed energy resources (DERs), grid designers and operators alike are faced with several emerging challenges in curbing allocative grid inefficiencies and maintaining operational stability. One such challenge relates to the increased price volatility within real-time electricity markets, a result of the inherent intermittency of renewable energy. With this challenge, however, comes heightened economic interest in exploiting the arbitrage potential of price volatility towards demand-side energy cost savings. To this end, this paper aims to maximize the arbitrage value of electricity through the optimal design of control strategies for DERs. Formulated as an arbitrage maximization problem using design optimization, and solved using reinforcement learning, the proposed approach is applied towards shared DERs within multi-building residential clusters. We demonstrate its feasibility across three unique building cluster demand profiles, observing notable energy cost reductions over baseline values. This highlights a capability for generalized learning across multiple building clusters and the ability to design efficient arbitrage policies towards energy cost minimization. Finally, the approach is shown to be computationally tractable, designing efficient strategies in approximately 5 hours of training over a simulation time horizon of 1 month.
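The arbitrage objective can be illustrated with a toy storage problem: operate a shared battery against a volatile real-time price series so it charges when electricity is cheap and discharges when it is expensive. The prices, battery size, and efficiency below are invented placeholders, and the paper's reinforcement-learning formulation is replaced here by an exact dynamic program for clarity.

```python
PRICES = [0.05, 0.04, 0.08, 0.20, 0.30, 0.25, 0.10, 0.06]  # $/kWh, assumed
CAPACITY = 2                   # battery holds 2 units, moves 1 unit per hour
EFFICIENCY = 0.9               # round-trip losses charged on discharge

def best_profit():
    """Backward DP: value[soc] = best profit from now to the end at this SoC."""
    value = [0.0] * (CAPACITY + 1)
    for t in reversed(range(len(PRICES))):
        new = [0.0] * (CAPACITY + 1)
        for soc in range(CAPACITY + 1):
            options = [value[soc]]                        # idle
            if soc < CAPACITY:                            # charge: buy 1 unit
                options.append(-PRICES[t] + value[soc + 1])
            if soc > 0:                                   # discharge: sell 1 unit
                options.append(EFFICIENCY * PRICES[t] + value[soc - 1])
            new[soc] = max(options)
        value = new
    return value[0]    # start with an empty battery

print(round(best_profit(), 3))
```

Here the optimum buys in the two cheapest hours and sells in the two most expensive ones; an RL agent facing an unknown price process has to discover the same buy-low/sell-high structure from experience.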


Author(s):  
Ilan Zohar ◽  
Amit Ailon

This paper presents a simple approach to solving optimal control problems for wheeled mobile robots with bounded inputs. The control objective is to minimize a quadratic performance index subject to differential constraints (the mobile robot's equations of motion). The solution is obtained by utilizing an explicit trajectory parametrization method, which allows a sub-optimal control strategy to be established by minimizing a multivariable function subject to a set of algebraic constraints. The approach is based on the flatness property, which makes it possible to represent the flat output by a polynomial. The bounds on the input signals are taken into account in the analysis.
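The flatness-based parametrization can be sketched as follows: for a differential-drive robot the position (x, y) is a flat output, so a polynomial curve fixing the boundary conditions determines the inputs v and omega algebraically from its derivatives. The particular cubic flat outputs, duration, and input bounds below are illustrative assumptions, not the paper's optimization.

```python
import math

T = 5.0                         # maneuver duration (s), assumed

def flat_outputs(t):
    """Cubic flat outputs: straight progress in x, smoothstep in y."""
    s = t / T
    x = 4.0 * s
    y = 2.0 * (3 * s**2 - 2 * s**3)
    dx = 4.0 / T
    dy = 12.0 * s * (1 - s) / T
    ddx = 0.0
    ddy = 12.0 * (1 - 2 * s) / T**2
    return x, y, dx, dy, ddx, ddy

def inputs(t):
    """Recover v and omega algebraically from the flat-output derivatives."""
    _, _, dx, dy, ddx, ddy = flat_outputs(t)
    v = math.hypot(dx, dy)
    omega = (dx * ddy - dy * ddx) / (dx**2 + dy**2)
    return v, omega

# Check the input bounds over the trajectory, as the analysis requires.
V_MAX, W_MAX = 2.0, 1.0         # input bounds, assumed
samples = [inputs(i * T / 200) for i in range(201)]
v_peak = max(v for v, _ in samples)
w_peak = max(abs(w) for _, w in samples)
print(round(v_peak, 3), round(w_peak, 3), v_peak <= V_MAX and w_peak <= W_MAX)
```

Once the trajectory lives in the polynomial coefficients, the optimal control problem reduces to choosing those coefficients subject to algebraic bound checks like the one above, which is the reduction the paper exploits.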


Author(s):  
Atokolo William ◽  
Akpa Johnson ◽  
Daniel Musa Alih ◽  
Olayemi Kehinde Samuel ◽  
C. E. Mbah Godwin

This work formulates a mathematical model for the control of Zika virus infection using the sterile insect technique (SIT). The model is extended to incorporate an optimal control strategy by introducing three control measures. The optimal control aims to minimize the numbers of exposed humans, infected humans, and mosquitoes in a population, thereby reducing mosquito-to-human and human-to-human contacts and, above all, eliminating the mosquito population. Pontryagin's maximum principle was used to obtain the necessary conditions, derive the optimality system of the model, and solve the control problem. Numerical simulation results show that reductions in the exposed human population, the infected human population, and the entire mosquito population are best achieved using the optimal control strategy.
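The SIT mechanism underlying the model can be illustrated with a one-equation toy: released sterile males dilute wild matings, scaling the effective birth rate by M/(M + S). The logistic growth form and all parameter values below are invented for illustration; this is not the paper's model or its Pontryagin-based optimal control.

```python
def simulate(sterile, b=0.6, d=0.3, k=5000.0, m0=1000.0, days=200, dt=0.1):
    """Euler-integrate dM/dt = b*M*(M/(M+S))*(1 - M/k) - d*M for a
    wild mosquito population M with a maintained sterile stock S."""
    m = m0
    for _ in range(int(days / dt)):
        mating = m / (m + sterile) if m + sterile > 0 else 0.0
        births = b * m * mating * (1.0 - m / k)
        m += dt * (births - d * m)
    return m

no_control = simulate(sterile=0.0)      # settles near its natural level
with_sit = simulate(sterile=3000.0)     # no positive equilibrium: collapse
print(round(no_control), round(with_sit))
```

With a large enough sterile stock the effective birth rate can never balance the death rate, so the wild population is driven to extinction, which is the elimination outcome the optimal control targets.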

