Generalized reinforcement learning for building control using Behavioral Cloning

My bibliography Save this article

Generalized reinforcement learning for building control using Behavioral Cloning

Author

Listed:

Lee, Zachary E.
Zhang, K. Max

Registered:

Abstract

Advanced building control methods such as model predictive control (MPC) offer significant benefits to both consumers and grid operators, but high computational requirements have acted as barriers to more widespread adoption. Local control computation requires installation of expensive computational hardware, while cloud computing introduces data security and privacy concerns. In this paper, we drastically reduce the local computational requirements of advanced building control through a reinforcement learning (RL)-based approach called Behavioral Cloning, which represents the MPC policy as a neural network that can be locally implemented and quickly computed on a low-cost programmable logic controller. While previous RL and approximate MPC methods must be specifically trained for each building, our key improvement is that the proposed controller can generalize to many buildings, electricity rates, and thermostat setpoint schedules without additional, effort-intensive retraining. To provide this versatility, we have adapted the traditional Behavioral Cloning approach through two innovations: (1) a constraint-informed parameter grouping (CIPG) method that provides a more efficient representation of the training data and (2) a new deep learning model-structure called reverse-time recurrent neural networks (RT-RNN) that allows future information to flow backward in time to more effectively interpret the temporal information in disturbance predictions. The result is an easy-to-deploy, generalized behavioral clone of MPC that can be implemented on a programmable logic controller and requires little building-specific controller tuning, reducing the effort and costs associated with implementing smart residential heat pump control.

Suggested Citation

Lee, Zachary E. & Zhang, K. Max, 2021. "Generalized reinforcement learning for building control using Behavioral Cloning," Applied Energy, Elsevier, vol. 304(C).

Handle: RePEc:eee:appene:v:304:y:2021:i:c:s0306261921009740
DOI: 10.1016/j.apenergy.2021.117602

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Blum, D.H. & Arendt, K. & Rivalin, L. & Piette, M.A. & Wetter, M. & Veje, C.T., 2019. "Practical factors of envelope model setup and their effects on the performance of model predictive control for building heating, ventilating, and air conditioning systems," Applied Energy, Elsevier, vol. 236(C), pages 410-425.
Gianluca Serale & Massimo Fiorentini & Alfonso Capozzoli & Daniele Bernardini & Alberto Bemporad, 2018. "Model Predictive Control (MPC) for Enhancing Building and HVAC System Energy Efficiency: Problem Formulation, Applications and Opportunities," Energies, MDPI, vol. 11(3), pages 1-35, March.
Sengupta, Manajit & Xie, Yu & Lopez, Anthony & Habte, Aron & Maclaurin, Galen & Shelby, James, 2018. "The National Solar Radiation Data Base (NSRDB)," Renewable and Sustainable Energy Reviews, Elsevier, vol. 89(C), pages 51-60.
Wang, Zhe & Hong, Tianzhen, 2020. "Reinforcement learning for building controls: The opportunities and challenges," Applied Energy, Elsevier, vol. 269(C).
Yang, Shiyu & Wan, Man Pun & Chen, Wanyu & Ng, Bing Feng & Dubey, Swapnil, 2021. "Experiment study of machine-learning-based approximate model predictive control for energy-efficient building control," Applied Energy, Elsevier, vol. 288(C).
Drgoňa, Ján & Picard, Damien & Kvasnica, Michal & Helsen, Lieve, 2018. "Approximate model predictive building control via machine learning," Applied Energy, Elsevier, vol. 218(C), pages 199-216.
Lee, Zachary E. & Zhang, K. Max, 2021. "Scalable identification and control of residential heat pumps: A minimal hardware approach," Applied Energy, Elsevier, vol. 286(C).
Finck, Christian & Li, Rongling & Zeiler, Wim, 2019. "Economic model predictive control for demand flexibility of a residential building," Energy, Elsevier, vol. 176(C), pages 365-379.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Montazeri, Mina & Remlinger, Carl & Bejar Haro, Benjamin & Heer, Philipp, 2025. "Fully data-driven and modular building thermal control with physically consistent modeling," Applied Energy, Elsevier, vol. 390(C).
Seppo Sierla & Heikki Ihasalo & Valeriy Vyatkin, 2022. "A Review of Reinforcement Learning Applications to Control of Heating, Ventilation and Air Conditioning Systems," Energies, MDPI, vol. 15(10), pages 1-25, May.
Liu, Yuntao & Song, Yutong & Cui, Can, 2025. "Towards smart control and energy efficiency for multi-zone ventilation systems via an imitation-interaction learning method in energy-aware buildings," Energy, Elsevier, vol. 314(C).
Lee, Zachary E. & Zhang, K. Max, 2023. "Regulated peer-to-peer energy markets for harnessing decentralized demand flexibility," Applied Energy, Elsevier, vol. 336(C).

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Deng, Zhipeng & Wang, Xuezheng & Jiang, Zixin & Zhou, Nianxin & Ge, Haiwang & Dong, Bing, 2023. "Evaluation of deploying data-driven predictive controls in buildings on a large scale for greenhouse gas emission reduction," Energy, Elsevier, vol. 270(C).
Jiang, Fuyang & Kazmi, Hussain, 2025. "What-if: A causal machine learning approach to control-oriented modelling for building thermal dynamics," Applied Energy, Elsevier, vol. 377(PC).
Zhan, Sicheng & Chong, Adrian, 2021. "Data requirements and performance evaluation of model predictive control in buildings: A modeling perspective," Renewable and Sustainable Energy Reviews, Elsevier, vol. 142(C).
Dong, Zhe & Huang, Xiaojin & Dong, Yujie & Zhang, Zuoyi, 2020. "Multilayer perception based reinforcement learning supervisory control of energy systems with application to a nuclear steam supply system," Applied Energy, Elsevier, vol. 259(C).
Bünning, Felix & Huber, Benjamin & Schalbetter, Adrian & Aboudonia, Ahmed & Hudoba de Badyn, Mathias & Heer, Philipp & Smith, Roy S. & Lygeros, John, 2022. "Physics-informed linear regression is competitive with two Machine Learning methods in residential building MPC," Applied Energy, Elsevier, vol. 310(C).
Pergantis, Elias N. & Priyadarshan, & Theeb, Nadah Al & Dhillon, Parveen & Ore, Jonathan P. & Ziviani, Davide & Groll, Eckhard A. & Kircher, Kevin J., 2024. "Field demonstration of predictive heating control for an all-electric house in a cold climate," Applied Energy, Elsevier, vol. 360(C).
Yang, Shiyu & Wan, Man Pun & Ng, Bing Feng & Dubey, Swapnil & Henze, Gregor P. & Chen, Wanyu & Baskaran, Krishnamoorthy, 2020. "Experimental study of model predictive control for an air-conditioning system with dedicated outdoor air system," Applied Energy, Elsevier, vol. 257(C).
Amadeh, Ali & Lee, Zachary E. & Zhang, K. Max, 2022. "Quantifying demand flexibility of building energy systems under uncertainty," Energy, Elsevier, vol. 246(C).
Pinto, Giuseppe & Piscitelli, Marco Savino & Vázquez-Canteli, José Ramón & Nagy, Zoltán & Capozzoli, Alfonso, 2021. "Coordinated energy management for a cluster of buildings through deep reinforcement learning," Energy, Elsevier, vol. 229(C).
Dong, Zihang & Zhang, Xi & Li, Yijun & Strbac, Goran, 2023. "Values of coordinated residential space heating in demand response provision," Applied Energy, Elsevier, vol. 330(PB).
Zhe Tian & Chuang Ye & Jie Zhu & Jide Niu & Yakai Lu, 2023. "Accelerating Optimal Control Strategy Generation for HVAC Systems Using a Scenario Reduction Method: A Case Study," Energies, MDPI, vol. 16(7), pages 1-20, March.
Yang, Shiyu & Wan, Man Pun & Chen, Wanyu & Ng, Bing Feng & Dubey, Swapnil, 2021. "Experiment study of machine-learning-based approximate model predictive control for energy-efficient building control," Applied Energy, Elsevier, vol. 288(C).
Pinto, Giuseppe & Deltetto, Davide & Capozzoli, Alfonso, 2021. "Data-driven district energy management with surrogate models and deep reinforcement learning," Applied Energy, Elsevier, vol. 304(C).
Dongsu Kim & Jongman Lee & Sunglok Do & Pedro J. Mago & Kwang Ho Lee & Heejin Cho, 2022. "Energy Modeling and Model Predictive Control for HVAC in Buildings: A Review of Current Research Trends," Energies, MDPI, vol. 15(19), pages 1-30, October.
Pinto, Giuseppe & Kathirgamanathan, Anjukan & Mangina, Eleni & Finn, Donal P. & Capozzoli, Alfonso, 2022. "Enhancing energy management in grid-interactive buildings: A comparison among cooperative and coordinated architectures," Applied Energy, Elsevier, vol. 310(C).
Wang, Xuezheng & Dong, Bing, 2024. "Long-term experimental evaluation and comparison of advanced controls for HVAC systems," Applied Energy, Elsevier, vol. 371(C).
Lee, Zachary E. & Zhang, K. Max, 2021. "Scalable identification and control of residential heat pumps: A minimal hardware approach," Applied Energy, Elsevier, vol. 286(C).
Yang, Shiyu & Wan, Man Pun, 2022. "Machine-learning-based model predictive control with instantaneous linearization – A case study on an air-conditioning and mechanical ventilation system," Applied Energy, Elsevier, vol. 306(PB).
Huang, Sen & Lin, Yashen & Chinde, Venkatesh & Ma, Xu & Lian, Jianming, 2021. "Simulation-based performance evaluation of model predictive control for building energy systems," Applied Energy, Elsevier, vol. 281(C).
Clara Ceccolini & Roozbeh Sangi, 2022. "Benchmarking Approaches for Assessing the Performance of Building Control Strategies: A Review," Energies, MDPI, vol. 15(4), pages 1-30, February.

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:304:y:2021:i:c:s0306261921009740. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Generalized reinforcement learning for building control using Behavioral Cloning

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data