IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v304y2021ics0306261921009740.html
   My bibliography  Save this article

Generalized reinforcement learning for building control using Behavioral Cloning

Author

Listed:
  • Lee, Zachary E.
  • Zhang, K. Max

Abstract

Advanced building control methods such as model predictive control (MPC) offer significant benefits to both consumers and grid operators, but high computational requirements have acted as barriers to more widespread adoption. Local control computation requires installation of expensive computational hardware, while cloud computing introduces data security and privacy concerns. In this paper, we drastically reduce the local computational requirements of advanced building control through a reinforcement learning (RL)-based approach called Behavioral Cloning, which represents the MPC policy as a neural network that can be locally implemented and quickly computed on a low-cost programmable logic controller. While previous RL and approximate MPC methods must be specifically trained for each building, our key improvement is that the proposed controller can generalize to many buildings, electricity rates, and thermostat setpoint schedules without additional, effort-intensive retraining. To provide this versatility, we have adapted the traditional Behavioral Cloning approach through two innovations: (1) a constraint-informed parameter grouping (CIPG) method that provides a more efficient representation of the training data and (2) a new deep learning model-structure called reverse-time recurrent neural networks (RT-RNN) that allows future information to flow backward in time to more effectively interpret the temporal information in disturbance predictions. The result is an easy-to-deploy, generalized behavioral clone of MPC that can be implemented on a programmable logic controller and requires little building-specific controller tuning, reducing the effort and costs associated with implementing smart residential heat pump control.

Suggested Citation

  • Lee, Zachary E. & Zhang, K. Max, 2021. "Generalized reinforcement learning for building control using Behavioral Cloning," Applied Energy, Elsevier, vol. 304(C).
  • Handle: RePEc:eee:appene:v:304:y:2021:i:c:s0306261921009740
    DOI: 10.1016/j.apenergy.2021.117602
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261921009740
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2021.117602?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Lee, Zachary E. & Zhang, K. Max, 2021. "Scalable identification and control of residential heat pumps: A minimal hardware approach," Applied Energy, Elsevier, vol. 286(C).
    2. Finck, Christian & Li, Rongling & Zeiler, Wim, 2019. "Economic model predictive control for demand flexibility of a residential building," Energy, Elsevier, vol. 176(C), pages 365-379.
    3. Gianluca Serale & Massimo Fiorentini & Alfonso Capozzoli & Daniele Bernardini & Alberto Bemporad, 2018. "Model Predictive Control (MPC) for Enhancing Building and HVAC System Energy Efficiency: Problem Formulation, Applications and Opportunities," Energies, MDPI, vol. 11(3), pages 1-35, March.
    4. Sengupta, Manajit & Xie, Yu & Lopez, Anthony & Habte, Aron & Maclaurin, Galen & Shelby, James, 2018. "The National Solar Radiation Data Base (NSRDB)," Renewable and Sustainable Energy Reviews, Elsevier, vol. 89(C), pages 51-60.
    5. Wang, Zhe & Hong, Tianzhen, 2020. "Reinforcement learning for building controls: The opportunities and challenges," Applied Energy, Elsevier, vol. 269(C).
    6. Yang, Shiyu & Wan, Man Pun & Chen, Wanyu & Ng, Bing Feng & Dubey, Swapnil, 2021. "Experiment study of machine-learning-based approximate model predictive control for energy-efficient building control," Applied Energy, Elsevier, vol. 288(C).
    7. Drgoňa, Ján & Picard, Damien & Kvasnica, Michal & Helsen, Lieve, 2018. "Approximate model predictive building control via machine learning," Applied Energy, Elsevier, vol. 218(C), pages 199-216.
    8. Blum, D.H. & Arendt, K. & Rivalin, L. & Piette, M.A. & Wetter, M. & Veje, C.T., 2019. "Practical factors of envelope model setup and their effects on the performance of model predictive control for building heating, ventilating, and air conditioning systems," Applied Energy, Elsevier, vol. 236(C), pages 410-425.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Lee, Zachary E. & Zhang, K. Max, 2023. "Regulated peer-to-peer energy markets for harnessing decentralized demand flexibility," Applied Energy, Elsevier, vol. 336(C).
    2. Seppo Sierla & Heikki Ihasalo & Valeriy Vyatkin, 2022. "A Review of Reinforcement Learning Applications to Control of Heating, Ventilation and Air Conditioning Systems," Energies, MDPI, vol. 15(10), pages 1-25, May.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Deng, Zhipeng & Wang, Xuezheng & Jiang, Zixin & Zhou, Nianxin & Ge, Haiwang & Dong, Bing, 2023. "Evaluation of deploying data-driven predictive controls in buildings on a large scale for greenhouse gas emission reduction," Energy, Elsevier, vol. 270(C).
    2. Zhan, Sicheng & Chong, Adrian, 2021. "Data requirements and performance evaluation of model predictive control in buildings: A modeling perspective," Renewable and Sustainable Energy Reviews, Elsevier, vol. 142(C).
    3. Dong, Zhe & Huang, Xiaojin & Dong, Yujie & Zhang, Zuoyi, 2020. "Multilayer perception based reinforcement learning supervisory control of energy systems with application to a nuclear steam supply system," Applied Energy, Elsevier, vol. 259(C).
    4. Bünning, Felix & Huber, Benjamin & Schalbetter, Adrian & Aboudonia, Ahmed & Hudoba de Badyn, Mathias & Heer, Philipp & Smith, Roy S. & Lygeros, John, 2022. "Physics-informed linear regression is competitive with two Machine Learning methods in residential building MPC," Applied Energy, Elsevier, vol. 310(C).
    5. Yang, Shiyu & Wan, Man Pun & Ng, Bing Feng & Dubey, Swapnil & Henze, Gregor P. & Chen, Wanyu & Baskaran, Krishnamoorthy, 2020. "Experimental study of model predictive control for an air-conditioning system with dedicated outdoor air system," Applied Energy, Elsevier, vol. 257(C).
    6. Amadeh, Ali & Lee, Zachary E. & Zhang, K. Max, 2022. "Quantifying demand flexibility of building energy systems under uncertainty," Energy, Elsevier, vol. 246(C).
    7. Pinto, Giuseppe & Piscitelli, Marco Savino & Vázquez-Canteli, José Ramón & Nagy, Zoltán & Capozzoli, Alfonso, 2021. "Coordinated energy management for a cluster of buildings through deep reinforcement learning," Energy, Elsevier, vol. 229(C).
    8. Dong, Zihang & Zhang, Xi & Li, Yijun & Strbac, Goran, 2023. "Values of coordinated residential space heating in demand response provision," Applied Energy, Elsevier, vol. 330(PB).
    9. Zhe Tian & Chuang Ye & Jie Zhu & Jide Niu & Yakai Lu, 2023. "Accelerating Optimal Control Strategy Generation for HVAC Systems Using a Scenario Reduction Method: A Case Study," Energies, MDPI, vol. 16(7), pages 1-20, March.
    10. Yang, Shiyu & Wan, Man Pun & Chen, Wanyu & Ng, Bing Feng & Dubey, Swapnil, 2021. "Experiment study of machine-learning-based approximate model predictive control for energy-efficient building control," Applied Energy, Elsevier, vol. 288(C).
    11. Pinto, Giuseppe & Deltetto, Davide & Capozzoli, Alfonso, 2021. "Data-driven district energy management with surrogate models and deep reinforcement learning," Applied Energy, Elsevier, vol. 304(C).
    12. Dongsu Kim & Jongman Lee & Sunglok Do & Pedro J. Mago & Kwang Ho Lee & Heejin Cho, 2022. "Energy Modeling and Model Predictive Control for HVAC in Buildings: A Review of Current Research Trends," Energies, MDPI, vol. 15(19), pages 1-30, October.
    13. Pinto, Giuseppe & Kathirgamanathan, Anjukan & Mangina, Eleni & Finn, Donal P. & Capozzoli, Alfonso, 2022. "Enhancing energy management in grid-interactive buildings: A comparison among cooperative and coordinated architectures," Applied Energy, Elsevier, vol. 310(C).
    14. Lee, Zachary E. & Zhang, K. Max, 2021. "Scalable identification and control of residential heat pumps: A minimal hardware approach," Applied Energy, Elsevier, vol. 286(C).
    15. Yang, Shiyu & Wan, Man Pun, 2022. "Machine-learning-based model predictive control with instantaneous linearization – A case study on an air-conditioning and mechanical ventilation system," Applied Energy, Elsevier, vol. 306(PB).
    16. Huang, Sen & Lin, Yashen & Chinde, Venkatesh & Ma, Xu & Lian, Jianming, 2021. "Simulation-based performance evaluation of model predictive control for building energy systems," Applied Energy, Elsevier, vol. 281(C).
    17. Clara Ceccolini & Roozbeh Sangi, 2022. "Benchmarking Approaches for Assessing the Performance of Building Control Strategies: A Review," Energies, MDPI, vol. 15(4), pages 1-30, February.
    18. Bampoulas, Adamantios & Pallonetto, Fabiano & Mangina, Eleni & Finn, Donal P., 2022. "An ensemble learning-based framework for assessing the energy flexibility of residential buildings with multicomponent energy systems," Applied Energy, Elsevier, vol. 315(C).
    19. Lee, Zachary E. & Zhang, K. Max, 2023. "Regulated peer-to-peer energy markets for harnessing decentralized demand flexibility," Applied Energy, Elsevier, vol. 336(C).
    20. Savadkoohi, Marjan & Macarulla, Marcel & Casals, Miquel, 2023. "Facilitating the implementation of neural network-based predictive control to optimize building heating operation," Energy, Elsevier, vol. 263(PB).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:304:y:2021:i:c:s0306261921009740. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.