IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v391y2025ics0306261925005811.html

Coupling time-scale reinforcement learning methods for building operational optimization with waste heat

Author

Listed:
  • Chen, Zhe
  • Xing, Tian
  • Wang, Yu
  • Zhuang, Yunlin
  • Zheng, Meng
  • Zhao, Qianchuan
  • Jia, Qing-Shan

Abstract

This paper focuses on the joint optimization of fan coil units (FCUs) and heat pumps in HVAC systems for multi-zone building environments. The core problem involves balancing fast local control of FCUs with slower global control of heat pumps to ensure energy efficiency and indoor comfort. To tackle this, we propose a coupling time-scale reinforcement learning (RL) algorithm, specifically a Deep Q-Network (DQN)-based approach that employs a multi-task learning network to manage both FCU and heat pump control efficiently through the utilization of shared state information. Moreover, the agents are trained and make decisions collaboratively across different timescales. In addition, we develop a high-fidelity building simulation that incorporates detailed thermal models, dynamic loading, and waste heat modules to evaluate the performance of the system in real-world conditions. The experimental results show that the proposed coupling time-scale DQN algorithm improves the accuracy of temperature control by 35.4 % and 26.21 % compared to traditional DQN and the occupant-centric control (OCC) algorithms. Additionally, it reduces regional power fluctuations by 25.18 % and 56.74 % relative to these traditional algorithms. Simultaneously, the proposed algorithm achieves the lowest heat pump energy consumption (2964 W), outperforming traditional DQN (2977 W) and OCC (3051 W) respectively, while maintaining superior temperature control accuracy. These quantitative improvements collectively demonstrate the proposed algorithm’s capability to synergistically balance thermal comfort, power fluctuation, and energy consumption.

Suggested Citation

  • Chen, Zhe & Xing, Tian & Wang, Yu & Zhuang, Yunlin & Zheng, Meng & Zhao, Qianchuan & Jia, Qing-Shan, 2025. "Coupling time-scale reinforcement learning methods for building operational optimization with waste heat," Applied Energy, Elsevier, vol. 391(C).
  • Handle: RePEc:eee:appene:v:391:y:2025:i:c:s0306261925005811
    DOI: 10.1016/j.apenergy.2025.125851
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261925005811
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2025.125851?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Lei, Yue & Zhan, Sicheng & Ono, Eikichi & Peng, Yuzhen & Zhang, Zhiang & Hasama, Takamasa & Chong, Adrian, 2022. "A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings," Applied Energy, Elsevier, vol. 324(C).
    2. Christina Turley & Margarite Jacoby & Gregory Pavlak & Gregor Henze, 2020. "Development and Evaluation of Occupancy-Aware HVAC Control for Residential Building Energy Efficiency and Occupant Comfort," Energies, MDPI, vol. 13(20), pages 1-30, October.
    3. Homod, Raad Z. & Mohammed, Hayder Ibrahim & Abderrahmane, Aissa & Alawi, Omer A. & Khalaf, Osamah Ibrahim & Mahdi, Jasim M. & Guedri, Kamel & Dhaidan, Nabeel S. & Albahri, A.S. & Sadeq, Abdellatif M. , 2023. "Deep clustering of Lagrangian trajectory for multi-task learning to energy saving in intelligent buildings using cooperative multi-agent," Applied Energy, Elsevier, vol. 351(C).
    4. Langer, Lissy & Volling, Thomas, 2022. "A reinforcement learning approach to home energy management for modulating heat pumps and photovoltaic systems," Applied Energy, Elsevier, vol. 327(C).
    5. Sha, Xinyi & Ma, Zhenjun & Sethuvenkatraman, Subbu & Li, Wanqing, 2025. "Online learning-enhanced data-driven model predictive control for optimizing HVAC energy consumption, indoor air quality and thermal comfort," Applied Energy, Elsevier, vol. 383(C).
    6. Wang, Zhe & Hong, Tianzhen, 2020. "Reinforcement learning for building controls: The opportunities and challenges," Applied Energy, Elsevier, vol. 269(C).
    7. Zhuang, Dian & Gan, Vincent J.L. & Duygu Tekler, Zeynep & Chong, Adrian & Tian, Shuai & Shi, Xing, 2023. "Data-driven predictive control for smart HVAC system in IoT-integrated buildings with time-series forecasting and reinforcement learning," Applied Energy, Elsevier, vol. 338(C).
    8. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    9. Biemann, Marco & Scheller, Fabian & Liu, Xiufeng & Huang, Lizhen, 2021. "Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control," Applied Energy, Elsevier, vol. 298(C).
    10. Reiners, Tobias & Gross, Michel & Altieri, Lisa & Wagner, Hermann-Josef & Bertsch, Valentin, 2021. "Heat pump efficiency in fifth generation ultra-low temperature district heating networks using a wastewater heat source," Energy, Elsevier, vol. 236(C).
    11. Homod, Raad Z., 2018. "Analysis and optimization of HVAC control systems based on energy and performance considerations for smart buildings," Renewable Energy, Elsevier, vol. 126(C), pages 49-64.
    12. Le, Khoa Xuan & Huang, Ming Jun & Shah, Nikhilkumar N. & Wilson, Christopher & Artain, Paul Mac & Byrne, Raymond & Hewitt, Neil J., 2019. "Techno-economic assessment of cascade air-to-water heat pump retrofitted into residential buildings using experimentally validated simulations," Applied Energy, Elsevier, vol. 250(C), pages 633-652.
    13. Bianchini, Gianni & Casini, Marco & Vicino, Antonio & Zarrilli, Donato, 2016. "Demand-response in building heating systems: A Model Predictive Control approach," Applied Energy, Elsevier, vol. 168(C), pages 159-170.
    14. Perera, A.T.D. & Kamalaruban, Parameswaran, 2021. "Applications of reinforcement learning in energy systems," Renewable and Sustainable Energy Reviews, Elsevier, vol. 137(C).
    15. Kazmi, Hussain & Mehmood, Fahad & Lodeweyckx, Stefan & Driesen, Johan, 2018. "Gigawatt-hour scale savings on a budget of zero: Deep reinforcement learning based optimal control of hot water systems," Energy, Elsevier, vol. 144(C), pages 159-168.
    16. Oldewurtel, Frauke & Sturzenegger, David & Morari, Manfred, 2013. "Importance of occupancy information for building climate control," Applied Energy, Elsevier, vol. 101(C), pages 521-532.
    17. H. Kazmi & Fahad Mehmood & S. Lodeweyckx & J. Driesen, 2018. "Gigawatt-Hour Scale Savings on a Budget of Zero: Deep Reinforcement Learning Based Optimal Control of Hot Water Systems," Post-Print hal-04317815, HAL.
    18. Peng, Yuzhen & Rysanek, Adam & Nagy, Zoltán & Schlüter, Arno, 2018. "Using machine learning techniques for occupancy-prediction-based cooling control in office buildings," Applied Energy, Elsevier, vol. 211(C), pages 1343-1358.
    19. Chow, W. K. & Chan, K. T., 1995. "Parameterization study of the overall thermal-transfer value equation for buildings," Applied Energy, Elsevier, vol. 50(3), pages 247-268.
    20. Frederik Ruelens & Sandro Iacovella & Bert J. Claessens & Ronnie Belmans, 2015. "Learning Agent for a Heat-Pump Thermostat with a Set-Back Strategy Using Model-Free Reinforcement Learning," Energies, MDPI, vol. 8(8), pages 1-19, August.
    21. Homod, Raad Z. & Gaeid, Khalaf S. & Dawood, Suroor M. & Hatami, Alireza & Sahari, Khairul S., 2020. "Evaluation of energy-saving potential for optimal time response of HVAC control system in smart buildings," Applied Energy, Elsevier, vol. 271(C).
    22. Wang, Junqi & Jiang, Lanfei & Yu, Hanhui & Feng, Zhuangbo & Castaño-Rosa, Raúl & Cao, Shi-jie, 2024. "Computer vision to advance the sensing and control of built environment towards occupant-centric sustainable development: A critical review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 192(C).
    23. Alimohammadisagvand, Behrang & Jokisalo, Juha & Sirén, Kai, 2018. "Comparison of four rule-based demand response control algorithms in an electrically and heat pump-heated residential building," Applied Energy, Elsevier, vol. 209(C), pages 167-179.
    24. Deng, Zhipeng & Wang, Xuezheng & Dong, Bing, 2023. "Quantum computing for future real-time building HVAC controls," Applied Energy, Elsevier, vol. 334(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Omar Al-Ani & Sanjoy Das, 2022. "Reinforcement Learning: Theory and Applications in HEMS," Energies, MDPI, vol. 15(17), pages 1-37, September.
    2. Homod, Raad Z. & Togun, Hussein & Kadhim Hussein, Ahmed & Noraldeen Al-Mousawi, Fadhel & Yaseen, Zaher Mundher & Al-Kouz, Wael & Abd, Haider J. & Alawi, Omer A. & Goodarzi, Marjan & Hussein, Omar A., 2022. "Dynamics analysis of a novel hybrid deep clustering for unsupervised learning by reinforcement of multi-agent to energy saving in intelligent buildings," Applied Energy, Elsevier, vol. 313(C).
    3. Zhou, Xinlei & Du, Han & Xue, Shan & Ma, Zhenjun, 2024. "Recent advances in data mining and machine learning for enhanced building energy management," Energy, Elsevier, vol. 307(C).
    4. Heidari, Amirreza & Girardin, Luc & Dorsaz, Cédric & Maréchal, François, 2025. "A trustworthy reinforcement learning framework for autonomous control of a large-scale complex heating system: Simulation and field implementation," Applied Energy, Elsevier, vol. 378(PA).
    5. Guo, Yuxiang & Qu, Shengli & Wang, Chuang & Xing, Ziwen & Duan, Kaiwen, 2024. "Optimal dynamic thermal management for data center via soft actor-critic algorithm with dynamic control interval and combined-value state space," Applied Energy, Elsevier, vol. 373(C).
    6. Wang, Xuezheng & Dong, Bing, 2024. "Long-term experimental evaluation and comparison of advanced controls for HVAC systems," Applied Energy, Elsevier, vol. 371(C).
    7. Biemann, Marco & Scheller, Fabian & Liu, Xiufeng & Huang, Lizhen, 2021. "Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control," Applied Energy, Elsevier, vol. 298(C).
    8. Ayas Shaqour & Aya Hagishima, 2022. "Systematic Review on Deep Reinforcement Learning-Based Energy Management for Different Building Types," Energies, MDPI, vol. 15(22), pages 1-27, November.
    9. Perera, A.T.D. & Kamalaruban, Parameswaran, 2021. "Applications of reinforcement learning in energy systems," Renewable and Sustainable Energy Reviews, Elsevier, vol. 137(C).
    10. Lei, Yue & Zhan, Sicheng & Ono, Eikichi & Peng, Yuzhen & Zhang, Zhiang & Hasama, Takamasa & Chong, Adrian, 2022. "A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings," Applied Energy, Elsevier, vol. 324(C).
    11. Panagiotis Michailidis & Iakovos Michailidis & Elias Kosmatopoulos, 2025. "Reinforcement Learning for Optimizing Renewable Energy Utilization in Buildings: A Review on Applications and Innovations," Energies, MDPI, vol. 18(7), pages 1-40, March.
    12. Nik, Vahid M. & Hosseini, Mohammad, 2023. "CIRLEM: a synergic integration of Collective Intelligence and Reinforcement learning in Energy Management for enhanced climate resilience and lightweight computation," Applied Energy, Elsevier, vol. 350(C).
    13. Lu, Ruyuan & Li, Xin & Chen, Ronghao & Lei, Aimin & Ma, Xiaoming, 2024. "An Alternative Reinforcement Learning (ARL) control strategy for data center air-cooled HVAC systems," Energy, Elsevier, vol. 308(C).
    14. Panagiotis Michailidis & Iakovos Michailidis & Dimitrios Vamvakas & Elias Kosmatopoulos, 2023. "Model-Free HVAC Control in Buildings: A Review," Energies, MDPI, vol. 16(20), pages 1-45, October.
    15. Yassine Chemingui & Adel Gastli & Omar Ellabban, 2020. "Reinforcement Learning-Based School Energy Management System," Energies, MDPI, vol. 13(23), pages 1-21, December.
    16. Liao, Chenxin & Miyata, Shohei & Qu, Ming & Akashi, Yasunori, 2025. "Year-round operational optimization of HVAC systems using hierarchical deep reinforcement learning for enhancing indoor air quality and reducing energy consumption," Applied Energy, Elsevier, vol. 390(C).
    17. Li, Yanxue & Wang, Zixuan & Xu, Wenya & Gao, Weijun & Xu, Yang & Xiao, Fu, 2023. "Modeling and energy dynamic control for a ZEH via hybrid model-based deep reinforcement learning," Energy, Elsevier, vol. 277(C).
    18. Dongsu Kim & Jongman Lee & Sunglok Do & Pedro J. Mago & Kwang Ho Lee & Heejin Cho, 2022. "Energy Modeling and Model Predictive Control for HVAC in Buildings: A Review of Current Research Trends," Energies, MDPI, vol. 15(19), pages 1-30, October.
    19. Gao, Yuan & Hu, Zehuan & Yamate, Shun & Otomo, Junichiro & Chen, Wei-An & Liu, Mingzhe & Xu, Tingting & Ruan, Yingjun & Shang, Juan, 2025. "Unlocking predictive insights and interpretability in deep reinforcement learning for Building-Integrated Photovoltaic and Battery (BIPVB) systems," Applied Energy, Elsevier, vol. 384(C).
    20. Touzani, Samir & Prakash, Anand Krishnan & Wang, Zhe & Agarwal, Shreya & Pritoni, Marco & Kiran, Mariam & Brown, Richard & Granderson, Jessica, 2021. "Controlling distributed energy resources via deep reinforcement learning for load flexibility and energy efficiency," Applied Energy, Elsevier, vol. 304(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:391:y:2025:i:c:s0306261925005811. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.