IDEAS home Printed from https://ideas.repec.org/a/eee/energy/v263y2023ipbs0360544222025658.html

Cross temporal-spatial transferability investigation of deep reinforcement learning control strategy in the building HVAC system level

Author

Listed:
  • Fang, Xi
  • Gong, Guangcai
  • Li, Guannan
  • Chun, Liang
  • Peng, Pei
  • Li, Wenqiang
  • Shi, Xing

Abstract

Model-free deep reinforcement learning (DRL) control strategies have achieved positive effects in optimal HVAC system control. However, developing DRL control strategies for different building HVAC systems is time-consuming and laborious. To address this issue, this study proposes an integrated transfer learning and deep reinforcement learning (TL-DRL) framework to achieve DRL control strategy transfer at the building HVAC system level. A deep Q-network (DQN) is first pre-trained in the source building until it converges to an optimal strategy. The pre-trained parameters of the first few DQN layers are then transferred to the target DQN. Finally, the parameters of the last few layers of the target DQN are fine-tuned in the target building. An EnergyPlus-Python co-simulation testbed is developed to investigate the cross temporal-spatial transferability of the DQN control strategy at the building HVAC system level. Results indicate that, when transferring the first two layers, the proposed TL-DRL framework improves training efficiency by about 13.28% compared with DRL baseline models trained from scratch, while keeping energy consumption and indoor air temperature within an acceptable range. The proposed TL-DRL framework provides a preliminary direction for scaling intelligent HVAC control strategies.
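The transfer step described in the abstract — copy the first few pre-trained DQN layers into the target network, then fine-tune only the remaining layers — can be sketched as follows. This is a minimal illustration, not the authors' implementation: the tiny weight matrices, layer sizes, and the `transfer_layers` helper are all hypothetical stand-ins for a trained DQN.

```python
import numpy as np

rng = np.random.default_rng(0)

def init_qnet(layer_sizes):
    """Random weight matrices for a small MLP Q-network (state -> Q-values)."""
    return [rng.standard_normal((m, n)) * 0.1
            for m, n in zip(layer_sizes[:-1], layer_sizes[1:])]

def transfer_layers(source, target, n_transfer):
    """Copy the first n_transfer pre-trained layers from the source DQN into
    the target DQN, and return a per-layer trainable mask so that only the
    remaining layers are fine-tuned in the target building."""
    target = [w.copy() for w in target]
    for i in range(n_transfer):
        target[i] = source[i].copy()
    trainable = [i >= n_transfer for i in range(len(target))]
    return target, trainable

# Source DQN pre-trained in the source building (random weights stand in
# for a converged policy); target DQN shares the same architecture.
source_q = init_qnet([8, 32, 32, 4])   # e.g. 8 state features, 4 discrete actions
target_q = init_qnet([8, 32, 32, 4])

# Transfer the first two layers, as in the paper's best-performing setting.
target_q, trainable = transfer_layers(source_q, target_q, n_transfer=2)
```

During fine-tuning in the target building, a gradient update would then be applied only to the layers whose `trainable` flag is `True`, leaving the transferred feature-extraction layers fixed.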

Suggested Citation

  • Fang, Xi & Gong, Guangcai & Li, Guannan & Chun, Liang & Peng, Pei & Li, Wenqiang & Shi, Xing, 2023. "Cross temporal-spatial transferability investigation of deep reinforcement learning control strategy in the building HVAC system level," Energy, Elsevier, vol. 263(PB).
  • Handle: RePEc:eee:energy:v:263:y:2023:i:pb:s0360544222025658
    DOI: 10.1016/j.energy.2022.125679

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0360544222025658
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.energy.2022.125679?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    1. Inayat, Abrar & Raza, Mohsin, 2019. "District cooling system via renewable energy sources: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 107(C), pages 360-373.
    2. Li, Wenzhuo & Wang, Shengwei & Koo, Choongwan, 2021. "A real-time optimal control strategy for multi-zone VAV air-conditioning systems adopting a multi-agent based distributed optimization method," Applied Energy, Elsevier, vol. 287(C).
    3. Wang, Zhe & Hong, Tianzhen, 2020. "Reinforcement learning for building controls: The opportunities and challenges," Applied Energy, Elsevier, vol. 269(C).
    4. Biemann, Marco & Scheller, Fabian & Liu, Xiufeng & Huang, Lizhen, 2021. "Experimental evaluation of model-free reinforcement learning algorithms for continuous HVAC control," Applied Energy, Elsevier, vol. 298(C).
    5. Gao, Yixiang & Li, Shuhui & Fu, Xingang & Dong, Weizhen & Lu, Bing & Li, Zhongwen, 2020. "Energy management and demand response with intelligent learning for multi-thermal-zone buildings," Energy, Elsevier, vol. 210(C).
    6. Afram, Abdul & Janabi-Sharifi, Farrokh, 2015. "Gray-box modeling and validation of residential HVAC system for control system design," Applied Energy, Elsevier, vol. 137(C), pages 134-150.
    7. Yang, Ting & Zhao, Liyuan & Li, Wei & Wu, Jianzhong & Zomaya, Albert Y., 2021. "Towards healthy and cost-effective indoor environment management in smart homes: A deep reinforcement learning approach," Applied Energy, Elsevier, vol. 300(C).
    8. Wang, Tianjing & Tang, Yong, 2022. "Transfer-Reinforcement-Learning-Based rescheduling of differential power grids considering security constraints," Applied Energy, Elsevier, vol. 306(PB).
    9. Fang, Xi & Gong, Guangcai & Li, Guannan & Chun, Liang & Li, Wenqiang & Peng, Pei, 2021. "A hybrid deep transfer learning strategy for short term cross-building energy prediction," Energy, Elsevier, vol. 215(PB).
    10. Fan, Cheng & Sun, Yongjun & Xiao, Fu & Ma, Jie & Lee, Dasheng & Wang, Jiayuan & Tseng, Yen Chieh, 2020. "Statistical investigations of transfer learning-based methodology for short-term building energy predictions," Applied Energy, Elsevier, vol. 262(C).
    11. Xiao, Tong & Xu, Peng & He, Ruikai & Sha, Huajing, 2022. "Status quo and opportunities for building energy prediction in limited data Context—Overview from a competition," Applied Energy, Elsevier, vol. 305(C).
    12. Afroz, Zakia & Shafiullah, GM & Urmee, Tania & Higgins, Gary, 2018. "Modeling techniques used in building HVAC control systems: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 83(C), pages 64-84.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project; subscribe to its RSS feed for this item.


    Cited by:

    1. Fan, Cheng & Lei, Yutian & Sun, Yongjun & Mo, Like, 2023. "Novel transformer-based self-supervised learning methods for improved HVAC fault diagnosis performance with limited labeled data," Energy, Elsevier, vol. 278(PB).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Omar Al-Ani & Sanjoy Das, 2022. "Reinforcement Learning: Theory and Applications in HEMS," Energies, MDPI, vol. 15(17), pages 1-37, September.
    2. Homod, Raad Z. & Togun, Hussein & Kadhim Hussein, Ahmed & Noraldeen Al-Mousawi, Fadhel & Yaseen, Zaher Mundher & Al-Kouz, Wael & Abd, Haider J. & Alawi, Omer A. & Goodarzi, Marjan & Hussein, Omar A., 2022. "Dynamics analysis of a novel hybrid deep clustering for unsupervised learning by reinforcement of multi-agent to energy saving in intelligent buildings," Applied Energy, Elsevier, vol. 313(C).
    3. Dongsu Kim & Yongjun Lee & Kyungil Chin & Pedro J. Mago & Heejin Cho & Jian Zhang, 2023. "Implementation of a Long Short-Term Memory Transfer Learning (LSTM-TL)-Based Data-Driven Model for Building Energy Demand Forecasting," Sustainability, MDPI, vol. 15(3), pages 1-23, January.
    4. Zhang, Qingang & Zeng, Wei & Lin, Qinjie & Chng, Chin-Boon & Chui, Chee-Kong & Lee, Poh-Seng, 2023. "Deep reinforcement learning towards real-world dynamic thermal management of data centers," Applied Energy, Elsevier, vol. 333(C).
    5. Zhang, Yunfei & Zhou, Zhihua & Du, Yahui & Shen, Jun & Li, Zhenxing & Yuan, Jianjuan, 2023. "A data transfer method based on one dimensional convolutional neural network for cross-building load prediction," Energy, Elsevier, vol. 277(C).
    6. Ayas Shaqour & Aya Hagishima, 2022. "Systematic Review on Deep Reinforcement Learning-Based Energy Management for Different Building Types," Energies, MDPI, vol. 15(22), pages 1-27, November.
    7. Li, Yanxue & Wang, Zixuan & Xu, Wenya & Gao, Weijun & Xu, Yang & Xiao, Fu, 2023. "Modeling and energy dynamic control for a ZEH via hybrid model-based deep reinforcement learning," Energy, Elsevier, vol. 277(C).
    8. Pinto, Giuseppe & Kathirgamanathan, Anjukan & Mangina, Eleni & Finn, Donal P. & Capozzoli, Alfonso, 2022. "Enhancing energy management in grid-interactive buildings: A comparison among cooperative and coordinated architectures," Applied Energy, Elsevier, vol. 310(C).
    9. Tang, Lingfeng & Xie, Haipeng & Wang, Xiaoyang & Bie, Zhaohong, 2023. "Privacy-preserving knowledge sharing for few-shot building energy prediction: A federated learning approach," Applied Energy, Elsevier, vol. 337(C).
    10. Clara Ceccolini & Roozbeh Sangi, 2022. "Benchmarking Approaches for Assessing the Performance of Building Control Strategies: A Review," Energies, MDPI, vol. 15(4), pages 1-30, February.
    11. Yue, Naihua & Caini, Mauro & Li, Lingling & Zhao, Yang & Li, Yu, 2023. "A comparison of six metamodeling techniques applied to multi building performance vectors prediction on gymnasiums under multiple climate conditions," Applied Energy, Elsevier, vol. 332(C).
    12. Di Natale, L. & Svetozarevic, B. & Heer, P. & Jones, C.N., 2022. "Physically Consistent Neural Networks for building thermal modeling: Theory and analysis," Applied Energy, Elsevier, vol. 325(C).
    13. Li, Guannan & Li, Fan & Ahmad, Tanveer & Liu, Jiangyan & Li, Tao & Fang, Xi & Wu, Yubei, 2022. "Performance evaluation of sequence-to-sequence-Attention model for short-term multi-step ahead building energy predictions," Energy, Elsevier, vol. 259(C).
    14. Shen, Rendong & Zhong, Shengyuan & Wen, Xin & An, Qingsong & Zheng, Ruifan & Li, Yang & Zhao, Jun, 2022. "Multi-agent deep reinforcement learning optimization framework for building energy system with renewable energy," Applied Energy, Elsevier, vol. 312(C).
    15. Zhuang, Dian & Gan, Vincent J.L. & Duygu Tekler, Zeynep & Chong, Adrian & Tian, Shuai & Shi, Xing, 2023. "Data-driven predictive control for smart HVAC system in IoT-integrated buildings with time-series forecasting and reinforcement learning," Applied Energy, Elsevier, vol. 338(C).
    16. Gao, Yuan & Matsunami, Yuki & Miyata, Shohei & Akashi, Yasunori, 2022. "Multi-agent reinforcement learning dealing with hybrid action spaces: A case study for off-grid oriented renewable building energy system," Applied Energy, Elsevier, vol. 326(C).
    17. Lei, Yue & Zhan, Sicheng & Ono, Eikichi & Peng, Yuzhen & Zhang, Zhiang & Hasama, Takamasa & Chong, Adrian, 2022. "A practical deep reinforcement learning framework for multivariate occupant-centric control in buildings," Applied Energy, Elsevier, vol. 324(C).
    18. Blad, C. & Bøgh, S. & Kallesøe, C. & Raftery, Paul, 2023. "A laboratory test of an Offline-trained Multi-Agent Reinforcement Learning Algorithm for Heating Systems," Applied Energy, Elsevier, vol. 337(C).
    19. Coraci, Davide & Brandi, Silvio & Hong, Tianzhen & Capozzoli, Alfonso, 2023. "Online transfer learning strategy for enhancing the scalability and deployment of deep reinforcement learning control in smart buildings," Applied Energy, Elsevier, vol. 333(C).
    20. Fan, Cheng & Lei, Yutian & Sun, Yongjun & Piscitelli, Marco Savino & Chiosa, Roberto & Capozzoli, Alfonso, 2022. "Data-centric or algorithm-centric: Exploiting the performance of transfer learning for improving building energy predictions in data-scarce context," Energy, Elsevier, vol. 240(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:263:y:2023:i:pb:s0360544222025658. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows you to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form.

    If you know of missing items citing this one, you can help us create those links by adding the relevant references in the same way as above, for each referring item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.