IDEAS home Printed from https://ideas.repec.org/a/eee/energy/v332y2025ics0360544225027240.html

Multi-agent distributed reinforcement learning for energy-efficient thermal comfort control in multi-zone buildings with diverse occupancy patterns

Author

Listed:
  • Tariq, Shahzeb
  • Ali, Usama
  • Kim, Sangyoun
  • Yoo, ChangKyoo

Abstract

The rapid development of smart cities and automated infrastructures has increased building electricity demand, particularly from heating, ventilation and air conditioning (HVAC) systems. Current HVAC control methods primarily address short-term dynamics and single-zone scenarios, overlooking complexities from seasonal variability and diverse occupancy patterns in multizone buildings. Furthermore, existing data-driven frameworks lack mechanisms to transfer control policies across buildings with different thermal zone configurations. To address these limitations, this study proposes a decentralized multi-agent reinforcement learning framework for energy-efficient thermal comfort management in multizone buildings. Transfer reinforcement learning enables efficient adaptation of control strategies to buildings with differing zone configurations. Results demonstrate that occupancy and zone-specific control actions effectively balance energy efficiency and occupant comfort. The proposed method maintains thermal comfort within acceptable levels while reducing grid energy import by 51.7 % compared to conventional rule-based methods. Assigning a higher energy weight in the decentralized network structure achieved an additional 23 % reduction in energy use. The transfer learning approach successfully adapted control policies from a nine-zone office to a five-zone residential building with limited monitoring data and reduced building load by 6.4 %. Practically, this approach significantly reduces training data requirements and accelerates model deployment. Collectively, these enhancements provide building operators with effective tools to achieve significant energy savings and support city-level sustainability efforts.

Suggested Citation

  • Tariq, Shahzeb & Ali, Usama & Kim, Sangyoun & Yoo, ChangKyoo, 2025. "Multi-agent distributed reinforcement learning for energy-efficient thermal comfort control in multi-zone buildings with diverse occupancy patterns," Energy, Elsevier, vol. 332(C).
  • Handle: RePEc:eee:energy:v:332:y:2025:i:c:s0360544225027240
    DOI: 10.1016/j.energy.2025.137082
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0360544225027240
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.energy.2025.137082?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Lou, Juwei & Cao, Hua & Meng, Xin & Wang, Yaxiong & Wang, Jiangfeng & Chen, Liangqi & Sun, Lu & Wang, Mengxuan, 2024. "Power load analysis and configuration optimization of solar thermal-PV hybrid microgrid based on building," Energy, Elsevier, vol. 289(C).
    2. Wang, Chendong & Yuan, Jianjuan & Huang, Ke & Zhang, Ji & Zheng, Lihong & Zhou, Zhihua & Zhang, Yufeng, 2022. "Research on thermal load prediction of district heating station based on transfer learning," Energy, Elsevier, vol. 239(PE).
    3. Khalilnejad, Arash & French, Roger H. & Abramson, Alexis R., 2021. "Evaluation of cooling setpoint setback savings in commercial buildings using electricity and exterior temperature time series data," Energy, Elsevier, vol. 233(C).
    4. Farrokhi, Meysam & Javani, Nader & Motallebzadeh, Roghayyeh & Ebrahimpour, Abdolsalam, 2022. "Dynamic simulation and optimization of a novel energy system with Hydrogen energy storage for hotel buildings," Energy, Elsevier, vol. 257(C).
    5. Alammar, Ahmed A. & Rezk, Ahmed & Alaswad, Abed & Fernando, Julia & Olabi, A.G. & Decker, Stephanie & Ruhumuliza, Joseph & Gasana, Quénan, 2022. "The technical, economic, and environmental feasibility of a bioheat-driven adsorption cooling system for food cold storing: A case study of Rwanda," Energy, Elsevier, vol. 258(C).
    6. Qin, Haosen & Meng, Tao & Chen, Kan & Li, Zhengwei, 2024. "A comparative study of DQN and D3QN for HVAC system optimization control," Energy, Elsevier, vol. 307(C).
    7. Inayat, Abrar & Raza, Mohsin, 2019. "District cooling system via renewable energy sources: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 107(C), pages 360-373.
    8. Li, Guannan & Chen, Liang & Liu, Jiangyan & Fang, Xi, 2023. "Comparative study on deep transfer learning strategies for cross-system and cross-operation-condition building energy systems fault diagnosis," Energy, Elsevier, vol. 263(PD).
    9. Chen, Yibo & Zhang, Fengyi & Berardi, Umberto, 2020. "Day-ahead prediction of hourly subentry energy consumption in the building sector using pattern recognition algorithms," Energy, Elsevier, vol. 211(C).
    10. Zhang, Boyan & Rezgui, Yacine & Luo, Zhiwen & Zhao, Tianyi, 2024. "Fault detection research on novel transfer learning-based method for cross-condition, cross-system and cross-operation in public building HVAC sensors," Energy, Elsevier, vol. 313(C).
    11. Cui, Can & Xue, Jing, 2024. "Energy and comfort aware operation of multi-zone HVAC system through preference-inspired deep reinforcement learning," Energy, Elsevier, vol. 292(C).
    12. Fang, Xi & Gong, Guangcai & Li, Guannan & Chun, Liang & Peng, Pei & Li, Wenqiang & Shi, Xing, 2023. "Cross temporal-spatial transferability investigation of deep reinforcement learning control strategy in the building HVAC system level," Energy, Elsevier, vol. 263(PB).
    13. Fan, Cheng & Sun, Yongjun & Xiao, Fu & Ma, Jie & Lee, Dasheng & Wang, Jiayuan & Tseng, Yen Chieh, 2020. "Statistical investigations of transfer learning-based methodology for short-term building energy predictions," Applied Energy, Elsevier, vol. 262(C).
    14. Blad, Christian & Bøgh, Simon & Kallesøe, Carsten Skovmose, 2022. "Data-driven Offline Reinforcement Learning for HVAC-systems," Energy, Elsevier, vol. 261(PB).
    15. Battaglia, Vittoria & Vanoli, Laura & Verde, Clara & Nithiarasu, Perumal & Searle, Justin R., 2023. "Dynamic modelling of geothermal heat pump system coupled with positive-energy building," Energy, Elsevier, vol. 284(C).
    16. Lu, Ruyuan & Li, Xin & Chen, Ronghao & Lei, Aimin & Ma, Xiaoming, 2024. "An Alternative Reinforcement Learning (ARL) control strategy for data center air-cooled HVAC systems," Energy, Elsevier, vol. 308(C).
    17. Du, Yan & Zandi, Helia & Kotevska, Olivera & Kurte, Kuldeep & Munk, Jeffery & Amasyali, Kadir & Mckee, Evan & Li, Fangxing, 2021. "Intelligent multi-zone residential HVAC control strategy based on deep reinforcement learning," Applied Energy, Elsevier, vol. 281(C).
    18. Liu, Xiangfei & Ren, Mifeng & Yang, Zhile & Yan, Gaowei & Guo, Yuanjun & Cheng, Lan & Wu, Chengke, 2022. "A multi-step predictive deep reinforcement learning algorithm for HVAC control systems in smart buildings," Energy, Elsevier, vol. 259(C).
    19. Barone, Giovanni & Buonomano, Annamaria & Del Papa, Gianluca & Giuzio, Giovanni Francesco & Palombo, Adolfo & Russo, Giuseppe, 2025. "Towards sustainable ships: Advancing energy efficiency of HVAC systems onboard through digital twin," Energy, Elsevier, vol. 317(C).
    20. Choi, Kwangwon & Lee, Donggun & Park, Semi & Joe, Jaewan, 2024. "Infrared signal-based implementation of model-based predictive control (MPC) for cost saving in a campus building," Energy, Elsevier, vol. 306(C).
    21. Rashad, Magdi & Żabnieńska-Góra, Alina & Norman, Les & Jouhara, Hussam, 2022. "Analysis of energy demand in a residential building using TRNSYS," Energy, Elsevier, vol. 254(PB).
    22. Cui, Can & Xue, Jiahui & Liu, Lanjun, 2025. "Optimal control of HVAC systems through active disturbance rejection control-assisted reinforcement learning," Energy, Elsevier, vol. 323(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Wenxiao Chu & Maria Vicidomini & Francesco Calise & Neven Duić & Poul Alberg Østergaard & Qiuwang Wang, 2025. "Innovative Solutions for a Sustainable Future: Main Topics of Selected Papers in the 19th SDEWES Conference in 2024," Energies, MDPI, vol. 18(17), pages 1-17, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Sulaiman, Mohd Herwan & Mustaffa, Zuriani, 2024. "Chiller energy prediction in commercial building: A metaheuristic-Enhanced deep learning approach," Energy, Elsevier, vol. 297(C).
    2. Han, Hua & Ren, Zhengxiong & Cui, Xiaoyu & Gu, Bo, 2025. "Variable-condition fault diagnosis for building chiller based on deep feature extraction and discrepancy minimization," Energy, Elsevier, vol. 330(C).
    3. Yan, Ke & He, Changfu & Wang, Chuan & Gao, Yuan & Du, Yang & Afshari, Afshin, 2026. "A few-shot learning framework for HVAC fault diagnosis in data centers with minimal data required," Applied Energy, Elsevier, vol. 402(PC).
    4. Zhang, Boyan & Wang, Jiaming & Rezgui, Yacine & Zhao, Tianyi, 2025. "Enhancing the generalizability of public building energy system fault detection method: A research on unknown multi-source fault detection and diagnosis method based on data-driven heuristic reasoning (DHR)," Energy, Elsevier, vol. 335(C).
    5. Cui, Can & Xue, Jiahui & Liu, Lanjun, 2025. "Optimal control of HVAC systems through active disturbance rejection control-assisted reinforcement learning," Energy, Elsevier, vol. 323(C).
    6. Zhuang, Dian & Gan, Vincent J.L. & Duygu Tekler, Zeynep & Chong, Adrian & Tian, Shuai & Shi, Xing, 2023. "Data-driven predictive control for smart HVAC system in IoT-integrated buildings with time-series forecasting and reinforcement learning," Applied Energy, Elsevier, vol. 338(C).
    7. Qin, Haosen & Meng, Tao & Chen, Kan & Li, Zhengwei, 2024. "A comparative study of DQN and D3QN for HVAC system optimization control," Energy, Elsevier, vol. 307(C).
    8. Chen, Siliang & Liang, Xinbin & Liu, Ying & Li, Xilin & Jin, Xinqiao & Du, Zhimin, 2025. "Customized large-scale model for human-AI collaborative operation and maintenance management of building energy systems," Applied Energy, Elsevier, vol. 393(C).
    9. Guo, Fangzhou & Ham, Sang woo & Kim, Donghun & Moon, Hyeun Jun, 2025. "Deep reinforcement learning control for co-optimizing energy consumption, thermal comfort, and indoor air quality in an office building," Applied Energy, Elsevier, vol. 377(PA).
    10. Cui, Can & Xue, Jing, 2024. "Energy and comfort aware operation of multi-zone HVAC system through preference-inspired deep reinforcement learning," Energy, Elsevier, vol. 292(C).
    11. Fang, Xi & Gong, Guangcai & Li, Guannan & Chun, Liang & Peng, Pei & Li, Wenqiang & Shi, Xing, 2023. "Cross temporal-spatial transferability investigation of deep reinforcement learning control strategy in the building HVAC system level," Energy, Elsevier, vol. 263(PB).
    12. Zhang, Boyan & Rezgui, Yacine & Luo, Zhiwen & Zhao, Tianyi, 2024. "Fault detection research on novel transfer learning-based method for cross-condition, cross-system and cross-operation in public building HVAC sensors," Energy, Elsevier, vol. 313(C).
    13. Chen, Siliang & Ge, Wei & Liang, Xinbin & Jin, Xinqiao & Du, Zhimin, 2024. "Lifelong learning with deep conditional generative replay for dynamic and adaptive modeling towards net zero emissions target in building energy system," Applied Energy, Elsevier, vol. 353(PB).
    14. Zhou, Xinlei & Du, Han & Xue, Shan & Ma, Zhenjun, 2024. "Recent advances in data mining and machine learning for enhanced building energy management," Energy, Elsevier, vol. 307(C).
    15. Li, Chuang & Li, Guojie & Wang, Keyou & Han, Bei, 2022. "A multi-energy load forecasting method based on parallel architecture CNN-GRU and transfer learning for data deficient integrated energy systems," Energy, Elsevier, vol. 259(C).
    16. Kangji Li & Borui Wei & Qianqian Tang & Yufei Liu, 2022. "A Data-Efficient Building Electricity Load Forecasting Method Based on Maximum Mean Discrepancy and Improved TrAdaBoost Algorithm," Energies, MDPI, vol. 15(23), pages 1-18, November.
    17. Park, Jong-Whi & Ju, Young-Min & Kim, You-Gwon & Kim, Hak-Sung, 2023. "50% reduction in energy consumption in an actual cold storage facility using a deep reinforcement learning-based control algorithm," Applied Energy, Elsevier, vol. 352(C).
    18. Blad, C. & Bøgh, S. & Kallesøe, C. & Raftery, Paul, 2023. "A laboratory test of an Offline-trained Multi-Agent Reinforcement Learning Algorithm for Heating Systems," Applied Energy, Elsevier, vol. 337(C).
    19. Homod, Raad Z. & Mohammed, Hayder Ibrahim & Abderrahmane, Aissa & Alawi, Omer A. & Khalaf, Osamah Ibrahim & Mahdi, Jasim M. & Guedri, Kamel & Dhaidan, Nabeel S. & Albahri, A.S. & Sadeq, Abdellatif M. , 2023. "Deep clustering of Lagrangian trajectory for multi-task learning to energy saving in intelligent buildings using cooperative multi-agent," Applied Energy, Elsevier, vol. 351(C).
    20. Li, Guannan & Chen, Liang & Liu, Jiangyan & Fang, Xi, 2023. "Comparative study on deep transfer learning strategies for cross-system and cross-operation-condition building energy systems fault diagnosis," Energy, Elsevier, vol. 263(PD).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:332:y:2025:i:c:s0360544225027240. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.