IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v404y2026ics0306261925019117.html

Buildings-to-grid with generalized energy storage: A multi-agent decomposed deep reinforcement learning approach for delayed rewards

Author

Listed:
  • Jin, Jiahui
  • Sun, Guoqiang
  • Chen, Sheng
  • Li, Yaping
  • Zhu, Hong
  • Mao, Wenbo
  • Ji, Wenlu

Abstract

The growing penetration of distributed renewable energy and flexible building loads is intensifying the bidirectional building-to-grid (BtG) coupling. However, the inherent heterogeneity between electrochemical batteries and comfort-coupled thermal storage complicates coordinated control. To bridge this gap, the present study proposes a generalized energy storage system (GESS) that represents both devices with a common state of charge and generalized charge/discharge power. An adaptive self-loss term captures both battery self-discharge and temperature-dependent passive heat exchange. The model maps the generalized power of thermal energy storage to equivalent electrical power, while accounting for the thermal inertia of internal spaces and heating, ventilation, and air-conditioning systems. To address delayed rewards in the GESS, a multi-agent decomposed deep reinforcement learning approach is developed. The control problem is formulated as a sequential partially observable Markov decision process with a dual-critic architecture that redistributes immediate rewards to construct delayed rewards. Decentralized actors are optimized using a clipped surrogate objective with combined advantage estimates and control variate stabilization. Numerical experiments on the test system demonstrate that the proposed method enhances building profitability and reduces grid operating costs.

Suggested Citation

  • Jin, Jiahui & Sun, Guoqiang & Chen, Sheng & Li, Yaping & Zhu, Hong & Mao, Wenbo & Ji, Wenlu, 2026. "Buildings-to-grid with generalized energy storage: A multi-agent decomposed deep reinforcement learning approach for delayed rewards," Applied Energy, Elsevier, vol. 404(C).
  • Handle: RePEc:eee:appene:v:404:y:2026:i:c:s0306261925019117
    DOI: 10.1016/j.apenergy.2025.127181
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261925019117
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2025.127181?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Alzahrani, Ahmad & Sajjad, Khizar & Hafeez, Ghulam & Murawwat, Sadia & Khan, Sheraz & Khan, Farrukh Aslam, 2023. "Real-time energy optimization and scheduling of buildings integrated with renewable microgrid," Applied Energy, Elsevier, vol. 335(C).
    2. Bastida, Hector & De la Cruz-Loredo, Ivan & Saikia, Pranaynil & Ugalde-Loo, Carlos E., 2024. "Discrete-time state-of-charge estimator for latent heat thermal energy storage units based on a recurrent neural network," Applied Energy, Elsevier, vol. 371(C).
    3. Hanson, Eric & Canfield, Casey & Fikru, Mahelet G. & Sumner, Jenny, 2023. "State-level trends in renewable energy procurement via solar installation versus green electricity," Renewable Energy, Elsevier, vol. 218(C).
    4. Yu, Peipei & Zhang, Hongcai & Hu, Zechun & Song, Yonghua, 2025. "Voltage control of distribution grid with district cooling systems based on scenario-classified reinforcement learning," Applied Energy, Elsevier, vol. 377(PB).
    5. Dong, Bing & Li, Zhaoxuan & Taha, Ahmad & Gatsis, Nikolaos, 2018. "Occupancy-based buildings-to-grid integration framework for smart and connected communities," Applied Energy, Elsevier, vol. 219(C), pages 123-137.
    6. Zhang, Xiaoyi & Liu, Yan & Li, Yanxue & Lv, Xiaoyu & Xiao, Fu & Gao, Weijun, 2024. "Analyzing variability and coordinated demand management for various flexible integrations of residential distributed energy resources," Renewable Energy, Elsevier, vol. 237(PA).
    7. Meng, He & Jia, Hongjie & Xu, Tao & Sun, Jianhang & Wang, Rujing & Wang, Jie, 2024. "Trading mechanism of distributed shared energy storage system considering voltage regulation," Applied Energy, Elsevier, vol. 374(C).
    8. Razmara, M. & Bharati, G.R. & Hanover, Drew & Shahbakhti, M. & Paudyal, S. & Robinett, R.D., 2017. "Building-to-grid predictive power flow control for demand response and demand flexibility programs," Applied Energy, Elsevier, vol. 203(C), pages 128-141.
    9. Khan, Abdul Mateen & Tariq, Muhammad Abubakar & Alam, Zeshan & Alaloul, Wesam Salah & Waqar, Ahsan, 2024. "Optimizing energy efficiency through building orientation and building information modelling (BIM) in diverse terrains: A case study in Pakistan," Energy, Elsevier, vol. 311(C).
    10. Aghdam, Farid Hamzeh & Mudiyanselage, Manthila Wijesooriya & Mohammadi-Ivatloo, Behnam & Marzband, Mousa, 2023. "Optimal scheduling of multi-energy type virtual energy storage system in reconfigurable distribution networks for congestion management," Applied Energy, Elsevier, vol. 333(C).
    11. Li, Yanxue & Wang, Zixuan & Xu, Wenya & Gao, Weijun & Xu, Yang & Xiao, Fu, 2023. "Modeling and energy dynamic control for a ZEH via hybrid model-based deep reinforcement learning," Energy, Elsevier, vol. 277(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Rehman, Obaid Ur & Khan, Shahid A. & Javaid, Nadeem, 2021. "Decoupled building-to-transmission-network for frequency support in PV systems dominated grid," Renewable Energy, Elsevier, vol. 178(C), pages 930-945.
    2. Abdellatif Soussi & Enrico Zero & Alessandro Bozzi & Roberto Sacile, 2024. "Enhancing Energy Systems and Rural Communities through a System of Systems Approach: A Comprehensive Review," Energies, MDPI, vol. 17(19), pages 1-43, October.
    3. Liu, Yang & Yu, Nanpeng & Wang, Wei & Guan, Xiaohong & Xu, Zhanbo & Dong, Bing & Liu, Ting, 2018. "Coordinating the operations of smart buildings in smart grids," Applied Energy, Elsevier, vol. 228(C), pages 2510-2525.
    4. Ang, Yu Qian & Berzolla, Zachary Michael & Reinhart, Christoph F., 2020. "From concept to application: A review of use cases in urban building energy modeling," Applied Energy, Elsevier, vol. 279(C).
    5. Chen, Yongbao & Chen, Zhe & Xu, Peng & Li, Weilin & Sha, Huajing & Yang, Zhiwei & Li, Guowen & Hu, Chonghe, 2019. "Quantification of electricity flexibility in demand response: Office building case study," Energy, Elsevier, vol. 188(C).
    6. Yu, Peipei & Zhang, Hongcai & Song, Yonghua & Wang, Zhenyi & Dong, Huiyu & Ji, Liang, 2025. "Safe reinforcement learning for power system control: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 223(C).
    7. Shan, Kui & Wang, Shengwei & Zhuang, Chaoqun, 2021. "Controlling a large constant speed centrifugal chiller to provide grid frequency regulation: A validation based on onsite tests," Applied Energy, Elsevier, vol. 300(C).
    8. Sulman Shahzad & Muhammad Abbas Abbasi & Hassan Ali & Muhammad Iqbal & Rania Munir & Heybet Kilic, 2023. "Possibilities, Challenges, and Future Opportunities of Microgrids: A Review," Sustainability, MDPI, vol. 15(8), pages 1-28, April.
    9. Fernando Andrade & Drielli Peyerl & Claudia A. de Mattos, 2025. "Framework for Investment in Electricity Distribution to Enable Energy Transition," Energies, MDPI, vol. 18(3), pages 1-15, February.
    10. Bay, Christopher J. & Chintala, Rohit & Chinde, Venkatesh & King, Jennifer, 2022. "Distributed model predictive control for coordinated, grid-interactive buildings," Applied Energy, Elsevier, vol. 312(C).
    11. Liu, Mingzhe & Guo, Mingyue & Fu, Yangyang & O’Neill, Zheng & Gao, Yuan, 2024. "Expert-guided imitation learning for energy management: Evaluating GAIL’s performance in building control applications," Applied Energy, Elsevier, vol. 372(C).
    12. Qianwen Li & Zhilong Chen & Jialin Min & Mengjie Xu & Yanhong Zhan & Wenyue Zhang & Chuanwang Sun, 2024. "Hybrid transaction model for optimizing the distributed power trading market," Humanities and Social Sciences Communications, Palgrave Macmillan, vol. 11(1), pages 1-13, December.
    13. Kang, Kai & Su, Yifan & Yang, Peng & Wang, Zhaojian & Liu, Feng, 2025. "Securing long-term dispatch of isolated microgrids with high-penetration renewable generation: A controlled evolution-based framework," Applied Energy, Elsevier, vol. 381(C).
    14. Mahuze, Richard A. & Amadeh, Ali & Yuan, Bo & Zhang, K. Max, 2025. "Collaborative optimization framework for capacity planning of a prosumer-based peer-to-peer electricity trading community," Applied Energy, Elsevier, vol. 384(C).
    15. Xie, Peilin & Zhou, Fan & Tan, Sen & Liso, Vincenzo & Sahlin, Simon Lennart, 2025. "Development of a two-layer control and management system for a residential microgrid with HT-PEMFC-based micro-CHP," Applied Energy, Elsevier, vol. 381(C).
    16. Hua, Lyu-Guang & Hafeez, Ghulam & Alghamdi, Baheej & Alghamdi, Hisham & Khan, Farrukh Aslam & Ullah, Safeer, 2025. "Demand response with pricing schemes and consumers mode constraints for energy balancing in smart grids," Applied Energy, Elsevier, vol. 377(PB).
    17. Niu, Jide & Li, Xiaoyuan & Tian, Zhe & Yang, Hongxing, 2024. "Uncertainty analysis of the electric vehicle potential for a household to enhance robustness in decision on the EV/V2H technologies," Applied Energy, Elsevier, vol. 365(C).
    18. Jia, Xueyong & Xia, Yang & Yan, Ziming & Gao, Hongjun & Qiu, Dawei & Guerrero, Josep M. & Li, Zhengmao, 2025. "Coordinated operation of multi-energy microgrids considering green hydrogen and congestion management via a safe policy learning approach," Applied Energy, Elsevier, vol. 401(PA).
    19. Yamaguchi, Yohei & Shoda, Yuto & Yoshizawa, Shinya & Imai, Tatsuya & Perwez, Usama & Shimoda, Yoshiyuki & Hayashi, Yasuhiro, 2023. "Feasibility assessment of net zero-energy transformation of building stock using integrated synthetic population, building stock, and power distribution network framework," Applied Energy, Elsevier, vol. 333(C).
    20. Capone, Martina & Guelpa, Elisa & Verda, Vittorio, 2021. "Multi-objective optimization of district energy systems with demand response," Energy, Elsevier, vol. 227(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:404:y:2026:i:c:s0306261925019117. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.