IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v404y2026ics0306261925019117.html

Buildings-to-grid with generalized energy storage: A multi-agent decomposed deep reinforcement learning approach for delayed rewards

Author

Listed:
  • Jin, Jiahui
  • Sun, Guoqiang
  • Chen, Sheng
  • Li, Yaping
  • Zhu, Hong
  • Mao, Wenbo
  • Ji, Wenlu

Abstract

The growing penetration of distributed renewable energy and flexible building loads is intensifying the bidirectional building-to-grid (BtG) coupling. However, the inherent heterogeneity between electrochemical batteries and comfort-coupled thermal storage complicates coordinated control. To bridge this gap, the present study proposes a generalized energy storage system (GESS) that represents both devices with a common state of charge and generalized charge/discharge power. An adaptive self-loss term captures both battery self-discharge and temperature-dependent passive heat exchange. The model maps the generalized power of thermal energy storage to equivalent electrical power, while accounting for the thermal inertia of internal spaces and heating, ventilation, and air-conditioning systems. To address delayed rewards in the GESS, a multi-agent decomposed deep reinforcement learning approach is developed. The control problem is formulated as a sequential partially observable Markov decision process with a dual-critic architecture that redistributes immediate rewards to construct delayed rewards. Decentralized actors are optimized using a clipped surrogate objective with combined advantage estimates and control variate stabilization. Numerical experiments on the test system demonstrate that the proposed method enhances building profitability and reduces grid operating costs.
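The abstract names a clipped surrogate objective for the decentralized actors but does not give its form. As a minimal sketch, assuming the standard PPO-style clipped surrogate (the function name, the `clip_eps` default, and the plain per-sample advantages are illustrative assumptions, not the authors' formulation, which also involves combined advantage estimates and control variate stabilization), the objective can be computed as:

```python
import math


def clipped_surrogate(new_logp, old_logp, advantages, clip_eps=0.2):
    """Mean clipped surrogate objective (to be maximized), standard PPO form.

    Assumption: this is the generic textbook objective, not the paper's
    exact one. All names and the clip_eps default are hypothetical.
    """
    total = 0.0
    for lp_new, lp_old, adv in zip(new_logp, old_logp, advantages):
        # Probability ratio between the updated and the old policy.
        ratio = math.exp(lp_new - lp_old)
        # Restrict the ratio to [1 - clip_eps, 1 + clip_eps].
        clipped = max(1.0 - clip_eps, min(1.0 + clip_eps, ratio))
        # Take the pessimistic (smaller) of the two surrogate terms.
        total += min(ratio * adv, clipped * adv)
    return total / len(advantages)
```

Taking the minimum of the clipped and unclipped terms removes the incentive to move the policy far from the one that collected the data when doing so would increase the objective, while still penalizing harmful moves in full.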

Suggested Citation

  • Jin, Jiahui & Sun, Guoqiang & Chen, Sheng & Li, Yaping & Zhu, Hong & Mao, Wenbo & Ji, Wenlu, 2026. "Buildings-to-grid with generalized energy storage: A multi-agent decomposed deep reinforcement learning approach for delayed rewards," Applied Energy, Elsevier, vol. 404(C).
  • Handle: RePEc:eee:appene:v:404:y:2026:i:c:s0306261925019117
    DOI: 10.1016/j.apenergy.2025.127181

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261925019117
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2025.127181?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

