IDEAS home Printed from https://ideas.repec.org/a/gam/jeners/v18y2025i17p4698-d1741932.html
   My bibliography  Save this article

Energy Regulation-Aware Layered Control Architecture for Building Energy Systems Using Constraint-Aware Deep Reinforcement Learning and Virtual Energy Storage Modeling

Author

Listed:
  • Siwei Li

    (Yangtze University College of Arts and Sciences, Jingzhou 434025, China)

  • Congxiang Tian

    (Yangtze University College of Arts and Sciences, Jingzhou 434025, China)

  • Ahmed N. Abdalla

    (Faculty of Electronic Information Engineering, Huaiyin Institute of Technology, Huaian 223003, China)

Abstract

In modern intelligent buildings, the control of Building Energy Systems (BES) faces increasing complexity in balancing energy costs, thermal comfort, and operational flexibility. Traditional centralized or flat deep reinforcement learning (DRL) methods often fail to effectively handle the multi-timescale dynamics, large state–action spaces, and strict constraint satisfaction required for real-world energy systems. To address these challenges, this paper proposes an energy policy-aware layered control architecture that combines Virtual Energy Storage System (VESS) modeling with a novel Dynamic Constraint-Aware Policy Optimization (DCPO) algorithm. The VESS is modeled based on the thermal inertia of building envelope components, quantifying flexibility in terms of virtual power, capacity, and state of charge, thus enabling BES to behave as if it had embedded, non-physical energy storage. Building on this, the BES control problem is structured using a hierarchical Markov Decision Process, in which the upper level handles strategic decisions (e.g., VESS dispatch, HVAC modes), while the lower level manages real-time control (e.g., temperature adjustments, load balancing). The proposed DCPO algorithm extends actor–critic learning by incorporating dynamic policy constraints, entropy regularization, and adaptive clipping to ensure feasible and efficient policy learning under both operational and comfort-related constraints. Simulation experiments demonstrate that the proposed approach outperforms established algorithms like Deep Q-Networks (DQN), Deep Deterministic Policy Gradient (DDPG), and Twin Delayed DDPG (TD3). Specifically, it achieves a 32.6% reduction in operational costs and over a 51% decrease in thermal comfort violations compared to DQN, while ensuring millisecond-level policy generation suitable for real-time BES deployment.

Suggested Citation

  • Siwei Li & Congxiang Tian & Ahmed N. Abdalla, 2025. "Energy Regulation-Aware Layered Control Architecture for Building Energy Systems Using Constraint-Aware Deep Reinforcement Learning and Virtual Energy Storage Modeling," Energies, MDPI, vol. 18(17), pages 1-23, September.
  • Handle: RePEc:gam:jeners:v:18:y:2025:i:17:p:4698-:d:1741932
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/1996-1073/18/17/4698/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/1996-1073/18/17/4698/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Mu, Yunfei & Xu, Yanze & Zhang, Jiarui & Wu, Zeqing & Jia, Hongjie & Jin, Xiaolong & Qi, Yan, 2023. "A data-driven rolling optimization control approach for building energy systems that integrate virtual energy storage systems," Applied Energy, Elsevier, vol. 346(C).
    2. Jendoubi, Imen & Bouffard, François, 2023. "Multi-agent hierarchical reinforcement learning for energy management," Applied Energy, Elsevier, vol. 332(C).
    3. Arroyo, Javier & Manna, Carlo & Spiessens, Fred & Helsen, Lieve, 2022. "Reinforced model predictive control (RL-MPC) for building energy management," Applied Energy, Elsevier, vol. 309(C).
    4. Cui, Feifei & An, Dou & Xi, Huan, 2024. "Integrated energy hub dispatch with a multi-mode CAES–BESS hybrid system: An option-based hierarchical reinforcement learning approach," Applied Energy, Elsevier, vol. 374(C).
    5. Korkas, Christos D. & Baldi, Simone & Michailidis, Iakovos & Kosmatopoulos, Elias B., 2015. "Intelligent energy and thermal comfort management in grid-connected microgrids with heterogeneous occupancy schedule," Applied Energy, Elsevier, vol. 149(C), pages 194-203.
    6. Yang, Ting & Zhao, Liyuan & Li, Wei & Wu, Jianzhong & Zomaya, Albert Y., 2021. "Towards healthy and cost-effective indoor environment management in smart homes: A deep reinforcement learning approach," Applied Energy, Elsevier, vol. 300(C).
    7. Baldi, Simone & Korkas, Christos D. & Lv, Maolong & Kosmatopoulos, Elias B., 2018. "Automating occupant-building interaction via smart zoning of thermostatic loads: A switched self-tuning approach," Applied Energy, Elsevier, vol. 231(C), pages 1246-1258.
    8. Korkas, Christos D. & Baldi, Simone & Michailidis, Iakovos & Kosmatopoulos, Elias B., 2016. "Occupancy-based demand response and thermal comfort optimization in microgrids with renewable energy sources and energy storage," Applied Energy, Elsevier, vol. 163(C), pages 93-104.
    9. Zhang, Bin & Hu, Weihao & Ghias, Amer M.Y.M. & Xu, Xiao & Chen, Zhe, 2022. "Multi-agent deep reinforcement learning-based coordination control for grid-aware multi-buildings," Applied Energy, Elsevier, vol. 328(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Song, Kwonsik & Kim, Sooyoung & Park, Moonseo & Lee, Hyun-Soo, 2017. "Energy efficiency-based course timetabling for university buildings," Energy, Elsevier, vol. 139(C), pages 394-405.
    2. Tungom, Chia E. & Niu, Ben & Wang, Hong, 2025. "SWAPP: Swarm precision policy optimization with dynamic action bound adjustment for energy management in smart cities," Applied Energy, Elsevier, vol. 377(PA).
    3. Tungom, Chia E. & Wang, Hong & Beata, Kamuya & Niu, Ben, 2024. "SWOAM: Swarm optimized agents for energy management in grid-interactive connected buildings," Energy, Elsevier, vol. 301(C).
    4. Giaouris, Damian & Papadopoulos, Athanasios I. & Patsios, Charalampos & Walker, Sara & Ziogou, Chrysovalantou & Taylor, Phil & Voutetakis, Spyros & Papadopoulou, Simira & Seferlis, Panos, 2018. "A systems approach for management of microgrids considering multiple energy carriers, stochastic loads, forecasting and demand side response," Applied Energy, Elsevier, vol. 226(C), pages 546-559.
    5. Gao, Yuan & Hu, Zehuan & Yamate, Shun & Otomo, Junichiro & Chen, Wei-An & Liu, Mingzhe & Xu, Tingting & Ruan, Yingjun & Shang, Juan, 2025. "Unlocking predictive insights and interpretability in deep reinforcement learning for Building-Integrated Photovoltaic and Battery (BIPVB) systems," Applied Energy, Elsevier, vol. 384(C).
    6. Zheng, Zhuang & Pan, Jia & Huang, Gongsheng & Luo, Xiaowei, 2022. "A bottom-up intra-hour proactive scheduling of thermal appliances for household peak avoiding based on model predictive control," Applied Energy, Elsevier, vol. 323(C).
    7. Baldi, Simone & Korkas, Christos D. & Lv, Maolong & Kosmatopoulos, Elias B., 2018. "Automating occupant-building interaction via smart zoning of thermostatic loads: A switched self-tuning approach," Applied Energy, Elsevier, vol. 231(C), pages 1246-1258.
    8. Arcos-Aviles, Diego & Pascual, Julio & Guinjoan, Francesc & Marroyo, Luis & Sanchis, Pablo & Marietta, Martin P., 2017. "Low complexity energy management strategy for grid profile smoothing of a residential grid-connected microgrid using generation and demand forecasting," Applied Energy, Elsevier, vol. 205(C), pages 69-84.
    9. Ogunjuyigbe, A.S.O. & Ayodele, T.R. & Akinola, O.A., 2017. "User satisfaction-induced demand side load management in residential buildings with user budget constraint," Applied Energy, Elsevier, vol. 187(C), pages 352-366.
    10. Quashie, Mike & Bouffard, François & Joós, Géza, 2017. "Business cases for isolated and grid connected microgrids: Methodology and applications," Applied Energy, Elsevier, vol. 205(C), pages 105-115.
    11. Wang, Wei & Chen, Jiayu & Huang, Gongsheng & Lu, Yujie, 2017. "Energy efficient HVAC control for an IPS-enabled large space in commercial buildings through dynamic spatial occupancy distribution," Applied Energy, Elsevier, vol. 207(C), pages 305-323.
    12. Ayas Shaqour & Aya Hagishima, 2022. "Systematic Review on Deep Reinforcement Learning-Based Energy Management for Different Building Types," Energies, MDPI, vol. 15(22), pages 1-27, November.
    13. Baldi, Simone & Yuan, Shuai & Endel, Petr & Holub, Ondrej, 2016. "Dual estimation: Constructing building energy models from data sampled at low rate," Applied Energy, Elsevier, vol. 169(C), pages 81-92.
    14. Fazli Wahid & Muhammad Fayaz & Ayman Aljarbouh & Masood Mir & Muhammad Aamir & Imran, 2020. "Energy Consumption Optimization and User Comfort Maximization in Smart Buildings Using a Hybrid of the Firefly and Genetic Algorithms," Energies, MDPI, vol. 13(17), pages 1-26, August.
    15. Favero, Matteo & Kloppenborg Møller, Jan & Calì, Davide & Carlucci, Salvatore, 2022. "Human-in-the-loop methods for occupant-centric building design and operation," Applied Energy, Elsevier, vol. 325(C).
    16. Jazizadeh, Farrokh & Jung, Wooyoung, 2018. "Personalized thermal comfort inference using RGB video images for distributed HVAC control," Applied Energy, Elsevier, vol. 220(C), pages 829-841.
    17. Rajvikram Madurai Elavarasan & Aritra Ghosh & Tapas K. Mallick & Apoorva Krishnamurthy & Meenal Saravanan, 2019. "Investigations on Performance Enhancement Measures of the Bidirectional Converter in PV–Wind Interconnected Microgrid System," Energies, MDPI, vol. 12(14), pages 1-22, July.
    18. Fontenot, Hannah & Dong, Bing, 2019. "Modeling and control of building-integrated microgrids for optimal energy management – A review," Applied Energy, Elsevier, vol. 254(C).
    19. Hyeunguk Ahn & Jingjing Liu & Donghun Kim & Rongxin Yin & Tianzhen Hong & Mary Ann Piette, 2021. "How Can Floor Covering Influence Buildings’ Demand Flexibility?," Energies, MDPI, vol. 14(12), pages 1-17, June.
    20. Wang, Zixuan & Xiao, Fu & Ran, Yi & Li, Yanxue & Xu, Yang, 2024. "Scalable energy management approach of residential hybrid energy system using multi-agent deep reinforcement learning," Applied Energy, Elsevier, vol. 367(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jeners:v:18:y:2025:i:17:p:4698-:d:1741932. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.