Author
Listed:
- Jiang, Ben
- Zhang, Chengyu
- Li, Yu
- Rezgui, Yacine
- Luo, Zhiwen
- Ghoroghi, Ali
- Wang, Peng
- Zhao, Tianyi
Abstract
With the diversification of building functions and the popularization of electric vehicles, the difficulty in the operation and regulation of community microgrids that integrate renewable energy and energy storage systems has increased significantly. This study integrates real data, public datasets, and a scenario-based data generation framework for charging piles to construct a virtual community-level microgrid system. To increase the number of effective samples in reinforcement learning training, a semantic masking mechanism with environmental perception capability is introduced to achieve real-time optimal regulation of the microgrid under dynamic electricity price scenarios. The study conducted a systematic analysis under three action step-size settings and a 30-day hourly optimization scenario, integrating multiple baseline models with multidimensional evaluation metrics. The results indicate that the generated charging pile power data accurately reflects both the consistency of group charging behavior and individual variation characteristics. Compared to baseline models, the Semantic mask DQN policy achieves an average reduction of 1.21%-3.73% in total electricity consumption, while simultaneously realizing operational cost savings of 1.76%-5.86%. This strategy effectively enhances the training stability of reinforcement learning models and significantly reduces the frequency of boundary triggers in energy storage systems. Under this framework, microgrids have enhanced their ability to cope with short-term power outage scenarios. The findings of this study provide intelligent optimization approaches and theoretical support for the efficient and low-carbon operation of building cluster microgrids with charging regions.
Suggested Citation
Jiang, Ben & Zhang, Chengyu & Li, Yu & Rezgui, Yacine & Luo, Zhiwen & Ghoroghi, Ali & Wang, Peng & Zhao, Tianyi, 2026.
"Deep reinforcement learning strategy integrating environmental perception and semantic masking for real-time operational optimization of building cluster microgrids,"
Energy, Elsevier, vol. 349(C).
Handle:
RePEc:eee:energy:v:349:y:2026:i:c:s0360544226007796
DOI: 10.1016/j.energy.2026.140676
Download full text from publisher
As the access to this document is restricted, you may want to
for a different version of it.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:energy:v:349:y:2026:i:c:s0360544226007796. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.journals.elsevier.com/energy .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.