Author
Listed:
- Jian Huang
(State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)
- Ming Yang
(State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)
- Li Wang
(State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)
- Mingxing Mei
(State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)
- Jianfang Ye
(State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)
- Kejia Liu
(College of Electrical Engineering, Zhejiang University, Hangzhou 310027, China
Zhejiang Key Laboratory of Electrical Technology and System on Renewable Energy, Hangzhou 310027, China)
- Yaolong Bo
(College of Electrical Engineering, Zhejiang University, Hangzhou 310027, China
Zhejiang Key Laboratory of Electrical Technology and System on Renewable Energy, Hangzhou 310027, China)
Abstract
Traditional electricity market bidding typically focuses on unilateral structures, where independent energy storage units and flexible loads act merely as price takers. This reduces bidding motivation and weakens the balancing capability of regional power systems, thereby limiting the large-scale utilization of renewable energy. To address these challenges and support sustainable power system operation, this paper proposes a double-sided auction market strategy for heterogeneous multi-resource (HMR) participation based on multi-agent reinforcement learning (MARL). The framework explicitly considers the heterogeneous bidding and quantity reporting behaviors of renewable generation, flexible demand, and energy storage. An improved incentive mechanism is introduced to enhance real-time system power balance, thereby enabling higher renewable energy integration and reducing curtailment. To efficiently solve the market-clearing problem, an improved Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (MATD3) algorithm is employed, along with a temporal-difference (TD) error-based prioritized experience replay mechanism to strengthen exploration. Case studies validate the effectiveness of the proposed approach in guiding heterogeneous resources toward cooperative bidding behaviors, improving market efficiency, and reinforcing the sustainable and resilient operation of future power systems.
Suggested Citation
Jian Huang & Ming Yang & Li Wang & Mingxing Mei & Jianfang Ye & Kejia Liu & Yaolong Bo, 2025.
"Multi-Agent Reinforcement Learning for Sustainable Integration of Heterogeneous Resources in a Double-Sided Auction Market with Power Balance Incentive Mechanism,"
Sustainability, MDPI, vol. 18(1), pages 1-18, December.
Handle:
RePEc:gam:jsusta:v:18:y:2025:i:1:p:141-:d:1824089
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:18:y:2025:i:1:p:141-:d:1824089. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.