IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v18y2025i1p141-d1824089.html

Multi-Agent Reinforcement Learning for Sustainable Integration of Heterogeneous Resources in a Double-Sided Auction Market with Power Balance Incentive Mechanism

Author

Listed:
  • Jian Huang

    (State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)

  • Ming Yang

    (State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)

  • Li Wang

    (State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)

  • Mingxing Mei

    (State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)

  • Jianfang Ye

    (State Grid Zhejiang Electric Power Company Lishui Power Supply Company, Lishui 323000, China)

  • Kejia Liu

    (College of Electrical Engineering, Zhejiang University, Hangzhou 310027, China
    Zhejiang Key Laboratory of Electrical Technology and System on Renewable Energy, Hangzhou 310027, China)

  • Yaolong Bo

    (College of Electrical Engineering, Zhejiang University, Hangzhou 310027, China
    Zhejiang Key Laboratory of Electrical Technology and System on Renewable Energy, Hangzhou 310027, China)

Abstract

Traditional electricity market bidding typically focuses on unilateral structures, where independent energy storage units and flexible loads act merely as price takers. This reduces bidding motivation and weakens the balancing capability of regional power systems, thereby limiting the large-scale utilization of renewable energy. To address these challenges and support sustainable power system operation, this paper proposes a double-sided auction market strategy for heterogeneous multi-resource (HMR) participation based on multi-agent reinforcement learning (MARL). The framework explicitly considers the heterogeneous bidding and quantity reporting behaviors of renewable generation, flexible demand, and energy storage. An improved incentive mechanism is introduced to enhance real-time system power balance, thereby enabling higher renewable energy integration and reducing curtailment. To efficiently solve the market-clearing problem, an improved Multi-Agent Twin Delayed Deep Deterministic Policy Gradient (MATD3) algorithm is employed, along with a temporal-difference (TD) error-based prioritized experience replay mechanism to strengthen exploration. Case studies validate the effectiveness of the proposed approach in guiding heterogeneous resources toward cooperative bidding behaviors, improving market efficiency, and reinforcing the sustainable and resilient operation of future power systems.

Suggested Citation

  • Jian Huang & Ming Yang & Li Wang & Mingxing Mei & Jianfang Ye & Kejia Liu & Yaolong Bo, 2025. "Multi-Agent Reinforcement Learning for Sustainable Integration of Heterogeneous Resources in a Double-Sided Auction Market with Power Balance Incentive Mechanism," Sustainability, MDPI, vol. 18(1), pages 1-18, December.
  • Handle: RePEc:gam:jsusta:v:18:y:2025:i:1:p:141-:d:1824089
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/18/1/141/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/18/1/141/
    Download Restriction: no
    ---><---

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:18:y:2025:i:1:p:141-:d:1824089. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.