IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v402y2026ipbs0306261925016897.html

Multi-market agency-based power procurement strategies for power grid companies using reinforcement learning

Author

Listed:
  • Wang, Boyu
  • Xu, Xiaofeng
  • Wang, Peng

Abstract

At this critical stage of simultaneously advancing the construction of a new power system and power market reform, power grid companies face challenges in managing multifaceted risks and optimizing costs associated with high penetration of renewable energy in agency power procurement. Existing research has mainly studied distributed entities or single markets, which struggles to handle the complex challenges of dynamically linked multidimensional risks and multi-timescale decision-making in agency procurement. Therefore, a collaborative decision-making framework that integrates risk volatility modeling with multi-agent reinforcement learning is proposed. First, a multi-dimensional risk quantification model is developed that incorporates public opinion, supply security and compliance risks. Monte Carlo simulation is subsequently employed to characterize the dynamic distributions of these risks. Second, through risk-cost linkage analysis, an optimization model for agency power purchase transactions is constructed using the Asynchronous Advantage Actor-Critic (A3C) reinforcement learning algorithm, aiming to minimize both procurement costs and associated risks. Finally, case simulations verify the model's effectiveness and explore optimized procurement strategies under different risk scenarios. Simulation results demonstrate that the proposed model reduces procurement costs and total risk by 8.0 %–18.8 %, while reducing the dispersion of risk costs by 30.1 % under a combined risk strategy. This framework offers a methodological tool for enhancing risk control and cost optimization in agency power procurement operations for grid companies.

Suggested Citation

  • Wang, Boyu & Xu, Xiaofeng & Wang, Peng, 2026. "Multi-market agency-based power procurement strategies for power grid companies using reinforcement learning," Applied Energy, Elsevier, vol. 402(PB).
  • Handle: RePEc:eee:appene:v:402:y:2026:i:pb:s0306261925016897
    DOI: 10.1016/j.apenergy.2025.126959
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261925016897
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2025.126959?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Dai, Peng-Fei & Xiong, Xiong & Duc Huynh, Toan Luu & Wang, Jiqiang, 2022. "The impact of economic policy uncertainties on the volatility of European carbon market," Journal of Commodity Markets, Elsevier, vol. 26(C).
    2. Weng, Haoen & Hu, Yongli & Liang, Min & Xi, Jiayang & Yin, Baocai, 2025. "Optimizing bidding strategy in electricity market based on graph convolutional neural network and deep reinforcement learning," Applied Energy, Elsevier, vol. 380(C).
    3. Ciarreta, Aitor & Zarraga, Ainhoa, 2016. "Modeling realized volatility on the Spanish intra-day electricity market," Energy Economics, Elsevier, vol. 58(C), pages 152-163.
    4. Li, Hongze & Li, Xumeng & Zhang, Yuanyuan & Zhao, Yihang & Pan, Jiaqi & Zhao, Huiru, 2024. "Declaration strategy of wind power and pumped storage participating in the power market considering multiple uncertainties," Energy, Elsevier, vol. 293(C).
    5. Shen, Weijie & Ding, Hongjian & Zeng, Ming & Zhang, Xiaochun, 2025. "Optimal bidding strategy for integrated energy system participating in spot power market: A Wasserstein metric based DRO method," Energy, Elsevier, vol. 327(C).
    6. Xiao, Guowei & Zhang, Miao & Huang, Weiqiang & Mo, Zihao & Xie, Haishun & Tang, Xiongmin, 2025. "A data-driven hybrid robust optimization approach for microgrid operators in the energy reserve market considering different wind power producers’ strategies," Applied Energy, Elsevier, vol. 386(C).
    7. Guo, Kun & Liu, Yu & Cao, Shanwei & Zhai, Xiangyang & Ji, Qiang, 2025. "Can climate factors improve the forecasting of electricity price volatility? Evidence from Australia," Energy, Elsevier, vol. 315(C).
    8. Yan, Laiqing & Zhang, Xiaoyu & Ullah, Zia & Qazi, Hasan Saeed & Hasanien, Hany M., 2025. "A novel solution strategy for scheduling optimization of virtual power plant considering multiple participants and Peak Energy Market," Renewable Energy, Elsevier, vol. 250(C).
    9. Qu, Hui & Chen, Wei & Niu, Mengyi & Li, Xindan, 2016. "Forecasting realized volatility in electricity markets using logistic smooth transition heterogeneous autoregressive models," Energy Economics, Elsevier, vol. 54(C), pages 68-76.
    10. Wang, Yudong & Liu, Li & Ma, Feng & Diao, Xundi, 2018. "Momentum of return predictability," Journal of Empirical Finance, Elsevier, vol. 45(C), pages 141-156.
    11. Li, Zhao-Chen & Xie, Chi & Wang, Gang-Jin & Zhu, You & Zeng, Zhi-Jian & Gong, Jue, 2024. "Forecasting global stock market volatilities: A shrinkage heterogeneous autoregressive (HAR) model with a large cross-market predictor set," International Review of Economics & Finance, Elsevier, vol. 93(PB), pages 673-711.
    12. Narayan, Paresh Kumar & Liu, Ruipeng, 2018. "A new GARCH model with higher moments for stock return predictability," Journal of International Financial Markets, Institutions and Money, Elsevier, vol. 56(C), pages 93-103.
    13. Jiang, Yuzheng & Dong, Jun & Huang, Hexiang, 2024. "Optimal bidding strategy for the price-maker virtual power plant in the day-ahead market based on multi-agent twin delayed deep deterministic policy gradient algorithm," Energy, Elsevier, vol. 306(C).
    14. Chen, Chuanglian & Ye, Cheng & Wang, Haonan & Yao, Shujie, 2025. "Policy intervention and stock market stability risks: Evidence from carbon emission trading policy on energy firms in China," Journal of Asian Economics, Elsevier, vol. 98(C).
    15. Rehman, Anis Ur & Ullah, Zia & Qazi, Hasan Saeed & Hasanien, Hany M. & Khalid, Haris M., 2024. "Reinforcement learning-driven proximal policy optimization-based voltage control for PV and WT integrated power system," Renewable Energy, Elsevier, vol. 227(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yi, Yongsheng & Ma, Feng & Zhang, Yaojie & Huang, Dengshi, 2019. "Forecasting stock returns with cycle-decomposed predictors," International Review of Financial Analysis, Elsevier, vol. 64(C), pages 250-261.
    2. Yuan, Xianghui & Li, Xiang, 2022. "Delta-hedging demand and intraday momentum: Evidence from China," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 600(C).
    3. Yaojie Zhang & Feng Ma & Chao Liang & Yi Zhang, 2021. "Good variance, bad variance, and stock return predictability," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 26(3), pages 4410-4423, July.
    4. Yongan Xu & Jianqiong Wang & Zhonglu Chen & Chao Liang, 2023. "Sentiment indices and stock returns: Evidence from China," International Journal of Finance & Economics, John Wiley & Sons, Ltd., vol. 28(1), pages 1063-1080, January.
    5. Di Persio, Luca & Garbelli, Matteo & Giordano, Luca Maria, 2025. "Reinforcement learning for bidding strategy optimization in day-ahead energy market," Energy Economics, Elsevier, vol. 149(C).
    6. Wen, Zhuzhu & Gong, Xu & Ma, Diandian & Xu, Yahua, 2021. "Intraday momentum and return predictability: Evidence from the crude oil market," Economic Modelling, Elsevier, vol. 95(C), pages 374-384.
    7. Ana Sofia Monteiro & Helder Sebastião & Nuno Silva, 2025. "Prediction and Allocation of Stocks, Bonds, and REITs in the US Market," Computational Economics, Springer;Society for Computational Economics, vol. 65(3), pages 1191-1230, March.
    8. Yuan, Ying & Qu, Yong & Qiao, Sijia, 2025. "Equity premium prediction: A constraint-based predictor decomposition approach," Global Finance Journal, Elsevier, vol. 68(C).
    9. Aitor Ciarreta & Peru Muniainy & Ainhoa Zarraga, 2017. "Modelling Realized Volatility in Electricity Spot Prices: New insights and Application to the Japanese Electricity Market," ISER Discussion Paper 0991, Institute of Social and Economic Research, The University of Osaka.
    10. Souropanis, Ioannis & Vivian, Andrew, 2023. "Forecasting realized volatility with wavelet decomposition," Journal of Empirical Finance, Elsevier, vol. 74(C).
    11. Qiao, Gaoxiu & Jiang, Gongyue & Yang, Jiyu, 2022. "VIX term structure forecasting: New evidence based on the realized semi-variances," International Review of Financial Analysis, Elsevier, vol. 82(C).
    12. Li Liu & Zhiyuan Pan & Yudong Wang, 2021. "What can we learn from the return predictability over the business cycle?," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 40(1), pages 108-131, January.
    13. Nonejad, Nima, 2022. "Equity premium prediction using the price of crude oil: Uncovering the nonlinear predictive impact," Energy Economics, Elsevier, vol. 115(C).
    14. Dai, Zhifeng & Jiang, Fuwei & Kang, Jie & Xue, Bowen, 2025. "Stock return predictability in the frequency domain," International Journal of Forecasting, Elsevier, vol. 41(3), pages 1126-1147.
    15. Dai, Zhifeng & Zhu, Huan, 2020. "Stock return predictability from a mixed model perspective," Pacific-Basin Finance Journal, Elsevier, vol. 60(C).
    16. Xu, Yongan & Wang, Jianqiong & Chen, Zhonglu & Liang, Chao, 2021. "Economic policy uncertainty and stock market returns: New evidence," The North American Journal of Economics and Finance, Elsevier, vol. 58(C).
    17. Park, Jin Suk & Newaz, Mohammad Khaleq, 2021. "Liquidity and short-run predictability: Evidence from international stock markets," Global Finance Journal, Elsevier, vol. 50(C).
    18. Dai, Zhifeng & Zhu, Huan, 2021. "Indicator selection and stock return predictability," The North American Journal of Economics and Finance, Elsevier, vol. 57(C).
    19. Dai, Zhifeng & Dong, Xiaodi & Kang, Jie & Hong, Lianying, 2020. "Forecasting stock market returns: New technical indicators and two-step economic constraint method," The North American Journal of Economics and Finance, Elsevier, vol. 53(C).
    20. Dai, Zhifeng & Zhou, Huiting & Kang, Jie & Wen, Fenghua, 2021. "The skewness of oil price returns and equity premium predictability," Energy Economics, Elsevier, vol. 94(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    JEL classification:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:402:y:2026:i:pb:s0306261925016897. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.