IDEAS home Printed from https://ideas.repec.org/a/eee/eneeco/v149y2025ics0140988325005006.html

Reinforcement learning for bidding strategy optimization in day-ahead energy market

Author

Listed:
  • Di Persio, Luca
  • Garbelli, Matteo
  • Giordano, Luca Maria

Abstract

In day-ahead markets, participants submit bids specifying the amounts of energy they wish to buy or sell and the price they are prepared to pay or receive. However, the dynamic for forming the Market Clearing Price (MCP) dictated by the bidding mechanism is frequently overlooked in the literature on energy market modeling. Forecasting models usually focus on predicting the MCP rather than trying to build the optimal supply and demand curves for a given price scenario. This article develops a data-driven approach for generating optimal offering curves using Deep Deterministic Policy Gradient (DDPG), a reinforcement learning algorithm capable of handling continuous action spaces. Our model processes historical Italian electricity price data to generate stepwise offering curves that maximize profit over time. Numerical experiments demonstrate the effectiveness of our approach, with the agent achieving up to 85% of the normalized reward, i.e. the ratio between actual profit and the maximum possible revenue obtainable if all production capacity were sold at the highest feasible price. These results demonstrate that reinforcement learning can effectively capture complex temporal patterns in electricity price data without requiring explicit forecast models, providing market participants with adaptive bidding strategies that improve profit margins while accounting for production constraints.

Suggested Citation

  • Di Persio, Luca & Garbelli, Matteo & Giordano, Luca Maria, 2025. "Reinforcement learning for bidding strategy optimization in day-ahead energy market," Energy Economics, Elsevier, vol. 149(C).
  • Handle: RePEc:eee:eneeco:v:149:y:2025:i:c:s0140988325005006
    DOI: 10.1016/j.eneco.2025.108673
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0140988325005006
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.eneco.2025.108673?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Ardi Tampuu & Tambet Matiisen & Dorian Kodelja & Ilya Kuzovkin & Kristjan Korjus & Juhan Aru & Jaan Aru & Raul Vicente, 2017. "Multiagent cooperation and competition with deep reinforcement learning," PLOS ONE, Public Library of Science, vol. 12(4), pages 1-15, April.
    2. James Nicolaisen & Valentin Petrov & Leigh Tesfatsion, 2000. "Market Power and Efficiency in a Computational Electricity Market with Discriminatory Double-Auction Pricing," Computational Economics 0004005, University Library of Munich, Germany.
    3. Weng, Haoen & Hu, Yongli & Liang, Min & Xi, Jiayang & Yin, Baocai, 2025. "Optimizing bidding strategy in electricity market based on graph convolutional neural network and deep reinforcement learning," Applied Energy, Elsevier, vol. 380(C).
    4. Serafin, Tomasz & Marcjasz, Grzegorz & Weron, Rafał, 2022. "Trading on short-term path forecasts of intraday electricity prices," Energy Economics, Elsevier, vol. 112(C).
    5. Katarzyna Maciejowska & Bartosz Uniejewski & Rafa{l} Weron, 2022. "Forecasting Electricity Prices," Papers 2204.11735, arXiv.org.
    6. Cardo-Miota, Javier & Beltran, Hector & Pérez, Emilio & Khadem, Shafi & Bahloul, Mohamed, 2025. "Deep reinforcement learning-based strategy for maximizing returns from renewable energy and energy storage systems in multi-electricity markets," Applied Energy, Elsevier, vol. 388(C).
    7. Sebastian Jaimungal, 2022. "Reinforcement learning and stochastic optimisation," Finance and Stochastics, Springer, vol. 26(1), pages 103-129, January.
    8. Jiang, Yuzheng & Dong, Jun & Huang, Hexiang, 2024. "Optimal bidding strategy for the price-maker virtual power plant in the day-ahead market based on multi-agent twin delayed deep deterministic policy gradient algorithm," Energy, Elsevier, vol. 306(C).
    9. Luca Di Persio & Mohammed Alruqimi & Matteo Garbelli, 2024. "Stochastic Approaches to Energy Markets: From Stochastic Differential Equations to Mean Field Games and Neural Network Modeling," Energies, MDPI, vol. 17(23), pages 1-46, December.
    10. Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Emilio Calvano & Giacomo Calzolari & Vincenzo Denicolò & Sergio Pastorello, 2019. "Algorithmic Pricing What Implications for Competition Policy?," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 55(1), pages 155-171, August.
    2. Harrold, Daniel J.B. & Cao, Jun & Fan, Zhong, 2022. "Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning," Applied Energy, Elsevier, vol. 318(C).
    3. Wang, Xuekai & D’Ariano, Andrea & Su, Shuai & Tang, Tao, 2023. "Cooperative train control during the power supply shortage in metro system: A multi-agent reinforcement learning approach," Transportation Research Part B: Methodological, Elsevier, vol. 170(C), pages 244-278.
    4. Mahmud, Sakib & Sayed, Aya Nabil & Himeur, Yassine & Nhlabatsi, Armstrong & Bensaali, Faycal, 2026. "A comprehensive review of deep reinforcement learning applications from centralized power generation to modern energy internet frameworks," Renewable and Sustainable Energy Reviews, Elsevier, vol. 226(PE).
    5. Gal Neria & Michal Tzur, 2024. "The Dynamic Pickup and Allocation with Fairness Problem," Transportation Science, INFORMS, vol. 58(4), pages 821-840, July.
    6. Perera, A.T.D. & Kamalaruban, Parameswaran, 2021. "Applications of reinforcement learning in energy systems," Renewable and Sustainable Energy Reviews, Elsevier, vol. 137(C).
    7. Wang, Boyu & Xu, Xiaofeng & Wang, Peng, 2026. "Multi-market agency-based power procurement strategies for power grid companies using reinforcement learning," Applied Energy, Elsevier, vol. 402(PB).
    8. Xiang Cheng & Zhuo Jin & Hailiang Yang & George Yin, 2026. "A Hybrid Deep Reinforcement Learning Method for Insurance Portfolio Management," Journal of Optimization Theory and Applications, Springer, vol. 208(1), pages 1-42, January.
    9. Hongxin Yu & Lihui Zhang & Meng Zhang & Fengyue Jin & Yibing Wang, 2024. "Coordinated Ramp Metering Considering the Dynamics of Mixed-Autonomy Traffic," Sustainability, MDPI, vol. 16(22), pages 1-26, November.
    10. Daniel Russo, 2023. "Approximation Benefits of Policy Gradient Methods with Aggregated States," Management Science, INFORMS, vol. 69(11), pages 6898-6911, November.
    11. Balint, T. & Lamperti, F. & Mandel, A. & Napoletano, M. & Roventini, A. & Sapio, A., 2017. "Complexity and the Economics of Climate Change: A Survey and a Look Forward," Ecological Economics, Elsevier, vol. 138(C), pages 252-265.
    12. Tulika Saha & Sriparna Saha & Pushpak Bhattacharyya, 2020. "Towards sentiment aided dialogue policy learning for multi-intent conversations using hierarchical reinforcement learning," PLOS ONE, Public Library of Science, vol. 15(7), pages 1-28, July.
    13. Hamid Ebrahimi, 2026. "A Novel Evidential Uncertainty Framework for Hybrid Models in Rainfall Simulation," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 40(2), pages 1-24, January.
    14. Mahmoud Mahfouz & Angelos Filos & Cyrine Chtourou & Joshua Lockhart & Samuel Assefa & Manuela Veloso & Danilo Mandic & Tucker Balch, 2019. "On the Importance of Opponent Modeling in Auction Markets," Papers 1911.12816, arXiv.org.
    15. Lixiang Zhang & Yan Yan & Yaoguang Hu, 2024. "Deep reinforcement learning for dynamic scheduling of energy-efficient automated guided vehicles," Journal of Intelligent Manufacturing, Springer, vol. 35(8), pages 3875-3888, December.
    16. Imen Azzouz & Wiem Fekih Hassen, 2023. "Optimization of Electric Vehicles Charging Scheduling Based on Deep Reinforcement Learning: A Decentralized Approach," Energies, MDPI, vol. 16(24), pages 1-18, December.
    17. Iqra Nazir & Nermish Mushtaq & Waqas Amin, 2025. "Smart Grid Systems: Addressing Privacy Threats, Security Vulnerabilities, and Demand–Supply Balance (A Review)," Energies, MDPI, vol. 18(19), pages 1-77, September.
    18. Benjamin Heinbach & Peter Burggräf & Johannes Wagner, 2024. "gym-flp: A Python Package for Training Reinforcement Learning Algorithms on Facility Layout Problems," SN Operations Research Forum, Springer, vol. 5(1), pages 1-26, March.
    19. Jacob W. Crandall & Mayada Oudah & Tennom & Fatimah Ishowo-Oloko & Sherief Abdallah & Jean-François Bonnefon & Manuel Cebrian & Azim Shariff & Michael A. Goodrich & Iyad Rahwan, 2018. "Cooperating with machines," Nature Communications, Nature, vol. 9(1), pages 1-12, December.
      • Abdallah, Sherief & Bonnefon, Jean-François & Cebrian, Manuel & Crandall, Jacob W. & Ishowo-Oloko, Fatimah & Oudah, Mayada & Rahwan, Iyad & Shariff, Azim & Tennom,, 2017. "Cooperating with Machines," TSE Working Papers 17-806, Toulouse School of Economics (TSE), revised 10 Jun 2026.
      • Abdallah, Sherief & Bonnefon, Jean-François & Cebrian, Manuel & Crandall, Jacob W. & Ishowo-Oloko, Fatimah & Oudah, Mayada & Rahwan, Iyad & Shariff, Azim & Tennom,, 2017. "Cooperating with Machines," IAST Working Papers 17-68, Institute for Advanced Study in Toulouse (IAST), revised 10 Jun 2026.
    20. Sun, Alexander Y., 2020. "Optimal carbon storage reservoir management through deep reinforcement learning," Applied Energy, Elsevier, vol. 278(C).

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    JEL classification:

    • Q41 - Agricultural and Natural Resource Economics; Environmental and Ecological Economics - - Energy - - - Demand and Supply; Prices
    • Q47 - Agricultural and Natural Resource Economics; Environmental and Ecological Economics - - Energy - - - Energy Forecasting
    • C57 - Mathematical and Quantitative Methods - - Econometric Modeling - - - Econometrics of Games and Auctions
    • D44 - Microeconomics - - Market Structure, Pricing, and Design - - - Auctions
    • C73 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Stochastic and Dynamic Games; Evolutionary Games
    • N74 - Economic History - - Economic History: Transport, International and Domestic Trade, Energy, and Other Services - - - Europe: 1913-

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:eneeco:v:149:y:2025:i:c:s0140988325005006. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eneco .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.