IDEAS home Printed from https://ideas.repec.org/a/eee/appene/v391y2025ics0306261925005458.html

Mixed strategy Nash equilibrium analysis in real-time pricing and demand response for future smart retail market

Author

Listed:
  • Hu, Ze
  • Zhu, Ziqing
  • Wei, Xiang
  • Chan, Ka Wing
  • Bu, Siqi

Abstract

Real-time pricing and demand response (RTP-DR) is a key problem for profit-maximizing and policy-making in the deregulated retail electricity market (REM). However, previous studies overlooked the non-convexity and multi-equilibria caused by the network constraints and the temporally-related non-linear power consumption characteristics of end-users (EUs) in a privacy-protected environment. This paper employs mixed strategy Nash equilibrium (MSNE) to analyze the multiple equilibria in the non-convex game of the RTP-DR problem, providing a comprehensive view of the potential transaction results. A novel multi-agent Q-learning algorithm is developed to estimate subgame perfect equilibrium (SPE) in the proposed game. As a multi-agent reinforcement learning (MARL) algorithm, it enables players in the game to be rational “agents” that learn from “trial and error” to make optimal decisions across time periods. Moreover, the proposed algorithm has a bi-level structure and adopts probability distributions to denote Q-values, representing the belief in environmental response. Through validation on a Northern Illinois utility dataset, our proposed approach demonstrates notable advantages over benchmark algorithms. Specifically, it provides more profitable pricing decisions for monopoly retailers in REM, leading to strategic outcomes for EUs. The numerical results also find that multiple optimal pricing decisions over a day exist simultaneously by providing almost identical profits to the retailer, while leading to different energy consumption patterns and also significant differences in total energy usage on the demand side.

Suggested Citation

  • Hu, Ze & Zhu, Ziqing & Wei, Xiang & Chan, Ka Wing & Bu, Siqi, 2025. "Mixed strategy Nash equilibrium analysis in real-time pricing and demand response for future smart retail market," Applied Energy, Elsevier, vol. 391(C).
  • Handle: RePEc:eee:appene:v:391:y:2025:i:c:s0306261925005458
    DOI: 10.1016/j.apenergy.2025.125815
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0306261925005458
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.apenergy.2025.125815?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Charbonnier, Flora & Morstyn, Thomas & McCulloch, Malcolm D., 2022. "Scalable multi-agent reinforcement learning for distributed control of residential energy flexibility," Applied Energy, Elsevier, vol. 314(C).
    2. Li, Kangping & Wang, Fei & Mi, Zengqiang & Fotuhi-Firuzabad, Mahmoud & Duić, Neven & Wang, Tieqiang, 2019. "Capacity and output power estimation approach of individual behind-the-meter distributed photovoltaic system for demand response baseline estimation," Applied Energy, Elsevier, vol. 253(C), pages 1-1.
    3. Zhang, Li & Gao, Yan & Zhu, Hongbo & Tao, Li, 2022. "Bi-level stochastic real-time pricing model in multi-energy generation system: A reinforcement learning approach," Energy, Elsevier, vol. 239(PA).
    4. Lu, Renzhi & Bai, Ruichang & Huang, Yuan & Li, Yuting & Jiang, Junhui & Ding, Yuemin, 2021. "Data-driven real-time price-based demand response for industrial facilities energy management," Applied Energy, Elsevier, vol. 283(C).
    5. Lu, Renzhi & Hong, Seung Ho & Zhang, Xiongfeng, 2018. "A Dynamic pricing demand response algorithm for smart grid: Reinforcement learning approach," Applied Energy, Elsevier, vol. 220(C), pages 220-230.
    6. Hua, Haochen & Qin, Yuchao & Hao, Chuantong & Cao, Junwei, 2019. "Optimal energy management strategies for energy Internet via deep reinforcement learning approach," Applied Energy, Elsevier, vol. 239(C), pages 598-609.
    7. Wang, Qi & Zhang, Chunyu & Ding, Yi & Xydis, George & Wang, Jianhui & Østergaard, Jacob, 2015. "Review of real-time electricity markets for integrating Distributed Energy Resources and Demand Response," Applied Energy, Elsevier, vol. 138(C), pages 695-706.
    8. Partha Dasgupta & Eric Maskin, 1986. "The Existence of Equilibrium in Discontinuous Economic Games, I: Theory," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 53(1), pages 1-26.
    9. Qiu, Dawei & Ye, Yujian & Papadaskalopoulos, Dimitrios & Strbac, Goran, 2021. "Scalable coordinated management of peer-to-peer energy trading: A multi-cluster deep reinforcement learning approach," Applied Energy, Elsevier, vol. 292(C).
    10. Zhu, Ziqing & Hu, Ze & Chan, Ka Wing & Bu, Siqi & Zhou, Bin & Xia, Shiwei, 2023. "Reinforcement learning in deregulated energy market: A comprehensive review," Applied Energy, Elsevier, vol. 329(C).
    11. Harrold, Daniel J.B. & Cao, Jun & Fan, Zhong, 2022. "Renewable energy integration and microgrid energy trading using multi-agent deep reinforcement learning," Applied Energy, Elsevier, vol. 318(C).
    12. Partha Dasgupta & Eric Maskin, 1986. "The Existence of Equilibrium in Discontinuous Economic Games, II: Applications," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 53(1), pages 27-41.
    13. Wang, Ziyang & Sun, Mei & Gao, Cuixia & Wang, Xin & Ampimah, Benjamin Chris, 2021. "A new interactive real-time pricing mechanism of demand response based on an evaluation model," Applied Energy, Elsevier, vol. 295(C).
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Zhai, Chao & Cao, Zhixiang & Wang, Yi & Abdou-Tankari, Mahamadou & Yu, Jian & Lei, Zhen, 2025. "A reverse incentive-based demand response strategy for shared energy storage in industrial microgrids: Optimization, scheduling, and investment analysis," Energy, Elsevier, vol. 330(C).

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xu, Fangyuan & Zhu, Weidong & Wang, Yi Fei & Lai, Chun Sing & Yuan, Haoliang & Zhao, Yujia & Guo, Siming & Fu, Zhengxin, 2022. "A new deregulated demand response scheme for load over-shifting city in regulated power market," Applied Energy, Elsevier, vol. 311(C).
    2. Tariq, Abdul Haseeb & Amin, Uzma, 2025. "Peer-to-peer multi-energy trading in a decentralized network: A review," Renewable and Sustainable Energy Reviews, Elsevier, vol. 208(C).
    3. Peralta, Susana & Wauthy, Xavier & van Ypersele, Tanguy, 2006. "Should countries control international profit shifting?," Journal of International Economics, Elsevier, vol. 68(1), pages 24-37, January.
    4. Bonatti, Alessandro & Hörner, Johannes, 2017. "Learning to disagree in a game of experimentation," Journal of Economic Theory, Elsevier, vol. 169(C), pages 234-269.
    5. Allison, Blake A. & Bagh, Adib & Lepore, Jason J., 2018. "Sufficient conditions for weak reciprocal upper semi-continuity in mixed extensions of games," Journal of Mathematical Economics, Elsevier, vol. 74(C), pages 99-107.
    6. Becchetti, Leonardo & Palestini, Arsen & Solferino, Nazaria & Elisabetta Tessitore, M., 2014. "The socially responsible choice in a duopolistic market: A dynamic model of “ethical product” differentiation," Economic Modelling, Elsevier, vol. 43(C), pages 114-123.
    7. Jacobs, Martin & Requate, Till, 2016. "Bertrand-Edgeworth markets with increasing marginal costs and voluntary trading: Experimental evidence," Economics Working Papers 2016-01, Christian-Albrechts-University of Kiel, Department of Economics.
    8. Bernhardt, Dan & Koufopoulos, Kostas & Trigilia, Giulio, 2022. "Separating equilibria, underpricing and security design," Journal of Financial Economics, Elsevier, vol. 145(3), pages 788-801.
    9. Azevedo, Eduardo M. & Gottlieb, Daniel, 2019. "An example of non-existence of Riley equilibrium in markets with adverse selection," Games and Economic Behavior, Elsevier, vol. 116(C), pages 152-157.
    10. Marie-Laure Cabon-Dhersin & Nicolas Drouhin, 2020. "A general model of price competition with soft capacity constraints," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 70(1), pages 95-120, July.
    11. Benjamin Lester & Ali Shourideh & Venky Venkateswaran & Ariel Zetlin-Jones, 2019. "Screening and Adverse Selection in Frictional Markets," Journal of Political Economy, University of Chicago Press, vol. 127(1), pages 338-377.
    12. Xin Zhao, 2018. "Heterogeneity and Unanimity: Optimal Committees with Information Acquisition," Working Paper Series 52, Economics Discipline Group, UTS Business School, University of Technology, Sydney.
    13. Mark Armstrong & John Vickers, 2018. "Patterns of Competition with Captive Customers," Economics Series Working Papers 864, University of Oxford, Department of Economics.
    14. Luke Garrod & Matthew Olczak, 2017. "Collusion Under Imperfect Monitoring with Asymmetric Firms," Journal of Industrial Economics, Wiley Blackwell, vol. 65(3), pages 654-682, September.
    15. Plan, Asaf, 2023. "Symmetry in n-player games," Journal of Economic Theory, Elsevier, vol. 207(C).
    16. Amit Pazgal & David Soberman & Raphael Thomadsen, 2016. "Maximal or Minimal Differentiation in a Hotelling Market? A Fresh Perspective," Customer Needs and Solutions, Springer;Institute for Sustainable Innovation and Growth (iSIG), vol. 3(1), pages 42-47, March.
    17. Rabia Nessah & Guoqiang Tian, 2013. "Existence of Solution of Minimax Inequalities, Equilibria in Games and Fixed Points Without Convexity and Compactness Assumptions," Journal of Optimization Theory and Applications, Springer, vol. 157(1), pages 75-95, April.
    18. Emilie Dargaud & Maxime Menuet & Petros G. Sekeris, 2024. "Collusion and predation under Cournot competition," Managerial and Decision Economics, John Wiley & Sons, Ltd., vol. 45(1), pages 315-325, January.
    19. Attila Tasnádi, 2016. "Endogenous timing of moves in Bertrand–Edgeworth triopolies," International Journal of Economic Theory, The International Society for Economic Theory, vol. 12(4), pages 317-334, December.
    20. Vettas, Nikolaos & Biglaiser, Gary, 2004. "Dynamic Price Competition with Capacity Constraints and Strategic Buyers," CEPR Discussion Papers 4315, C.E.P.R. Discussion Papers.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:appene:v:391:y:2025:i:c:s0306261925005458. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/wps/find/journaldescription.cws_home/405891/description#description .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.