Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria

My bibliography Save this article

Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria

Author

Listed:

Christoph Graf
(New York University
Stanford University)
Viktor Zobernig
(University of Natural Resources and Life Sciences)
Johannes Schmidt
(University of Natural Resources and Life Sciences)
Claude Klöckl
(University of Natural Resources and Life Sciences)

Registered:

Abstract

We test the performance of deep deterministic policy gradient—a deep reinforcement learning algorithm, able to handle continuous state and action spaces—to find Nash equilibria in a setting where firms compete in offer prices through a uniform price auction. These algorithms are typically considered “model-free” although a large set of parameters is utilized by the algorithm. These parameters may include learning rates, memory buffers, state space dimensioning, normalizations, or noise decay rates, and the purpose of this work is to systematically test the effect of these parameter configurations on convergence to the analytically derived Bertrand equilibrium. We find parameter choices that can reach convergence rates of up to 99%. We show that the algorithm also converges in more complex settings with multiple players and different cost structures. Its reliable convergence may make the method a useful tool to studying strategic behavior of firms even in more complex settings.

Suggested Citation

Christoph Graf & Viktor Zobernig & Johannes Schmidt & Claude Klöckl, 2024. "Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria," Computational Economics, Springer;Society for Computational Economics, vol. 63(2), pages 529-576, February.

Handle: RePEc:kap:compec:v:63:y:2024:i:2:d:10.1007_s10614-022-10351-6
DOI: 10.1007/s10614-022-10351-6

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Emmanuel Guerre & Isabelle Perrigne & Quang Vuong, 2000. "Optimal Nonparametric Estimation of First-Price Auctions," Econometrica, Econometric Society, vol. 68(3), pages 525-574, May.
Johann Lussange & Ivan Lazarevich & Sacha Bourgeois-Gironde & Stefano Palminteri & Boris Gutkin, 2021. "Modelling Stock Markets by Multi-agent Reinforcement Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(1), pages 113-147, January.
Viossat, Yannick & Zapechelnyuk, Andriy, 2013. "No-regret dynamics and fictitious play," Journal of Economic Theory, Elsevier, vol. 148(2), pages 825-842.
- Yannick Viossat & Andriy Zapechelnyuk, 2013. "No-regret Dynamics and Fictitious Play," Post-Print hal-00713871, HAL.
Noe, Thomas H. & Rebello, Michael & Wang, Jun, 2012. "Learning to bid: The design of auctions under uncertainty and adaptation," Games and Economic Behavior, Elsevier, vol. 74(2), pages 620-636.
Christopher Boyer & B. Brorsen, 2014. "Implications of a Reserve Price in an Agent-Based Common-Value Auction," Computational Economics, Springer;Society for Computational Economics, vol. 43(1), pages 33-51, January.
Drew Fudenberg & Eric Maskin, 2008. "The Folk Theorem In Repeated Games With Discounting Or With Incomplete Information," World Scientific Book Chapters, in: Drew Fudenberg & David K Levine (ed.), A Long-Run Collaboration On Long-Run Games, chapter 11, pages 209-230, World Scientific Publishing Co. Pte. Ltd..
- Fudenberg, Drew & Maskin, Eric, 1986. "The Folk Theorem in Repeated Games with Discounting or with Incomplete Information," Econometrica, Econometric Society, vol. 54(3), pages 533-554, May.
Harrison, Glenn W, 1989. "Theory and Misbehavior of First-Price Auctions," American Economic Review, American Economic Association, vol. 79(4), pages 749-762, September.
- Glenn W. Harrison, 1987. "Theory and Misbehavior of First-Price Auctions," University of Western Ontario, Departmental Research Report Series 8710, University of Western Ontario, Department of Economics.
Emilio Calvano & Giacomo Calzolari & Vincenzo Denicolò & Sergio Pastorello, 2020. "Artificial Intelligence, Algorithmic Pricing, and Collusion," American Economic Review, American Economic Association, vol. 110(10), pages 3267-3297, October.
- Calzolari, Giacomo & Calvano, Emilio & Denicolo, Vincenzo & Pastorello, Sergio, 2018. "Artificial intelligence, algorithmic pricing and collusion," CEPR Discussion Papers 13405, C.E.P.R. Discussion Papers.
Aliabadi, Danial Esmaeili & Kaya, Murat & Şahin, Güvenç, 2017. "An agent-based simulation of power generation company behavior in electricity markets under different market-clearing mechanisms," Energy Policy, Elsevier, vol. 100(C), pages 191-205.
Jian Yao & Ilan Adler & Shmuel S. Oren, 2008. "Modeling and Computing Two-Settlement Oligopolistic Equilibrium in a Congested Electricity Network," Operations Research, INFORMS, vol. 56(1), pages 34-47, February.
Mar Reguant, 2014. "Complementary Bidding Mechanisms and Startup Costs in Electricity Markets," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 81(4), pages 1708-1742.
- Mar Reguant, 2014. "Complementary Bidding Mechanisms and Startup Costs in Electricity Markets," CESifo Working Paper Series 4811, CESifo.
Andreoni James & Miller John H., 1995. "Auctions with Artificial Adaptive Agents," Games and Economic Behavior, Elsevier, vol. 10(1), pages 39-64, July.
Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
- D. Foster & R. Vohra, 2010. "Calibrated Learning and Correlated Equilibrium," Levine's Working Paper Archive 568, David K. Levine.
Julian Schrittwieser & Ioannis Antonoglou & Thomas Hubert & Karen Simonyan & Laurent Sifre & Simon Schmitt & Arthur Guez & Edward Lockhart & Demis Hassabis & Thore Graepel & Timothy Lillicrap & David , 2020. "Mastering Atari, Go, chess and shogi by planning with a learned model," Nature, Nature, vol. 588(7839), pages 604-609, December.
Koichiro Ito & Mar Reguant, 2016. "Sequential Markets, Market Power, and Arbitrage," American Economic Review, American Economic Association, vol. 106(7), pages 1921-1957, July.
- Koichiro Ito & Mar Reguant, 2014. "Sequential Markets, Market Power and Arbitrage," NBER Working Papers 20782, National Bureau of Economic Research, Inc.
- Koichiro ITO & Mar REGUANT, 2015. "Sequential Markets, Market Power and Arbitrage," Discussion papers 15015, Research Institute of Economy, Trade and Industry (RIETI).
Hommes, Cars H., 2006. "Heterogeneous Agent Models in Economics and Finance," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 23, pages 1109-1186, Elsevier.
- Cars H. Hommes, 2005. "Heterogeneous Agent Models in Economics and Finance," Tinbergen Institute Discussion Papers 05-056/1, Tinbergen Institute.
Elodie Guerre & I. Perrigne & Q.H. Vuong, 2000. "Optimal nonparametric estimation of first-price auctions [[Estimation nonparamétrique optimale des enchères au premier prix]]," Post-Print hal-02697497, HAL.
Justin Sirignano & Rama Cont, 2019. "Universal features of price formation in financial markets: perspectives from deep learning," Quantitative Finance, Taylor & Francis Journals, vol. 19(9), pages 1449-1459, September.
Volodymyr Mnih & Koray Kavukcuoglu & David Silver & Andrei A. Rusu & Joel Veness & Marc G. Bellemare & Alex Graves & Martin Riedmiller & Andreas K. Fidjeland & Georg Ostrovski & Stig Petersen & Charle, 2015. "Human-level control through deep reinforcement learning," Nature, Nature, vol. 518(7540), pages 529-533, February.
Jakub Kastl, 2011. "Discrete Bids and Empirical Inference in Divisible Good Auctions," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 78(3), pages 974-1014.
Oriol Vinyals & Igor Babuschkin & Wojciech M. Czarnecki & Michaël Mathieu & Andrew Dudzik & Junyoung Chung & David H. Choi & Richard Powell & Timo Ewalds & Petko Georgiev & Junhyuk Oh & Dan Horgan & M, 2019. "Grandmaster level in StarCraft II using multi-agent reinforcement learning," Nature, Nature, vol. 575(7782), pages 350-354, November.
El Hadi Caoui, 2022. "A Study of Umbrella Damages from Bid Rigging," Journal of Law and Economics, University of Chicago Press, vol. 65(2), pages 239-277.
David Silver & Aja Huang & Chris J. Maddison & Arthur Guez & Laurent Sifre & George van den Driessche & Julian Schrittwieser & Ioannis Antonoglou & Veda Panneershelvam & Marc Lanctot & Sander Dieleman, 2016. "Mastering the game of Go with deep neural networks and tree search," Nature, Nature, vol. 529(7587), pages 484-489, January.
Viehmann, Johannes & Lorenczik, Stefan & Malischek, Raimund, 2021. "Multi-unit multiple bid auctions in balancing markets: An agent-based Q-learning approach," Energy Economics, Elsevier, vol. 93(C).

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Christoph Graf & Viktor Zobernig & Johannes Schmidt & Claude Klockl, 2021. "Computational Performance of Deep Reinforcement Learning to find Nash Equilibria," Papers 2104.12895, arXiv.org.
Esmaeili Aliabadi, Danial & Chan, Katrina, 2022. "The emerging threat of artificial intelligence on competition in liberalized electricity markets: A deep Q-network approach," Applied Energy, Elsevier, vol. 325(C).
Li, Wenqing & Ni, Shaoquan, 2022. "Train timetabling with the general learning environment and multi-agent deep reinforcement learning," Transportation Research Part B: Methodological, Elsevier, vol. 157(C), pages 230-251.
Sang Won Kim & Marcelo Olivares & Gabriel Y. Weintraub, 2014. "Measuring the Performance of Large-Scale Combinatorial Auctions: A Structural Estimation Approach," Management Science, INFORMS, vol. 60(5), pages 1180-1201, May.
Lamy, Laurent & Patnam, Manasa & Visser, Michael, 2016. "Correcting for Sample Selection From Competitive Bidding, with an Application to Estimating the Effect of Wages on Performance," CEPR Discussion Papers 11376, C.E.P.R. Discussion Papers.
- Laurent Lamy & Manasa Patnam & Michael Visser, 2017. "Correcting for Sample Selection From Competitive Bidding, with an Application to Estimating the Effect of Wages on Performance," Post-Print hal-01688267, HAL.
Quang Vuong & Ayse Pehlivan, 2015. "Supply Function Competition and Exporters: Nonparametric Identification and Estimation of Productivity Distributions and Marginal Costs," 2015 Meeting Papers 1414, Society for Economic Dynamics.
Hunt Allcott, 2012. "The Smart Grid, Entry, and Imperfect Competition in Electricity Markets," NBER Working Papers 18071, National Bureau of Economic Research, Inc.
Jason Allen & Jakub Kastl & Milena Wittwer, 2020. "Primary Dealers and the Demand for Government Debt," Working Papers 2020-27, Princeton University. Economics Department..
Ali Hortaçsu & Jakub Kastl & Allen Zhang, 2018. "Bid Shading and Bidder Surplus in the US Treasury Auction System," American Economic Review, American Economic Association, vol. 108(1), pages 147-169, January.
- Ali Hortaçsu & Jakub Kastl & Allen Zhang, 2017. "Bid Shading and Bidder Surplus in the U.S. Treasury Auction System," NBER Working Papers 24024, National Bureau of Economic Research, Inc.
Gabrielli, M. Florencia & Willington, Manuel, 2023. "Estimating damages from bidding rings in first-price auctions," Economic Modelling, Elsevier, vol. 126(C).
Patrick Bajari & Ali Hortacsu, 2005. "Are Structural Estimates of Auction Models Reasonable? Evidence from Experimental Data," Journal of Political Economy, University of Chicago Press, vol. 113(4), pages 703-741, August.
- Patrick Bajari & Ali Hortacsu, 2003. "Are Structural Estimates of Auction Models Reasonable? Evidence from Experimental Data," NBER Working Papers 9889, National Bureau of Economic Research, Inc.
- Patrick Bajari & Ali Hortacsu, 2003. "Are Structural Estimates of Auction Models Reasonable? Evidence from Experimental Data," Working Papers 03002, Stanford University, Department of Economics.
Shuo Sun & Rundong Wang & Bo An, 2021. "Reinforcement Learning for Quantitative Trading," Papers 2109.13851, arXiv.org.
Justin P. Johnson & Andrew Rhodes & Matthijs Wildenbeest, 2023. "Platform Design When Sellers Use Pricing Algorithms," Econometrica, Econometric Society, vol. 91(5), pages 1841-1879, September.
- Johnson, Justin Pappas & Rhodes, Andrew & Wildenbeest, Matthijs, 2020. "Platform Design when Sellers Use Pricing Algorithms," TSE Working Papers 20-1146, Toulouse School of Economics (TSE).
- Rhodes, Andrew & Johnson, Justin & Wildenbeest, Matthijs, 2020. "Platform Design When Sellers Use Pricing Algorithms," CEPR Discussion Papers 15504, C.E.P.R. Discussion Papers.
- Justin Pappas Johnson & Andrew Rhodes & Matthijs Wildenbeest, 2023. "Platform design when sellers use pricing algorithms," Post-Print hal-04226232, HAL.
Ngo, Vu Minh & Nguyen, Huan Huu & Van Nguyen, Phuc, 2023. "Does reinforcement learning outperform deep learning and traditional portfolio optimization models in frontier and developed financial markets?," Research in International Business and Finance, Elsevier, vol. 65(C).
Chloé Le Coq & Sebastian Schwenen, 2020. "Financial contracts as coordination device," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 29(2), pages 241-259, April.
- Le Coq, Chloe & Schwenen, Sebastian, 2019. "Financial Contracts as Coordination Device," SITE Working Paper Series 47, Stockholm School of Economics, Stockholm Institute of Transition Economics.
- Chloé Le Coq & Sebastian Schwenen, 2020. "Financial contracts as coordination device," Post-Print hal-04129332, HAL.
Kastl, Jakub, 2012. "On the properties of equilibria in private value divisible good auctions with constrained bidding," Journal of Mathematical Economics, Elsevier, vol. 48(6), pages 339-352.
Jakub Kastl & Ali Hortacsu, 2007. "Testing for Common Valuation in Treasury Bills Auctions," 2007 Meeting Papers 222, Society for Economic Dynamics.
Hickman Brent R. & Hubbard Timothy P. & Sağlam Yiğit, 2012. "Structural Econometric Methods in Auctions: A Guide to the Literature," Journal of Econometric Methods, De Gruyter, vol. 1(1), pages 67-106, August.
- SaÄŸlam, YiÄŸit, 2012. "Structural Econometric Methods in Auctions: A Guide to the Literature," Working Paper Series 4115, Victoria University of Wellington, The New Zealand Institute for the Study of Competition and Regulation.
De Moor, Bram J. & Gijsbrechts, Joren & Boute, Robert N., 2022. "Reward shaping to improve the performance of deep reinforcement learning in perishable inventory management," European Journal of Operational Research, Elsevier, vol. 301(2), pages 535-545.
Susan Athey & Philip A. Haile, 2006. "Empirical Models of Auctions," NBER Working Papers 12126, National Bureau of Economic Research, Inc.
- Susan Athey & Philip A. Haile, 2006. "Empirical Models of Auctions," Cowles Foundation Discussion Papers 1562, Cowles Foundation for Research in Economics, Yale University.
- Susan Athey & Philip A. Haile, 2006. "Empirical Models of Auctions," Levine's Bibliography 122247000000001045, UCLA Department of Economics.

More about this item

Keywords

; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:compec:v:63:y:2024:i:2:d:10.1007_s10614-022-10351-6. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Computational Performance of Deep Reinforcement Learning to Find Nash Equilibria

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data