Nash Convergence of Mean-Based Learning Algorithms in First-Price Auctions

My bibliography Save this paper

Nash Convergence of Mean-Based Learning Algorithms in First-Price Auctions

Author

Listed:

Xiaotie Deng
Xinyan Hu
Tao Lin
Weiqiang Zheng

Registered:

Abstract

The convergence properties of learning dynamics in repeated auctions is a timely and important question, with numerous applications in, e.g., online advertising markets. This work focuses on repeated first-price auctions where bidders with fixed values learn to bid using mean-based algorithms -- a large class of online learning algorithms that include popular no-regret algorithms such as Multiplicative Weights Update and Follow the Perturbed Leader. We completely characterize the learning dynamics of mean-based algorithms, under two notions of convergence: (1) time-average: the fraction of rounds where bidders play a Nash equilibrium converges to 1; (2) last-iterate: the mixed strategy profile of bidders converges to a Nash equilibrium. Specifically, the results depend on the number of bidders with the highest value: - If the number is at least three, the dynamics almost surely converges to a Nash equilibrium of the auction, in both time-average and last-iterate. - If the number is two, the dynamics almost surely converges to a Nash equilibrium in time-average but not necessarily last-iterate. - If the number is one, the dynamics may not converge to a Nash equilibrium in time-average or last-iterate. Our discovery opens up new possibilities in the study of the convergence of learning dynamics.

Suggested Citation

Xiaotie Deng & Xinyan Hu & Tao Lin & Weiqiang Zheng, 2021. "Nash Convergence of Mean-Based Learning Algorithms in First-Price Auctions," Papers 2110.03906, arXiv.org, revised Aug 2025.

Handle: RePEc:arx:papers:2110.03906

Download full text from publisher

References listed on IDEAS

Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Bernard Lebrun, 1996. "Existence of an equilibrium in first price auctions (*)," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 7(3), pages 421-443.
- Lebrun, Bernard, 1996. "Existence of an Equilibrium in First Price Auctions," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 7(3), pages 421-443, April.
Eric Maskin & John Riley, 2000. "Equilibrium in Sealed High Bid Auctions," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 67(3), pages 439-454.
Foster, Dean P. & Vohra, Rakesh V., 1997. "Calibrated Learning and Correlated Equilibrium," Games and Economic Behavior, Elsevier, vol. 21(1-2), pages 40-55, October.
- D. Foster & R. Vohra, 2010. "Calibrated Learning and Correlated Equilibrium," Levine's Working Paper Archive 568, David K. Levine.
Hon-Snir, Shlomit & Monderer, Dov & Sela, Aner, 1998. "A Learning Approach to Auctions," Journal of Economic Theory, Elsevier, vol. 82(1), pages 65-88, September.
- Shlomit Hon-Snir & Dov Monderer & Aner Sela, 1996. "A Learning Approach to Auctions," Game Theory and Information 9610004, University Library of Munich, Germany, revised 07 Oct 1996.
- Hon-Suir, S. & Monderer, Dov & Sela, Aner, 1997. "A learning approach to auctions," Sonderforschungsbereich 504 Publications 97-11, Sonderforschungsbereich 504, Universität Mannheim;Sonderforschungsbereich 504, University of Mannheim.
- Hon-Snir, Shlomit & Monderer, Dov & Sela, Aner, 1997. "A learning approach to auctions," Papers 97-11, Sonderforschungsbreich 504.
Sergiu Hart & Andreu Mas-Colell, 2013. "A Simple Adaptive Procedure Leading To Correlated Equilibrium," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 2, pages 17-46, World Scientific Publishing Co. Pte. Ltd..
- Sergiu Hart & Andreu Mas-Colell, 2000. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Econometrica, Econometric Society, vol. 68(5), pages 1127-1150, September.
- Sergiu Hart & Andreu Mas-Colell, 1996. "A simple adaptive procedure leading to correlated equilibrium," Economics Working Papers 200, Department of Economics and Business, Universitat Pompeu Fabra, revised Dec 1996.
- S. Hart & A. Mas-Collel, 2010. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Levine's Working Paper Archive 572, David K. Levine.
- Sergiu Hart & Andreu Mas-Colell, 1997. "A Simple Adaptive Procedure Leading to Correlated Equilibrium," Game Theory and Information 9703006, University Library of Munich, Germany, revised 25 Nov 1997.
Lebrun, Bernard, 1999. "First Price Auctions in the Asymmetric N Bidder Case," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 40(1), pages 125-142, February.
- Lebrun, Bernard, 1997. "First Price Auctions in the Asymmetric N Bidder Case," Cahiers de recherche 9715, Université Laval - Département d'économique.
Xiaotie Deng & Ron Lavi & Tao Lin & Qi Qi & Wenwei Wang & Xiang Yan, 2020. "A Game-Theoretic Analysis of the Empirical Revenue Maximization Algorithm with Endogenous Sampling," Papers 2010.05519, arXiv.org.
Krishnamurthy Iyer & Ramesh Johari & Mukund Sundararajan, 2014. "Mean Field Equilibria of Dynamic Auctions with Learning," Management Science, INFORMS, vol. 60(12), pages 2949-2970, December.
repec:cup:cbooks:9781316779309 is not listed on IDEAS
Roughgarden,Tim, 2016. "Twenty Lectures on Algorithmic Game Theory," Cambridge Books, Cambridge University Press, number 9781316624791, Enero-Abr.
Roughgarden,Tim, 2016. "Twenty Lectures on Algorithmic Game Theory," Cambridge Books, Cambridge University Press, number 9781107172661, Enero-Abr.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Emerson Melo, 2021. "Learning in Random Utility Models Via Online Decision Problems," Papers 2112.10993, arXiv.org, revised Aug 2022.
Tom Johnston & Michael Savery & Alex Scott & Bassel Tarbush, 2023. "Game Connectivity and Adaptive Dynamics," Papers 2309.10609, arXiv.org, revised Jun 2025.
Emerson Melo, 2021. "Learning In Random Utility Models Via Online Decision Problems," CAEPR Working Papers 2022-003 Classification-D, Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington.
Michael Foley & Rory Smead & Patrick Forber & Christoph Riedl, 2021. "Avoiding the bullies: The resilience of cooperation among unequals," PLOS Computational Biology, Public Library of Science, vol. 17(4), pages 1-19, April.
- Michael Foley & Rory Smead & Patrick Forber & Christoph Riedl, 2021. "Avoiding the bullies: The resilience of cooperation among unequals," Papers 2104.08636, arXiv.org.
Daron Acemoglu & Asuman Ozdaglar, 2011. "Opinion Dynamics and Learning in Social Networks," Dynamic Games and Applications, Springer, vol. 1(1), pages 3-49, March.
- Daron Acemoglu & Asuman E. Ozdaglar, 2010. "Opinion Dynamics and Learning in Social Networks," Levine's Working Paper Archive 661465000000000222, David K. Levine.
Burkhard C. Schipper, 2022. "Strategic Teaching and Learning in Games," American Economic Journal: Microeconomics, American Economic Association, vol. 14(3), pages 321-352, August.
- Burkhard Schipper, 2015. "Strategic teaching and learning in games," Working Papers 152, University of California, Davis, Department of Economics.
- Burkhard Schipper, 2017. "Strategic Teaching and Learning in Games," Working Papers 232, University of California, Davis, Department of Economics.
Tim Roughgarden, 2018. "Complexity Theory, Game Theory, and Economics: The Barbados Lectures," Papers 1801.00734, arXiv.org, revised Feb 2020.
Rene Saran & Roberto Serrano, 2012. "Regret Matching with Finite Memory," Dynamic Games and Applications, Springer, vol. 2(1), pages 160-175, March.
- Rene Saran & Roberto Serrano, 2010. "Regret Matching with Finite Memory," Levine's Working Paper Archive 661465000000000078, David K. Levine.
- Rene Saran & Roberto Serrano, 2010. "Regret matching with finite memory," Working Papers 2010-10, Instituto Madrileño de Estudios Avanzados (IMDEA) Ciencias Sociales.
- Saran, R.R.S. & Serrano, R., 2010. "Regret matching with finite memory," Research Memorandum 033, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
- Rene Saran & Roberto Serrano, 2010. "Regret Matching with Finite Memory," Working Papers 2010-10, Brown University, Department of Economics.
Santiago R. Balseiro & Yonatan Gur, 2019. "Learning in Repeated Auctions with Budgets: Regret Minimization and Equilibrium," Management Science, INFORMS, vol. 65(9), pages 3952-3968, September.
Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2006. "Stochastic Approximations and Differential Inclusions, Part II: Applications," Mathematics of Operations Research, INFORMS, vol. 31(4), pages 673-695, November.
- Michel Benaïm & Josef Hofbauer & Sylvain Sorin, 2005. "Stochastic Approximations and Differential Inclusions; Part II: Applications," Working Papers hal-00242974, HAL.
Vivaldo M. Mendes & Diana A. Mendes & Orlando Gomes, 2008. "Learning to Play Nash in Deterministic Uncoupled Dynamics," Working Papers Series 1 ercwp1808, ISCTE-IUL, Business Research Unit (BRU-IUL).
Sergiu Hart & Yishay Mansour, 2013. "How Long To Equilibrium? The Communication Complexity Of Uncoupled Equilibrium Procedures," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 10, pages 215-249, World Scientific Publishing Co. Pte. Ltd..
- Hart, Sergiu & Mansour, Yishay, 2010. "How long to equilibrium? The communication complexity of uncoupled equilibrium procedures," Games and Economic Behavior, Elsevier, vol. 69(1), pages 107-126, May.
Nakayama, Kazuaki & Nakamura, Ryuzo & Hisakado, Masato & Mori, Shintaro, 2020. "Optimal learning dynamics of multiagent system in restless multiarmed bandit game," Physica A: Statistical Mechanics and its Applications, Elsevier, vol. 549(C).
Ueda, Masahiko, 2023. "Memory-two strategies forming symmetric mutual reinforcement learning equilibrium in repeated prisoners’ dilemma game," Applied Mathematics and Computation, Elsevier, vol. 444(C).
Eric Friedman & Scott Shenker & Amy Greenwald, 1998. "Learning in Networks Contexts: Experimental Results from Simulations," Departmental Working Papers 199825, Rutgers University, Department of Economics.
Cabrales, Antonio & Serrano, Roberto, 2011. "Implementation in adaptive better-response dynamics: Towards a general theory of bounded rationality in mechanisms," Games and Economic Behavior, Elsevier, vol. 73(2), pages 360-374.
Sergiu Hart & Yishay Mansour, 2006. "The Communication Complexity of Uncoupled Nash Equilibrium Procedures," Levine's Bibliography 122247000000001299, UCLA Department of Economics.
- Sergiu Hart & Yishay Mansour, 2006. "The Communication Complexity of Uncoupled Nash Equilibrium Procedures," Discussion Paper Series dp419, The Federmann Center for the Study of Rationality, the Hebrew University, Jerusalem.
Germano, Fabrizio & Lugosi, Gabor, 2007. "Global Nash convergence of Foster and Young's regret testing," Games and Economic Behavior, Elsevier, vol. 60(1), pages 135-154, July.
- Fabrizio Germano & Gábor Lugosi, 2004. "Global Nash convergence of Foster and Young's regret testing," Economics Working Papers 788, Department of Economics and Business, Universitat Pompeu Fabra.
Du, Ye & Lehrer, Ehud, 2020. "Constrained no-regret learning," Journal of Mathematical Economics, Elsevier, vol. 88(C), pages 16-24.
Sergiu Hart & Andreu Mas-Colell, 2013. "A General Class Of Adaptive Strategies," World Scientific Book Chapters, in: Simple Adaptive Strategies From Regret-Matching to Uncoupled Dynamics, chapter 3, pages 47-76, World Scientific Publishing Co. Pte. Ltd..
- Hart, Sergiu & Mas-Colell, Andreu, 2001. "A General Class of Adaptive Strategies," Journal of Economic Theory, Elsevier, vol. 98(1), pages 26-54, May.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A general class of adaptative strategies," Economics Working Papers 373, Department of Economics and Business, Universitat Pompeu Fabra.
- Sergiu Hart & Andreu Mas-Colell, 1999. "A General Class of Adaptive Strategies," Game Theory and Information 9904001, University Library of Munich, Germany, revised 23 Mar 2000.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-DES-2021-10-18 (Economic Design)
NEP-GTH-2021-10-18 (Game Theory)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2110.03906. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Nash Convergence of Mean-Based Learning Algorithms in First-Price Auctions

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data