Learning in Repeated Auctions with Budgets: Regret Minimization and Equilibrium

My bibliography Save this article

Learning in Repeated Auctions with Budgets: Regret Minimization and Equilibrium

Author

Listed:

Santiago R. Balseiro
(Graduate School of Business, Columbia University, New York, New York 10027)
Yonatan Gur
(Graduate School of Business, Stanford University, Stanford, California 94305)

Registered:

Abstract

In online advertising markets, advertisers often purchase ad placements through bidding in repeated auctions based on realized viewer information. We study how budget-constrained advertisers may compete in such sequential auctions in the presence of uncertainty about future bidding opportunities and competition. We formulate this problem as a sequential game of incomplete information, in which bidders know neither their own valuation distribution nor the budgets and valuation distributions of their competitors. We introduce a family of practical bidding strategies we refer to as adaptive pacing strategies, in which advertisers adjust their bids according to the sample path of expenditures they exhibit, and analyze the performance of these strategies in different competitive settings. We establish the asymptotic optimality of these strategies when competitors’ bids are independent and identically distributed over auctions, but also when competing bids are arbitrary. When all the bidders adopt these strategies, we establish the convergence of the induced dynamics and characterize a regime (well motivated in the context of online advertising markets) under which these strategies constitute an approximate Nash equilibrium in dynamic strategies: the benefit from unilaterally deviating to other strategies, including ones with access to complete information, becomes negligible as the number of auctions and competitors grows large. This establishes a connection between regret minimization and market stability, by which advertisers can essentially follow approximate equilibrium bidding strategies that also ensure the best performance that can be guaranteed off equilibrium.

Suggested Citation

Santiago R. Balseiro & Yonatan Gur, 2019. "Learning in Repeated Auctions with Budgets: Regret Minimization and Equilibrium," Management Science, INFORMS, vol. 65(9), pages 3952-3968, September.

Handle: RePEc:inm:ormnsc:v:65:y:2019:i:9:p:3952-3968
DOI: 10.1287/mnsc.2018.3174

Download full text from publisher

References listed on IDEAS

Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Santiago R. Balseiro & Jon Feldman & Vahab Mirrokni & S. Muthukrishnan, 2014. "Yield Optimization of Display Advertising with Ad Exchange," Management Science, INFORMS, vol. 60(12), pages 2886-2907, December.
Omar Besbes & Yonatan Gur & Assaf Zeevi, 2015. "Non-Stationary Stochastic Optimization," Operations Research, INFORMS, vol. 63(5), pages 1227-1244, October.
Hon-Snir, Shlomit & Monderer, Dov & Sela, Aner, 1998. "A Learning Approach to Auctions," Journal of Economic Theory, Elsevier, vol. 82(1), pages 65-88, September.
- Shlomit Hon-Snir & Dov Monderer & Aner Sela, 1996. "A Learning Approach to Auctions," Game Theory and Information 9610004, University Library of Munich, Germany, revised 07 Oct 1996.
- Hon-Suir, S. & Monderer, Dov & Sela, Aner, 1997. "A learning approach to auctions," Sonderforschungsbereich 504 Publications 97-11, Sonderforschungsbereich 504, Universität Mannheim;Sonderforschungsbereich 504, University of Mannheim.
- Hon-Snir, Shlomit & Monderer, Dov & Sela, Aner, 1997. "A learning approach to auctions," Papers 97-11, Sonderforschungsbreich 504.
L. Elisa Celis & Gregory Lewis & Markus Mobius & Hamid Nazerzadeh, 2014. "Buy-It-Now or Take-a-Chance: Price Discrimination Through Randomized Auctions," Management Science, INFORMS, vol. 60(12), pages 2927-2948, December.
- L. Elisa Celis & Gregory Lewis & Markus M. Mobius & Hamid Nazerzadeh, 2012. "Buy-it-now or Take-a-chance: Price Discrimination through Randomized Auctions," NBER Working Papers 18590, National Bureau of Economic Research, Inc.
Krishnamurthy Iyer & Ramesh Johari & Mukund Sundararajan, 2014. "Mean Field Equilibria of Dynamic Auctions with Learning," Management Science, INFORMS, vol. 60(12), pages 2949-2970, December.
Santiago R. Balseiro & Omar Besbes & Gabriel Y. Weintraub, 2015. "Repeated Auctions with Budgets in Ad Exchanges: Approximations and Design," Management Science, INFORMS, vol. 61(4), pages 864-884, April.
Kalyan Talluri & Garrett van Ryzin, 1998. "An Analysis of Bid-Price Controls for Network Revenue Management," Management Science, INFORMS, vol. 44(11-Part-1), pages 1577-1593, November.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Ilan Lobel, 2021. "Revenue Management and the Rise of the Algorithmic Economy," Management Science, INFORMS, vol. 67(9), pages 5389-5398, September.
Hana Choi & Carl F. Mela & Santiago R. Balseiro & Adam Leary, 2020. "Online Display Advertising Markets: A Literature Review and Future Directions," Information Systems Research, INFORMS, vol. 31(2), pages 556-575, June.
Santiago R. Balseiro & Omar Besbes & Gabriel Y. Weintraub, 2015. "Repeated Auctions with Budgets in Ad Exchanges: Approximations and Design," Management Science, INFORMS, vol. 61(4), pages 864-884, April.
Xiaotie Deng & Xinyan Hu & Tao Lin & Weiqiang Zheng, 2021. "Nash Convergence of Mean-Based Learning Algorithms in First-Price Auctions," Papers 2110.03906, arXiv.org, revised Aug 2025.
Saran, R.R.S. & Serrano, R., 2010. "Ex-Post regret learning in games with fixed and random matching: the case of private values," Research Memorandum 032, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
- Rene Saran & Roberto Serrano, 2010. "Ex-Post Regret Learning in Games with Fixed and Random Matching: The Case of Private Values," Working Papers 2010-11, Brown University, Department of Economics.
- Rene Saran & Roberto Serrano, 2010. "Ex-Post Regret Learning in Games with Fixed and Random Matching: The Case of Private Values," Levine's Working Paper Archive 661465000000000083, David K. Levine.
- Rene Saran & Roberto Serrano, 2010. "Ex-post regret learning in games with fixed and random matching: The case of private values," Working Papers 2010-11, Instituto Madrileño de Estudios Avanzados (IMDEA) Ciencias Sociales.
Hernandez Senosiain, Patricio, 2022. "Why Do Men Keep Swiping Right? Two-Sided Search in Swipe-Based Dating Platforms," Warwick-Monash Economics Student Papers 37, Warwick Monash Economics Student Papers.
Santiago R. Balseiro & Ozan Candogan, 2017. "Optimal Contracts for Intermediaries in Online Advertising," Operations Research, INFORMS, vol. 65(4), pages 878-896, August.
Saran, Rene & Serrano, Roberto, 2014. "Ex-post regret heuristics under private values (I): Fixed and random matching," Journal of Mathematical Economics, Elsevier, vol. 54(C), pages 97-111.
Dragos Florin Ciocan & Krishnamurthy Iyer, 2021. "Tractable Equilibria in Sponsored Search with Endogenous Budgets," Operations Research, INFORMS, vol. 69(1), pages 227-244, January.
Atanasios Mitropoulos, 2001. "Learning Under Little Information: An Experiment on Mutual Fate Control," Game Theory and Information 0110003, University Library of Munich, Germany.
Cabrales, Antonio & Serrano, Roberto, 2011. "Implementation in adaptive better-response dynamics: Towards a general theory of bounded rationality in mechanisms," Games and Economic Behavior, Elsevier, vol. 73(2), pages 360-374.
Berger, Ulrich, 2008. "Learning in games with strategic complementarities revisited," Journal of Economic Theory, Elsevier, vol. 143(1), pages 292-301, November.
Berger, Ulrich, 2007. "Brown's original fictitious play," Journal of Economic Theory, Elsevier, vol. 135(1), pages 572-578, July.
- Ulrich Berger, 2005. "Brown's Original Fictitious Play," Game Theory and Information 0503008, University Library of Munich, Germany.
Leila Hosseini & Shaojie Tang & Vijay Mookerjee, 2024. "When Is More Merrier? A Cloud-Based Architecture to Procure Impressions from Multiple Ad Exchanges," Information Systems Research, INFORMS, vol. 35(1), pages 294-317, March.
Raghav Singal & Omar Besbes & Antoine Desir & Vineet Goyal & Garud Iyengar, 2022. "Shapley Meets Uniform: An Axiomatic Framework for Attribution in Online Advertising," Management Science, INFORMS, vol. 68(10), pages 7457-7479, October.
Ludovico Crippa & Yonatan Gur & Bar Light, 2022. "Equilibria in Repeated Games under No-Regret with Dynamic Benchmarks," Papers 2212.03152, arXiv.org, revised Jan 2025.
Galbiati, Marco & Soramäki, Kimmo, 2011. "An agent-based model of payment systems," Journal of Economic Dynamics and Control, Elsevier, vol. 35(6), pages 859-875, June.
- Marco Galbiati & Kimmo Soramaki, 2008. "An agent-based model of payment systems," Bank of England working papers 352, Bank of England.
Schipper, Burkhard C., 2021. "Discovery and equilibrium in games with unawareness," Journal of Economic Theory, Elsevier, vol. 198(C).
- Schipper, Burkhard C, 2018. "Discovery and Equilibrium in Games with Unawareness," MPRA Paper 86300, University Library of Munich, Germany.
- Burkhard C. Schipper, 2021. "Discovery and Equilibrium in Games with Unawareness," Working Papers 340, University of California, Davis, Department of Economics.
Mathieu Faure & Gregory Roth, 2010. "Stochastic Approximations of Set-Valued Dynamical Systems: Convergence with Positive Probability to an Attractor," Mathematics of Operations Research, INFORMS, vol. 35(3), pages 624-640, August.
Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.

More about this item

Keywords

; ; ; ; ; ; ; ;

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormnsc:v:65:y:2019:i:9:p:3952-3968. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Learning in Repeated Auctions with Budgets: Regret Minimization and Equilibrium

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Keywords

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data