IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2203.14126.html
   My bibliography  Save this paper

Robust No-Regret Learning in Min-Max Stackelberg Games

Author

Listed:
  • Denizalp Goktas
  • Jiayi Zhao
  • Amy Greenwald

Abstract

The behavior of no-regret learning algorithms is well understood in two-player min-max (i.e, zero-sum) games. In this paper, we investigate the behavior of no-regret learning in min-max games with dependent strategy sets, where the strategy of the first player constrains the behavior of the second. Such games are best understood as sequential, i.e., min-max Stackelberg, games. We consider two settings, one in which only the first player chooses their actions using a no-regret algorithm while the second player best responds, and one in which both players use no-regret algorithms. For the former case, we show that no-regret dynamics converge to a Stackelberg equilibrium. For the latter case, we introduce a new type of regret, which we call Lagrangian regret, and show that if both players minimize their Lagrangian regrets, then play converges to a Stackelberg equilibrium. We then observe that online mirror descent (OMD) dynamics in these two settings correspond respectively to a known nested (i.e., sequential) gradient descent-ascent (GDA) algorithm and a new simultaneous GDA-like algorithm, thereby establishing convergence of these algorithms to Stackelberg equilibrium. Finally, we analyze the robustness of OMD dynamics to perturbations by investigating online min-max Stackelberg games. We prove that OMD dynamics are robust for a large class of online min-max games with independent strategy sets. In the dependent case, we demonstrate the robustness of OMD dynamics experimentally by simulating them in online Fisher markets, a canonical example of a min-max Stackelberg game with dependent strategy sets.

Suggested Citation

  • Denizalp Goktas & Jiayi Zhao & Amy Greenwald, 2022. "Robust No-Regret Learning in Min-Max Stackelberg Games," Papers 2203.14126, arXiv.org, revised Apr 2022.
  • Handle: RePEc:arx:papers:2203.14126
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2203.14126
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Paul Milgrom & Ilya Segal, 2002. "Envelope Theorems for Arbitrary Choice Sets," Econometrica, Econometric Society, vol. 70(2), pages 583-601, March.
    2. William C. Brainard & Herbert E. Scarf, 2005. "How to Compute Equilibrium Prices in 1891," American Journal of Economics and Sociology, Wiley Blackwell, vol. 64(1), pages 57-83, January.
    3. Morton Slater, 1959. "Lagrange Multipliers Revisited," Cowles Foundation Discussion Papers 80, Cowles Foundation for Research in Economics, Yale University.
    4. Francisco Facchinei & Christian Kanzow, 2010. "Generalized Nash Equilibrium Problems," Annals of Operations Research, Springer, vol. 175(1), pages 177-211, March.
    5. ,, 2000. "Problems And Solutions," Econometric Theory, Cambridge University Press, vol. 16(2), pages 287-299, April.
    6. NESTEROV, Yurii & SCRIMALI, Laura, 2011. "Solving strongly monotone variational and quasi-variational inequalities," LIDAM Reprints CORE 2357, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
    7. A. Nedić & A. Ozdaglar, 2009. "Subgradient Methods for Saddle-Point Problems," Journal of Optimization Theory and Applications, Springer, vol. 142(1), pages 205-228, July.
    8. Charles R. Harris & K. Jarrod Millman & Stéfan J. Walt & Ralf Gommers & Pauli Virtanen & David Cournapeau & Eric Wieser & Julian Taylor & Sebastian Berg & Nathaniel J. Smith & Robert Kern & Matti Picu, 2020. "Array programming with NumPy," Nature, Nature, vol. 585(7825), pages 357-362, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Denizalp Goktas & Jiayi Zhao & Amy Greenwald, 2022. "Zero-Sum Stochastic Stackelberg Games," Papers 2211.13847, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Denizalp Goktas & Amy Greenwald, 2022. "Gradient Descent Ascent in Min-Max Stackelberg Games," Papers 2208.09690, arXiv.org.
    2. Denizalp Goktas & Jiayi Zhao & Amy Greenwald, 2023. "T\^atonnement in Homothetic Fisher Markets," Papers 2306.04890, arXiv.org.
    3. Vinayaka G. Yaji & Shalabh Bhatnagar, 2020. "Stochastic Recursive Inclusions in Two Timescales with Nonadditive Iterate-Dependent Markov Noise," Mathematics of Operations Research, INFORMS, vol. 45(4), pages 1405-1444, November.
    4. Shipra Singh & Aviv Gibali & Simeon Reich, 2021. "Multi-Time Generalized Nash Equilibria with Dynamic Flow Applications," Mathematics, MDPI, vol. 9(14), pages 1-23, July.
    5. Giancarlo Bigi & Mauro Passacantando, 2016. "Gap functions for quasi-equilibria," Journal of Global Optimization, Springer, vol. 66(4), pages 791-810, December.
    6. David Cerezo S'anchez, 2021. "JUBILEE: Secure Debt Relief and Forgiveness," Papers 2109.07267, arXiv.org.
    7. Stevanovic Dalibor, 2016. "Common time variation of parameters in reduced-form macroeconomic models," Studies in Nonlinear Dynamics & Econometrics, De Gruyter, vol. 20(2), pages 159-183, April.
    8. Kaido, Hiroaki, 2017. "Asymptotically Efficient Estimation Of Weighted Average Derivatives With An Interval Censored Variable," Econometric Theory, Cambridge University Press, vol. 33(5), pages 1218-1241, October.
    9. Wenqing Chen & Melvyn Sim & Jie Sun & Chung-Piaw Teo, 2010. "From CVaR to Uncertainty Set: Implications in Joint Chance-Constrained Optimization," Operations Research, INFORMS, vol. 58(2), pages 470-485, April.
    10. A. Fadlelmawla & M. Al-Otaibi, 2005. "Analysis of the Water Resources Status in Kuwait," Water Resources Management: An International Journal, Published for the European Water Resources Association (EWRA), Springer;European Water Resources Association (EWRA), vol. 19(5), pages 555-570, October.
    11. Loebbing, Jonas, 2018. "An Elementary Theory of Endogenous Technical Change and Wage Inequality," VfS Annual Conference 2018 (Freiburg, Breisgau): Digital Economy 181603, Verein für Socialpolitik / German Economic Association.
    12. Tan Wang & L. Jeff Hong, 2023. "Large-Scale Inventory Optimization: A Recurrent Neural Networks–Inspired Simulation Approach," INFORMS Journal on Computing, INFORMS, vol. 35(1), pages 196-215, January.
    13. Stefan Mišković, 2017. "A VNS-LP algorithm for the robust dynamic maximal covering location problem," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 39(4), pages 1011-1033, October.
    14. Stefanie Stantcheva, 2020. "Dynamic Taxation," Annual Review of Economics, Annual Reviews, vol. 12(1), pages 801-831, August.
    15. Duan, Jinyun & Li, Chenwei & Xu, Yue & Wu, Chia-Huei, 2017. "Transformational leadership and employee voice behavior: a Pygmalion mechanism," LSE Research Online Documents on Economics 68035, London School of Economics and Political Science, LSE Library.
    16. Hota, Monali & Bartsch, Fabian, 2019. "Consumer socialization in childhood and adolescence: Impact of psychological development and family structure," Journal of Business Research, Elsevier, vol. 105(C), pages 11-20.
    17. Oliver Stein & Nathan Sudermann-Merx, 2016. "The Cone Condition and Nonsmoothness in Linear Generalized Nash Games," Journal of Optimization Theory and Applications, Springer, vol. 170(2), pages 687-709, August.
    18. Léon Faure & Bastien Mollet & Wolfram Liebermeister & Jean-Loup Faulon, 2023. "A neural-mechanistic hybrid approach improving the predictive power of genome-scale metabolic models," Nature Communications, Nature, vol. 14(1), pages 1-14, December.
    19. Abernethy, Margaret A. & Vagnoni, Emidia, 2004. "Power, organization design and managerial behaviour," Accounting, Organizations and Society, Elsevier, vol. 29(3-4), pages 207-225.
    20. Koessler, Frédéric & Skreta, Vasiliki, 2016. "Informed seller with taste heterogeneity," Journal of Economic Theory, Elsevier, vol. 165(C), pages 456-471.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2203.14126. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.