IDEAS home Printed from https://ideas.repec.org/a/eee/dyncon/v33y2009i8p1593-1603.html

Single-leader-multiple-follower games with boundedly rational agents

Author

Listed:
  • Tharakunnel, Kurian
  • Bhattacharyya, Siddhartha

Abstract

This paper studies a class of hierarchical games called single-leader-multiple-follower games (SLMFGs) that have important applications in economics and engineering. We consider such games in the context of boundedly rational agents that are limited in the information and computational power they may possess. Agents in our SLMFG are modeled as adaptive learners that use simple reinforcement learning schemes to learn their optimal behavior. The proposed learning approach is illustrated using a well-studied problem in economics. It is shown that with a patiently learning leader the repeated plays of the game result in approximate equilibrium outcomes.

Suggested Citation

  • Tharakunnel, Kurian & Bhattacharyya, Siddhartha, 2009. "Single-leader-multiple-follower games with boundedly rational agents," Journal of Economic Dynamics and Control, Elsevier, vol. 33(8), pages 1593-1603, August.
  • Handle: RePEc:eee:dyncon:v:33:y:2009:i:8:p:1593-1603
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0165-1889(09)00046-3
    Download Restriction: Full text for ScienceDirect subscribers only
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
    2. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    3. Rudolf Vetschera, 2003. "Experimentation and Learning in Repeated Cooperation," Computational and Mathematical Organization Theory, Springer, vol. 9(1), pages 37-60, May.
    4. Kalai, Ehud & Ledyard, John O., 1998. "Repeated Implementation," Journal of Economic Theory, Elsevier, vol. 83(2), pages 308-317, December.
    5. Kutschinski, Erich & Uthmann, Thomas & Polani, Daniel, 2003. "Learning competitive pricing strategies by multi-agent reinforcement learning," Journal of Economic Dynamics and Control, Elsevier, vol. 27(11), pages 2207-2218.
    6. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
    7. Radner, Roy, 1985. "Repeated Principal-Agent Games with Discounting," Econometrica, Econometric Society, vol. 53(5), pages 1173-1198, September.
    8. Kutschinski, Erich & Uthmann, Thomas & Polani, Daniel, 2003. "Learning competitive pricing strategies by multi-agent reinforcement learning," Journal of Economic Dynamics and Control, Elsevier, vol. 27(11-12), pages 2207-2218, September.
    9. William A. Brock & Cars H. Hommes, 2001. "A Rational Route to Randomness," Chapters, in: W. D. Dechert (ed.), Growth Theory, Nonlinear Dynamics and Economic Modelling, chapter 16, pages 402-438, Edward Elgar Publishing.
    10. Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
    11. Alemdar, Nedim M. & Sirakaya, Sibel, 2003. "On-line computation of Stackelberg equilibria with synchronous parallel genetic algorithms," Journal of Economic Dynamics and Control, Elsevier, vol. 27(8), pages 1503-1515, June.
    12. Vallee, Thomas & Basar, Tamer, 1999. "Off-Line Computation of Stackelberg Solutions with the Genetic Algorithm," Computational Economics, Springer;Society for Computational Economics, vol. 13(3), pages 201-209, June.
    13. Groves, Theodore, 1973. "Incentives in Teams," Econometrica, Econometric Society, vol. 41(4), pages 617-631, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Junyi Xu, 2021. "Reinforcement Learning in a Cournot Oligopoly Model," Computational Economics, Springer;Society for Computational Economics, vol. 58(4), pages 1001-1024, December.
    2. Guoling Wang & Miao Wang & Hui Yang & Guanghui Yang & Chun Wang, 2024. "Existence of $$\alpha $$ α -Robust Weak Nash Equilibria for Leader–Follower Population Games with Fuzzy Parameters," Journal of Optimization Theory and Applications, Springer, vol. 203(3), pages 2739-2758, December.
    3. Grigory Belyavsky & Natalya Danilova & Guennady Ougolnitsky, 2018. "A Markovian Mechanism of Proportional Resource Allocation in the Incentive Model as a Dynamic Stochastic Inverse Stackelberg Game," Mathematics, MDPI, vol. 6(8), pages 1-10, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hanaki, Nobuyuki & Ishikawa, Ryuichiro & Akiyama, Eizo, 2009. "Learning games," Journal of Economic Dynamics and Control, Elsevier, vol. 33(10), pages 1739-1756, October.
    2. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    3. Arthur Charpentier & Romuald Élie & Carl Remlinger, 2023. "Reinforcement Learning in Economics and Finance," Computational Economics, Springer;Society for Computational Economics, vol. 62(1), pages 425-462, June.
    4. Mauersberger, Felix, 2019. "Thompson Sampling: Endogenously Random Behavior in Games and Markets," VfS Annual Conference 2019 (Leipzig): 30 Years after the Fall of the Berlin Wall - Democracy and Market Economy 203600, Verein für Socialpolitik / German Economic Association.
    5. Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
    6. Junyi Xu, 2021. "Reinforcement Learning in a Cournot Oligopoly Model," Computational Economics, Springer;Society for Computational Economics, vol. 58(4), pages 1001-1024, December.
    7. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
    8. Benaïm, Michel & Hofbauer, Josef & Hopkins, Ed, 2009. "Learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 144(4), pages 1694-1709, July.
    9. Erhao Xie, 2019. "Monetary Payoff and Utility Function in Adaptive Learning Models," Staff Working Papers 19-50, Bank of Canada.
    10. B Kelsey Jack, 2009. "Auctioning Conservation Contracts in Indonesia - Participant Learning in Multiple Trial Rounds," CID Working Papers 35, Center for International Development at Harvard University.
    11. Waters, George A., 2009. "Chaos in the cobweb model with a new learning dynamic," Journal of Economic Dynamics and Control, Elsevier, vol. 33(6), pages 1201-1216, June.
    12. Brit Grosskopf & Ido Erev & Eldad Yechiam, 2006. "Foregone with the Wind: Indirect Payoff Information and its Implications for Choice," International Journal of Game Theory, Springer;Game Theory Society, vol. 34(2), pages 285-302, August.
    13. Jacob K. Goeree & Charles A. Holt, 2001. "Ten Little Treasures of Game Theory and Ten Intuitive Contradictions," American Economic Review, American Economic Association, vol. 91(5), pages 1402-1422, December.
    14. Andreas Nicklisch, 2011. "Learning strategic environments: an experimental study of strategy formation and transfer," Theory and Decision, Springer, vol. 71(4), pages 539-558, October.
    15. Prajapati, Hari Ram, 2012. "An Application of Game Theory in Strategic Decision of Marriage Occurrence," MPRA Paper 105344, University Library of Munich, Germany, revised 2013.
    16. Giovanna Devetag, 2000. "Coordination in "Critical Mass" Games: An Experimental Study," LEM Papers Series 2000/03, Laboratory of Economics and Management (LEM), Sant'Anna School of Advanced Studies, Pisa, Italy.
    17. Rick, Scott & Weber, Roberto A., 2010. "Meaningful learning and transfer of learning in games played repeatedly without feedback," Games and Economic Behavior, Elsevier, vol. 68(2), pages 716-730, March.
    18. Nobuyuki Hanaki, 2007. "Individual and Social Learning," Computational Economics, Springer;Society for Computational Economics, vol. 29(3), pages 421-421, May.
    19. Torsten Heinrich & Yoojin Jang & Luca Mungo & Marco Pangallo & Alex Scott & Bassel Tarbush & Samuel Wiese, 2021. "Best-response dynamics, playing sequences, and convergence to equilibrium in random games," Papers 2101.04222, arXiv.org, revised Nov 2022.
    20. Sonnemans, Joep & Hommes, Cars & Tuinstra, Jan & van de Velden, Henk, 2004. "The instability of a heterogeneous cobweb economy: a strategy experiment on expectation formation," Journal of Economic Behavior & Organization, Elsevier, vol. 54(4), pages 453-481, August.

    More about this item

    Keywords

    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:dyncon:v:33:y:2009:i:8:p:1593-1603. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jedc .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.