IDEAS home Printed from https://ideas.repec.org/a/wsi/igtrxx/v03y2001i02n03ns0219198901000348.html
   My bibliography  Save this article

Aspiration-Based Reinforcement Learning In Repeated Interaction Games: An Overview

Author

Listed:
  • JONATHAN BENDOR

    (Graduate School of Business, Stanford University, 518 Memorial Way, Stanford, CA 94305-5015, USA)

  • DILIP MOOKHERJEE

    (Department of Economics, Boston University, 270 Bay State Road, Boston, MA 02215, USA)

  • DEBRAJ RAY

    (Department of Economics, New York University, 269 Mercer St, NY 10003, USA)

Abstract

In models of aspiration-based reinforcement learning, agents adapt by comparing payoffs achieved from actions chosen in the past with an aspiration level. Though such models are well-established in behavioural psychology, only recently have they begun to receive attention in game theory and its applications to economics and politics. This paper provides an informal overview of a range of such theories applied to repeated interaction games. We describe different models of aspiration formation: where (1) aspirations are fixed but required to be consistent with longrun average payoffs; (2) aspirations evolve based on past personal experience or of previous generations of players; and (3) aspirations are based on the experience of peers. Convergence to non-Nash outcomes may result in either of these formulations. Indeed, cooperative behaviour can emerge and survive in the long run, even though it may be a strictly dominated strategy in the stage game, and despite the myopic adaptation of stage game strategies. Differences between reinforcement learning and evolutionary game theory are also discussed.

Suggested Citation

  • Jonathan Bendor & Dilip Mookherjee & Debraj Ray, 2001. "Aspiration-Based Reinforcement Learning In Repeated Interaction Games: An Overview," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 3(02n03), pages 159-174.
  • Handle: RePEc:wsi:igtrxx:v:03:y:2001:i:02n03:n:s0219198901000348
    DOI: 10.1142/S0219198901000348
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219198901000348
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219198901000348?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Heinrich H. Nax, 2016. "When is Market the Benchmark? Reinforcement Evidence from Repurchase Decisions," Economics Series Working Papers 781, University of Oxford, Department of Economics.
    2. Takahiro Ezaki & Naoki Masuda, 2017. "Reinforcement learning account of network reciprocity," PLOS ONE, Public Library of Science, vol. 12(12), pages 1-8, December.
    3. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    4. Li, Cong & Xu, Hedong & Fan, Suohai, 2020. "Synergistic effects of self-optimization and imitation rules on the evolution of cooperation in the investor sharing game," Applied Mathematics and Computation, Elsevier, vol. 370(C).
    5. Dziubiński, Marcin & Roy, Jaideep, 2012. "Popularity of reinforcement-based and belief-based learning models: An evolutionary approach," Journal of Economic Dynamics and Control, Elsevier, vol. 36(3), pages 433-454.
    6. Izquierdo, Luis R. & Izquierdo, Segismundo S. & Gotts, Nicholas M. & Polhill, J. Gary, 2007. "Transient and asymptotic dynamics of reinforcement learning in games," Games and Economic Behavior, Elsevier, vol. 61(2), pages 259-276, November.
    7. E. J. Anderson & T. D. H. Cau, 2009. "Modeling Implicit Collusion Using Coevolution," Operations Research, INFORMS, vol. 57(2), pages 439-455, April.
    8. Oindrila Dey & Debalina Chakravarty, 2020. "Electric Street Car as a Clean Public Transport Alternative: A Choice Experiment Approach," Working Papers 2042, Indian Institute of Foreign Trade.
    9. Siegfried Berninghaus & Werner Güth & M. Vittoria Levati & Jianying Qiu, 2006. "Satisficing in sales competition: experimental evidence," Papers on Strategic Interaction 2006-32, Max Planck Institute of Economics, Strategic Interaction Group.
    10. Lekfuangfu, Warn N. & Odermatt, Reto, 2022. "All I have to do is dream? The role of aspirations in intergenerational mobility and well-being," European Economic Review, Elsevier, vol. 148(C).
    11. Sung-youn Kim, 2012. "A model of political information-processing and learning cooperation in the repeated Prisoner’s Dilemma," Journal of Theoretical Politics, , vol. 24(1), pages 46-65, January.
    12. Takahiro Ezaki & Yutaka Horita & Masanori Takezawa & Naoki Masuda, 2016. "Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-13, July.
    13. MacLeod, W. Bentley & Pingle, Mark, 2005. "Aspiration uncertainty: its impact on decision performance and process," Journal of Economic Behavior & Organization, Elsevier, vol. 56(4), pages 617-629, April.
    14. Heymann, D. & Kawamura, E. & Perazzo, R. & Zimmermann, M.G., 2014. "Behavioral heuristics and market patterns in a Bertrand–Edgeworth game," Journal of Economic Behavior & Organization, Elsevier, vol. 105(C), pages 124-139.
    15. Marcin Dziubinski & Jaideep Roy, 2007. "Endogenous Selection of Aspiring and Rational rules in Coordination Games," CEDI Discussion Paper Series 07-14, Centre for Economic Development and Institutions(CEDI), Brunel University.
    16. Yu Zhang & Jason Leezer, 2010. "Simulating human-like decisions in a memory-based agent model," Computational and Mathematical Organization Theory, Springer, vol. 16(4), pages 373-399, December.
    17. Napel, Stefan, 2003. "Aspiration adaptation in the ultimatum minigame," Games and Economic Behavior, Elsevier, vol. 43(1), pages 86-106, April.
    18. Rajiv Sarin & Hyun Chang Yi, 2020. "A Model of Satisficing Behaviour," Working Papers 2020-21, Economic Research Institute, Bank of Korea.
    19. He, Zhongzhi (Lawrence), 2023. "A Gradient-based reinforcement learning model of market equilibration," Journal of Economic Dynamics and Control, Elsevier, vol. 152(C).
    20. Huw Dixon, 2020. "Almost‐Maximization as a Behavioral Theory of the Firm: Static, Dynamic and Evolutionary Perspectives," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 56(2), pages 237-258, March.

    More about this item

    JEL classification:

    • B4 - Schools of Economic Thought and Methodology - - Economic Methodology
    • C0 - Mathematical and Quantitative Methods - - General
    • C6 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling
    • C7 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory
    • D5 - Microeconomics - - General Equilibrium and Disequilibrium
    • D7 - Microeconomics - - Analysis of Collective Decision-Making
    • M2 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Business Economics

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:igtrxx:v:03:y:2001:i:02n03:n:s0219198901000348. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/igtr/igtr.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.