Aspiration-Based Reinforcement Learning In Repeated Interaction Games: An Overview

My bibliography Save this article

Aspiration-Based Reinforcement Learning In Repeated Interaction Games: An Overview

Author

Listed:

JONATHAN BENDOR
(Graduate School of Business, Stanford University, 518 Memorial Way, Stanford, CA 94305-5015, USA)
DILIP MOOKHERJEE
(Department of Economics, Boston University, 270 Bay State Road, Boston, MA 02215, USA)
DEBRAJ RAY
(Department of Economics, New York University, 269 Mercer St, NY 10003, USA)

Registered:

Abstract

In models of aspiration-based reinforcement learning, agents adapt by comparing payoffs achieved from actions chosen in the past with an aspiration level. Though such models are well-established in behavioural psychology, only recently have they begun to receive attention in game theory and its applications to economics and politics. This paper provides an informal overview of a range of such theories applied to repeated interaction games. We describe different models of aspiration formation: where (1) aspirations are fixed but required to be consistent with longrun average payoffs; (2) aspirations evolve based on past personal experience or of previous generations of players; and (3) aspirations are based on the experience of peers. Convergence to non-Nash outcomes may result in either of these formulations. Indeed, cooperative behaviour can emerge and survive in the long run, even though it may be a strictly dominated strategy in the stage game, and despite the myopic adaptation of stage game strategies. Differences between reinforcement learning and evolutionary game theory are also discussed.

Suggested Citation

Jonathan Bendor & Dilip Mookherjee & Debraj Ray, 2001. "Aspiration-Based Reinforcement Learning In Repeated Interaction Games: An Overview," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 3(02n03), pages 159-174.

Handle: RePEc:wsi:igtrxx:v:03:y:2001:i:02n03:n:s0219198901000348
DOI: 10.1142/S0219198901000348

Download full text from publisher

As the access to this document is restricted, you may want to search for a different version of it.

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Heinrich H. Nax, 2016. "When is Market the Benchmark? Reinforcement Evidence from Repurchase Decisions," Economics Series Working Papers 781, University of Oxford, Department of Economics.
- Peiran Jiao & Heinrich H. Nax, 2016. "When is Market the Benchmark? Reinforcement Evidence from Repurchase Decisions," Economics Papers 2016-W01, Economics Group, Nuffield College, University of Oxford.
Takahiro Ezaki & Naoki Masuda, 2017. "Reinforcement learning account of network reciprocity," PLOS ONE, Public Library of Science, vol. 12(12), pages 1-8, December.
Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
- John Duffy, 2004. "Agent-Based Models and Human Subject Experiments," Computational Economics 0412001, University Library of Munich, Germany.
Li, Cong & Xu, Hedong & Fan, Suohai, 2020. "Synergistic effects of self-optimization and imitation rules on the evolution of cooperation in the investor sharing game," Applied Mathematics and Computation, Elsevier, vol. 370(C).
Dziubiński, Marcin & Roy, Jaideep, 2012. "Popularity of reinforcement-based and belief-based learning models: An evolutionary approach," Journal of Economic Dynamics and Control, Elsevier, vol. 36(3), pages 433-454.
Izquierdo, Luis R. & Izquierdo, Segismundo S. & Gotts, Nicholas M. & Polhill, J. Gary, 2007. "Transient and asymptotic dynamics of reinforcement learning in games," Games and Economic Behavior, Elsevier, vol. 61(2), pages 259-276, November.
E. J. Anderson & T. D. H. Cau, 2009. "Modeling Implicit Collusion Using Coevolution," Operations Research, INFORMS, vol. 57(2), pages 439-455, April.
Oindrila Dey & Debalina Chakravarty, 2020. "Electric Street Car as a Clean Public Transport Alternative: A Choice Experiment Approach," Working Papers 2042, Indian Institute of Foreign Trade.
- Dey, Oindrila & Chakravarty, Debalina, 2020. "Electric Street Car as a Clean Public Transport Alternative: A Choice Experiment Approach," MPRA Paper 101000, University Library of Munich, Germany, revised Apr 2020.
Siegfried Berninghaus & Werner Güth & M. Vittoria Levati & Jianying Qiu, 2006. "Satisficing in sales competition: experimental evidence," Papers on Strategic Interaction 2006-32, Max Planck Institute of Economics, Strategic Interaction Group.
- Siegfried Berninghaus & Werner Güth & M. Vittoria Levati & Jianying Qiu, 2009. "Satisficing in sales competition: experimental evidence," Working Papers 2009-14, Faculty of Economics and Statistics, Universität Innsbruck.
- Berninghaus, Siegrfried K. & Güth, Werner & Levati, M. Vittoria & Qiu, Jianying, 2007. "Satisficing in sales competition : experimental evidence," Papers 07-03, Sonderforschungsbreich 504.
- Berninghaus, Siegfried K. & Gueth, Werner & Levati, M. Vittoria & Qiu, Jianying, 2007. "Satisficing in sales competition: experimental evidence," Sonderforschungsbereich 504 Publications 07-03, Sonderforschungsbereich 504, Universität Mannheim;Sonderforschungsbereich 504, University of Mannheim.
Lekfuangfu, Warn N. & Odermatt, Reto, 2022. "All I have to do is dream? The role of aspirations in intergenerational mobility and well-being," European Economic Review, Elsevier, vol. 148(C).
- Warn N. Lekfuangfu & Reto Odermatt, 2020. "All I have to do is dream? The role of aspirations in intergenerational mobility and well-being," RF Berlin - CReAM Discussion Paper Series 2016, Rockwool Foundation Berlin (RF Berlin) - Centre for Research and Analysis of Migration (CReAM).
- Nuarpear Lekfuangfu & Reto Odermatt, 2020. "All I Have to Do Is Dream? The Role of Aspirations in Intergenerational Mobility and Well-Being," PIER Discussion Papers 142, Puey Ungphakorn Institute for Economic Research.
Sung-youn Kim, 2012. "A model of political information-processing and learning cooperation in the repeated Prisonerâ€™s Dilemma," Journal of Theoretical Politics, , vol. 24(1), pages 46-65, January.
Takahiro Ezaki & Yutaka Horita & Masanori Takezawa & Naoki Masuda, 2016. "Reinforcement Learning Explains Conditional Cooperation and Its Moody Cousin," PLOS Computational Biology, Public Library of Science, vol. 12(7), pages 1-13, July.
MacLeod, W. Bentley & Pingle, Mark, 2005. "Aspiration uncertainty: its impact on decision performance and process," Journal of Economic Behavior & Organization, Elsevier, vol. 56(4), pages 617-629, April.
Heymann, D. & Kawamura, E. & Perazzo, R. & Zimmermann, M.G., 2014. "Behavioral heuristics and market patterns in a Bertrand–Edgeworth game," Journal of Economic Behavior & Organization, Elsevier, vol. 105(C), pages 124-139.
- Daniel Heymann & Enrique Kawamura & Roberto Perazzo & Martin Zimmermann, 2011. "Behavioral Heuristics and Market Patterns in a Bertrand-Edgeworth Game," Working Papers 108, Universidad de San Andres, Departamento de Economia, revised Mar 2011.
Marcin Dziubinski & Jaideep Roy, 2007. "Endogenous Selection of Aspiring and Rational rules in Coordination Games," CEDI Discussion Paper Series 07-14, Centre for Economic Development and Institutions(CEDI), Brunel University.
- Dziubinski, Marcin & Roy, Jaideep, 2007. "Endogenous selection of aspiring and rational rules in coordination games," MPRA Paper 5941, University Library of Munich, Germany.
Yu Zhang & Jason Leezer, 2010. "Simulating human-like decisions in a memory-based agent model," Computational and Mathematical Organization Theory, Springer, vol. 16(4), pages 373-399, December.
Napel, Stefan, 2003. "Aspiration adaptation in the ultimatum minigame," Games and Economic Behavior, Elsevier, vol. 43(1), pages 86-106, April.
Rajiv Sarin & Hyun Chang Yi, 2020. "A Model of Satisficing Behaviour," Working Papers 2020-21, Economic Research Institute, Bank of Korea.
He, Zhongzhi (Lawrence), 2023. "A Gradient-based reinforcement learning model of market equilibration," Journal of Economic Dynamics and Control, Elsevier, vol. 152(C).
Huw Dixon, 2020. "Almost‐Maximization as a Behavioral Theory of the Firm: Static, Dynamic and Evolutionary Perspectives," Review of Industrial Organization, Springer;The Industrial Organization Society, vol. 56(2), pages 237-258, March.

More about this item

JEL classification:

B4 - Schools of Economic Thought and Methodology - - Economic Methodology
C0 - Mathematical and Quantitative Methods - - General
C6 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling
C7 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory
D5 - Microeconomics - - General Equilibrium and Disequilibrium
D7 - Microeconomics - - Analysis of Collective Decision-Making
M2 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Business Economics

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:igtrxx:v:03:y:2001:i:02n03:n:s0219198901000348. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

We have no bibliographic references for this item. You can help adding them by using this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/igtr/igtr.shtml .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Aspiration-Based Reinforcement Learning In Repeated Interaction Games: An Overview

Author

Abstract

Suggested Citation

Download full text from publisher

Citations

More about this item

JEL classification:

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data