IDEAS home Printed from https://ideas.repec.org/a/eee/dyncon/v36y2012i3p433-454.html
   My bibliography  Save this article

Popularity of reinforcement-based and belief-based learning models: An evolutionary approach

Author

Listed:
  • Dziubiński, Marcin
  • Roy, Jaideep

Abstract

In an evolutionary model, players from a given population meet randomly in pairs each instant to play a coordination game. At each instant, the learning model used is determined via some replicator dynamics that respects payoff fitness. We allow for two such models: a belief-based best-response model that uses a costly predictor, and a costless reinforcement-based one. This generates dynamics over the choice of learning models and the consequent choices of endogenous variables. We report conditions under which the long run outcomes are efficient (or inefficient) and they support the exclusive use of either of the models (or their co-existence).

Suggested Citation

  • Dziubiński, Marcin & Roy, Jaideep, 2012. "Popularity of reinforcement-based and belief-based learning models: An evolutionary approach," Journal of Economic Dynamics and Control, Elsevier, vol. 36(3), pages 433-454.
  • Handle: RePEc:eee:dyncon:v:36:y:2012:i:3:p:433-454
    DOI: 10.1016/j.jedc.2011.10.002
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0165188911001928
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jedc.2011.10.002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Fernando Vega-Redondo & Frédéric Palomino, 1999. "Convergence of aspirations and (partial) cooperation in the prisoner's dilemma," International Journal of Game Theory, Springer;Game Theory Society, vol. 28(4), pages 465-488.
    2. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    3. Goeree, Jacob K. & Hommes, Cars H., 2000. "Heterogeneous beliefs and the non-linear cobweb model," Journal of Economic Dynamics and Control, Elsevier, vol. 24(5-7), pages 761-798, June.
    4. Mookherjee Dilip & Sopher Barry, 1994. "Learning Behavior in an Experimental Matching Pennies Game," Games and Economic Behavior, Elsevier, vol. 7(1), pages 62-91, July.
    5. Chryssi Giannitsarou, 2003. "Heterogeneous Learning," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 6(4), pages 885-906, October.
    6. Robson, Arthur J. & Vega-Redondo, Fernando, 1996. "Efficient Equilibrium Selection in Evolutionary Games with Random Matching," Journal of Economic Theory, Elsevier, vol. 70(1), pages 65-92, July.
    7. Van Huyck, John B & Battalio, Raymond C & Beil, Richard O, 1990. "Tacit Coordination Games, Strategic Uncertainty, and Coordination Failure," American Economic Review, American Economic Association, vol. 80(1), pages 234-248, March.
    8. Josephson, Jens, 2009. "Stochastic adaptation in finite games played by heterogeneous populations," Journal of Economic Dynamics and Control, Elsevier, vol. 33(8), pages 1543-1554, August.
    9. Cooper, Russell, et al, 1990. "Selection Criteria in Coordination Games: Some Experimental Results," American Economic Review, American Economic Association, vol. 80(1), pages 218-233, March.
    10. Guse, Eran A., 2005. "Stability properties for learning with heterogeneous expectations and multiple equilibria," Journal of Economic Dynamics and Control, Elsevier, vol. 29(10), pages 1623-1642, October.
    11. Honkapohja, Seppo & Mitra, Kaushik, 2003. "Learning with bounded memory in stochastic models," Journal of Economic Dynamics and Control, Elsevier, vol. 27(8), pages 1437-1457, June.
    12. Jorgen W. Weibull, 1997. "Evolutionary Game Theory," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262731215, December.
    13. Bouchez, Nicole & Friedman, Daniel, 2008. "Equilibrium Convergence in Normal Form Games," Handbook of Experimental Economics Results, in: Charles R. Plott & Vernon L. Smith (ed.), Handbook of Experimental Economics Results, edition 1, volume 1, chapter 53, pages 472-480, Elsevier.
    14. Camerer, Colin F. & Ho, Teck H. & Chong, Juin-Kuan, 2008. "Learning and Equilibrium in Games," Handbook of Experimental Economics Results, in: Charles R. Plott & Vernon L. Smith (ed.), Handbook of Experimental Economics Results, edition 1, volume 1, chapter 66, pages 607-615, Elsevier.
    15. Crawford, Vincent P, 1995. "Adaptive Dynamics in Coordination Games," Econometrica, Econometric Society, vol. 63(1), pages 103-143, January.
    16. Ed Hopkins, 2002. "Two Competing Models of How People Learn in Games," Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
    17. Young, H Peyton, 1993. "The Evolution of Conventions," Econometrica, Econometric Society, vol. 61(1), pages 57-84, January.
    18. Young, H. Peyton, 2004. "Strategic Learning and its Limits," OUP Catalogue, Oxford University Press, number 9780199269181, Decembrie.
    19. Chiarella, Carl & He, Xue-Zhong, 2003. "Dynamics of beliefs and learning under aL-processes -- the heterogeneous case," Journal of Economic Dynamics and Control, Elsevier, vol. 27(3), pages 503-531, January.
    20. Brock, William A. & Hommes, Cars H. & Wagener, Florian O. O., 2005. "Evolutionary dynamics in markets with many trader types," Journal of Mathematical Economics, Elsevier, vol. 41(1-2), pages 7-42, February.
    21. Tesfatsion, Leigh & Judd, Kenneth L., 2006. "Handbook of Computational Economics, Vol. 2: Agent-Based Computational Economics," Staff General Research Papers Archive 10368, Iowa State University, Department of Economics.
    22. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
    23. Jeffrey S. Banks & Charles R. Plott & David P. Porter, 1988. "An Experimental Analysis of Unanimity in Public Goods Provision Mechanisms," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 55(2), pages 301-322.
    24. William A. Brock & Cars H. Hommes, 1997. "A Rational Route to Randomness," Econometrica, Econometric Society, vol. 65(5), pages 1059-1096, September.
    25. Kandori, Michihiro & Mailath, George J & Rob, Rafael, 1993. "Learning, Mutation, and Long Run Equilibria in Games," Econometrica, Econometric Society, vol. 61(1), pages 29-56, January.
    26. Evans, George W. & Honkapohja, Seppo & Marimon, Ramon, 2001. "Convergence In Monetary Inflation Models With Heterogeneous Learning Rules," Macroeconomic Dynamics, Cambridge University Press, vol. 5(1), pages 1-31, February.
    27. Bergin, James & Lipman, Barton L, 1996. "Evolution with State-Dependent Mutations," Econometrica, Econometric Society, vol. 64(4), pages 943-956, July.
    28. Hommes, Cars H., 2006. "Heterogeneous Agent Models in Economics and Finance," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 23, pages 1109-1186, Elsevier.
    29. Jonathan Bendor & Dilip Mookherjee & Debraj Ray, 2001. "Aspiration-Based Reinforcement Learning In Repeated Interaction Games: An Overview," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 3(02n03), pages 159-174.
    30. Brock, William A. & Hommes, Cars H., 1998. "Heterogeneous beliefs and routes to chaos in a simple asset pricing model," Journal of Economic Dynamics and Control, Elsevier, vol. 22(8-9), pages 1235-1274, August.
    31. William A. Brock & Cars H. Hommes, 2001. "A Rational Route to Randomness," Chapters, in: W. D. Dechert (ed.), Growth Theory, Nonlinear Dynamics and Economic Modelling, chapter 16, pages 402-438, Edward Elgar Publishing.
    32. Samuelson, Larry, 2001. "Introduction to the Evolution of Preferences," Journal of Economic Theory, Elsevier, vol. 97(2), pages 225-230, April.
    33. Dixon, Huw David, 2000. "Keeping up with the Joneses: competition and the evolution of collusion," Journal of Economic Behavior & Organization, Elsevier, vol. 43(2), pages 223-238, October.
    34. Leigh Tesfatsion & Kenneth L. Judd (ed.), 2006. "Handbook of Computational Economics," Handbook of Computational Economics, Elsevier, edition 1, volume 2, number 2.
    35. Juang, Wei-Torng, 2002. "Rule Evolution and Equilibrium Selection," Games and Economic Behavior, Elsevier, vol. 39(1), pages 71-90, April.
    36. Cheung, Yin-Wong & Friedman, Daniel, 1997. "Individual Learning in Normal Form Games: Some Laboratory Results," Games and Economic Behavior, Elsevier, vol. 19(1), pages 46-76, April.
    37. R. Isaac & David Schmidtz & James Walker, 1989. "The assurance problem in a laboratory market," Public Choice, Springer, vol. 62(3), pages 217-236, September.
    38. Roth, Alvin E & Schoumaker, Francoise, 1983. "Expectations and Reputations in Bargaining: An Experimental Study," American Economic Review, American Economic Association, vol. 73(3), pages 362-372, June.
    39. John B. Van Huyck & Raymond C. Battalio & Richard O. Beil, 1991. "Strategic Uncertainty, Equilibrium Selection, and Coordination Failure in Average Opinion Games," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 106(3), pages 885-910.
    40. Bendor Jonathan & Mookherjee Dilip & Ray Debraj, 2001. "Reinforcement Learning in Repeated Interaction Games," The B.E. Journal of Theoretical Economics, De Gruyter, vol. 1(1), pages 1-44, March.
    41. Friedman, Daniel, 1996. "Equilibrium in Evolutionary Games: Some Experimental Results," Economic Journal, Royal Economic Society, vol. 106(434), pages 1-25, January.
    42. Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    2. Hommes, Cars & Kiseleva, Tatiana & Kuznetsov, Yuri & Verbic, Miroslav, 2012. "Is More Memory In Evolutionary Selection (De)Stabilizing?," Macroeconomic Dynamics, Cambridge University Press, vol. 16(3), pages 335-357, June.
    3. Battalio,R. & Samuelson,L. & Huyck,J. van, 1998. "Risk dominance, payoff dominance and probabilistic choice learning," Working papers 2, Wisconsin Madison - Social Systems.
    4. J. Van Huyck & R. Battalio & F. Rankin, 1996. "On the Evolution of Convention: Evidence from Coordination Games," Levine's Working Paper Archive 548, David K. Levine.
    5. Amilon, Henrik, 2008. "Estimation of an adaptive stock market model with heterogeneous agents," Journal of Empirical Finance, Elsevier, vol. 15(2), pages 342-362, March.
    6. Hommes, Cars, 2011. "The heterogeneous expectations hypothesis: Some evidence from the lab," Journal of Economic Dynamics and Control, Elsevier, vol. 35(1), pages 1-24, January.
    7. Pangallo, Marco & Sanders, James B.T. & Galla, Tobias & Farmer, J. Doyne, 2022. "Towards a taxonomy of learning dynamics in 2 × 2 games," Games and Economic Behavior, Elsevier, vol. 132(C), pages 1-21.
    8. Napel, Stefan, 2003. "Aspiration adaptation in the ultimatum minigame," Games and Economic Behavior, Elsevier, vol. 43(1), pages 86-106, April.
    9. repec:wyi:journl:002151 is not listed on IDEAS
    10. Hommes, Cars, 2018. "Behavioral & experimental macroeconomics and policy analysis: a complex systems approach," Working Paper Series 2201, European Central Bank.
    11. Anufriev, Mikhail & Panchenko, Valentyn, 2009. "Asset prices, traders' behavior and market design," Journal of Economic Dynamics and Control, Elsevier, vol. 33(5), pages 1073-1090, May.
    12. Tai, Chung-Ching & Chen, Shu-Heng & Yang, Lee-Xieng, 2018. "Cognitive ability and earnings performance: Evidence from double auction market experiments," Journal of Economic Dynamics and Control, Elsevier, vol. 91(C), pages 409-440.
    13. Anufriev, Mikhail & Dindo, Pietro, 2010. "Wealth-driven selection in a financial market with heterogeneous agents," Journal of Economic Behavior & Organization, Elsevier, vol. 73(3), pages 327-358, March.
    14. Carl Chiarella & Roberto Dieci & Xue-Zhong He, 2013. "Time-varying beta: a boundedly rational equilibrium approach," Journal of Evolutionary Economics, Springer, vol. 23(3), pages 609-639, July.
    15. Mikhail Anufriev & Cars Hommes, 2012. "Evolutionary Selection of Individual Expectations and Aggregate Outcomes in Asset Pricing Experiments," American Economic Journal: Microeconomics, American Economic Association, vol. 4(4), pages 35-64, November.
    16. Pedro Dal Bo & Guillaume R. Frochette, 2011. "The Evolution of Cooperation in Infinitely Repeated Games: Experimental Evidence," American Economic Review, American Economic Association, vol. 101(1), pages 411-429, February.
    17. Chiarella, Carl & He, Xue-Zhong & Hommes, Cars, 2006. "A dynamic analysis of moving average rules," Journal of Economic Dynamics and Control, Elsevier, vol. 30(9-10), pages 1729-1753.
    18. Cabrales, Antonio & Garcia-Fontes, Walter & Motta, Massimo, 2000. "Risk dominance selects the leader: An experimental analysis," International Journal of Industrial Organization, Elsevier, vol. 18(1), pages 137-162, January.
    19. Cars Hommes, 2010. "The heterogeneous expectations hypothesis: some evidence from the lab," Post-Print hal-00753041, HAL.
    20. Shu-Heng Chen & Bin-Tzong Chie & Ying-Fang Kao & Ragupathy Venkatachalam, 2019. "Agent-Based Modeling of a Non-tâtonnement Process for the Scarf Economy: The Role of Learning," Computational Economics, Springer;Society for Computational Economics, vol. 54(1), pages 305-341, June.
    21. Hanaki, Nobuyuki & Ishikawa, Ryuichiro & Akiyama, Eizo, 2009. "Learning games," Journal of Economic Dynamics and Control, Elsevier, vol. 33(10), pages 1739-1756, October.

    More about this item

    Keywords

    Co-evolution; Best-response; Aspirations; Coordination games;
    All these keywords.

    JEL classification:

    • D01 - Microeconomics - - General - - - Microeconomic Behavior: Underlying Principles
    • D03 - Microeconomics - - General - - - Behavioral Microeconomics: Underlying Principles
    • D70 - Microeconomics - - Analysis of Collective Decision-Making - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:dyncon:v:36:y:2012:i:3:p:433-454. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jedc .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.