IDEAS home Printed from https://ideas.repec.org/a/eee/dyncon/v36y2012i3p433-454.html

Popularity of reinforcement-based and belief-based learning models: An evolutionary approach

Author

Listed:
  • Dziubiński, Marcin
  • Roy, Jaideep

Abstract

In an evolutionary model, players from a given population meet randomly in pairs each instant to play a coordination game. At each instant, the learning model used is determined via some replicator dynamics that respects payoff fitness. We allow for two such models: a belief-based best-response model that uses a costly predictor, and a costless reinforcement-based one. This generates dynamics over the choice of learning models and the consequent choices of endogenous variables. We report conditions under which the long run outcomes are efficient (or inefficient) and they support the exclusive use of either of the models (or their co-existence).

Suggested Citation

  • Dziubiński, Marcin & Roy, Jaideep, 2012. "Popularity of reinforcement-based and belief-based learning models: An evolutionary approach," Journal of Economic Dynamics and Control, Elsevier, vol. 36(3), pages 433-454.
  • Handle: RePEc:eee:dyncon:v:36:y:2012:i:3:p:433-454
    DOI: 10.1016/j.jedc.2011.10.002
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0165188911001928
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jedc.2011.10.002?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    2. Goeree, Jacob K. & Hommes, Cars H., 2000. "Heterogeneous beliefs and the non-linear cobweb model," Journal of Economic Dynamics and Control, Elsevier, vol. 24(5-7), pages 761-798, June.
    3. Mookherjee Dilip & Sopher Barry, 1994. "Learning Behavior in an Experimental Matching Pennies Game," Games and Economic Behavior, Elsevier, vol. 7(1), pages 62-91, July.
    4. Chryssi Giannitsarou, 2003. "Heterogeneous Learning," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 6(4), pages 885-906, October.
    5. Josephson, Jens, 2009. "Stochastic adaptation in finite games played by heterogeneous populations," Journal of Economic Dynamics and Control, Elsevier, vol. 33(8), pages 1543-1554, August.
    6. Guse, Eran A., 2005. "Stability properties for learning with heterogeneous expectations and multiple equilibria," Journal of Economic Dynamics and Control, Elsevier, vol. 29(10), pages 1623-1642, October.
    7. Cooper, Russell, et al, 1990. "Selection Criteria in Coordination Games: Some Experimental Results," American Economic Review, American Economic Association, vol. 80(1), pages 218-233, March.
    8. Honkapohja, Seppo & Mitra, Kaushik, 2003. "Learning with bounded memory in stochastic models," Journal of Economic Dynamics and Control, Elsevier, vol. 27(8), pages 1437-1457, June.
    9. Jorgen W. Weibull, 1997. "Evolutionary Game Theory," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262731215, December.
    10. Camerer, Colin F. & Ho, Teck H. & Chong, Juin-Kuan, 2008. "Learning and Equilibrium in Games," Handbook of Experimental Economics Results, in: Charles R. Plott & Vernon L. Smith (ed.), Handbook of Experimental Economics Results, edition 1, volume 1, chapter 66, pages 607-615, Elsevier.
    11. Crawford, Vincent P, 1995. "Adaptive Dynamics in Coordination Games," Econometrica, Econometric Society, vol. 63(1), pages 103-143, January.
    12. Young, H Peyton, 1993. "The Evolution of Conventions," Econometrica, Econometric Society, vol. 61(1), pages 57-84, January.
    13. Young, H. Peyton, 2004. "Strategic Learning and its Limits," OUP Catalogue, Oxford University Press, number 9780199269181.
    14. Van Huyck, John B & Battalio, Raymond C & Beil, Richard O, 1990. "Tacit Coordination Games, Strategic Uncertainty, and Coordination Failure," American Economic Review, American Economic Association, vol. 80(1), pages 234-248, March.
    15. Chiarella, Carl & He, Xue-Zhong, 2003. "Dynamics of beliefs and learning under aL-processes -- the heterogeneous case," Journal of Economic Dynamics and Control, Elsevier, vol. 27(3), pages 503-531, January.
    16. Tesfatsion, Leigh & Judd, Kenneth L., 2006. "Handbook of Computational Economics, Vol. 2: Agent-Based Computational Economics," Staff General Research Papers Archive 10368, Iowa State University, Department of Economics.
    17. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
    18. Brock, William A. & Hommes, Cars H. & Wagener, Florian O. O., 2005. "Evolutionary dynamics in markets with many trader types," Journal of Mathematical Economics, Elsevier, vol. 41(1-2), pages 7-42, February.
    19. Jeffrey S. Banks & Charles R. Plott & David P. Porter, 1988. "An Experimental Analysis of Unanimity in Public Goods Provision Mechanisms," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 55(2), pages 301-322.
    20. Evans, George W. & Honkapohja, Seppo & Marimon, Ramon, 2001. "Convergence In Monetary Inflation Models With Heterogeneous Learning Rules," Macroeconomic Dynamics, Cambridge University Press, vol. 5(1), pages 1-31, February.
    21. Ed Hopkins, 2002. "Two Competing Models of How People Learn in Games," Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
    22. Jonathan Bendor & Dilip Mookherjee & Debraj Ray, 2001. "Aspiration-Based Reinforcement Learning In Repeated Interaction Games: An Overview," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 3(02n03), pages 159-174.
    23. Brock, William A. & Hommes, Cars H., 1998. "Heterogeneous beliefs and routes to chaos in a simple asset pricing model," Journal of Economic Dynamics and Control, Elsevier, vol. 22(8-9), pages 1235-1274, August.
    24. Bouchez, Nicole & Friedman, Daniel, 2008. "Equilibrium Convergence in Normal Form Games," Handbook of Experimental Economics Results, in: Charles R. Plott & Vernon L. Smith (ed.), Handbook of Experimental Economics Results, edition 1, volume 1, chapter 53, pages 472-480, Elsevier.
    25. Samuelson, Larry, 2001. "Introduction to the Evolution of Preferences," Journal of Economic Theory, Elsevier, vol. 97(2), pages 225-230, April.
    26. Dixon, Huw David, 2000. "Keeping up with the Joneses: competition and the evolution of collusion," Journal of Economic Behavior & Organization, Elsevier, vol. 43(2), pages 223-238, October.
    27. Robson, Arthur J. & Vega-Redondo, Fernando, 1996. "Efficient Equilibrium Selection in Evolutionary Games with Random Matching," Journal of Economic Theory, Elsevier, vol. 70(1), pages 65-92, July.
    28. Leigh Tesfatsion & Kenneth L. Judd (ed.), 2006. "Handbook of Computational Economics," Handbook of Computational Economics, Elsevier, edition 1, volume 2, number 2.
    29. Kandori, Michihiro & Mailath, George J & Rob, Rafael, 1993. "Learning, Mutation, and Long Run Equilibria in Games," Econometrica, Econometric Society, vol. 61(1), pages 29-56, January.
    30. Juang, Wei-Torng, 2002. "Rule Evolution and Equilibrium Selection," Games and Economic Behavior, Elsevier, vol. 39(1), pages 71-90, April.
    31. Cheung, Yin-Wong & Friedman, Daniel, 1997. "Individual Learning in Normal Form Games: Some Laboratory Results," Games and Economic Behavior, Elsevier, vol. 19(1), pages 46-76, April.
    32. William A. Brock & Cars H. Hommes, 2001. "A Rational Route to Randomness," Chapters, in: W. D. Dechert (ed.), Growth Theory, Nonlinear Dynamics and Economic Modelling, chapter 16, pages 402-438, Edward Elgar Publishing.
    33. Bergin, James & Lipman, Barton L, 1996. "Evolution with State-Dependent Mutations," Econometrica, Econometric Society, vol. 64(4), pages 943-956, July.
    34. Fernando Vega-Redondo & Frédéric Palomino, 1999. "Convergence of aspirations and (partial) cooperation in the prisoner's dilemma," International Journal of Game Theory, Springer;Game Theory Society, vol. 28(4), pages 465-488.
    35. R. Isaac & David Schmidtz & James Walker, 1989. "The assurance problem in a laboratory market," Public Choice, Springer, vol. 62(3), pages 217-236, September.
    36. Roth, Alvin E & Schoumaker, Francoise, 1983. "Expectations and Reputations in Bargaining: An Experimental Study," American Economic Review, American Economic Association, vol. 73(3), pages 362-372, June.
    37. John B. Van Huyck & Raymond C. Battalio & Richard O. Beil, 1991. "Strategic Uncertainty, Equilibrium Selection, and Coordination Failure in Average Opinion Games," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 106(3), pages 885-910.
    38. Bendor Jonathan & Mookherjee Dilip & Ray Debraj, 2001. "Reinforcement Learning in Repeated Interaction Games," The B.E. Journal of Theoretical Economics, De Gruyter, vol. 1(1), pages 1-44, March.
    39. Friedman, Daniel, 1996. "Equilibrium in Evolutionary Games: Some Experimental Results," Economic Journal, Royal Economic Society, vol. 106(434), pages 1-25, January.
    40. Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
    41. Hommes, Cars H., 2006. "Heterogeneous Agent Models in Economics and Finance," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 23, pages 1109-1186, Elsevier.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    2. J. Van Huyck & R. Battalio & F. Rankin, 1996. "On the Evolution of Convention: Evidence from Coordination Games," Levine's Working Paper Archive 548, David K. Levine.
    3. repec:wyi:journl:002151 is not listed on IDEAS
    4. Hommes, Cars & Kiseleva, Tatiana & Kuznetsov, Yuri & Verbic, Miroslav, 2012. "Is More Memory In Evolutionary Selection (De)Stabilizing?," Macroeconomic Dynamics, Cambridge University Press, vol. 16(3), pages 335-357, June.
    5. Battalio,R. & Samuelson,L. & Huyck,J. van, 1998. "Risk dominance, payoff dominance and probabilistic choice learning," Working papers 2, Wisconsin Madison - Social Systems.
    6. Napel, Stefan, 2003. "Aspiration adaptation in the ultimatum minigame," Games and Economic Behavior, Elsevier, vol. 43(1), pages 86-106, April.
    7. Amilon, Henrik, 2008. "Estimation of an adaptive stock market model with heterogeneous agents," Journal of Empirical Finance, Elsevier, vol. 15(2), pages 342-362, March.
    8. Gerard van der Laan & A.F. Tieman, 1996. "Evolutionary Game Theory and the Modelling of Economic Behavior," Tinbergen Institute Discussion Papers 96-172/8, Tinbergen Institute.
    9. Hommes, Cars, 2011. "The heterogeneous expectations hypothesis: Some evidence from the lab," Journal of Economic Dynamics and Control, Elsevier, vol. 35(1), pages 1-24, January.
    10. Broseta, Bruno, 2000. "Adaptive Learning and Equilibrium Selection in Experimental Coordination Games: An ARCH(1) Approach," Games and Economic Behavior, Elsevier, vol. 32(1), pages 25-50, July.
    11. Hommes, Cars, 2018. "Behavioral & experimental macroeconomics and policy analysis: a complex systems approach," Working Paper Series 2201, European Central Bank.
    12. Izquierdo, Luis R. & Izquierdo, Segismundo S. & Gotts, Nicholas M. & Polhill, J. Gary, 2007. "Transient and asymptotic dynamics of reinforcement learning in games," Games and Economic Behavior, Elsevier, vol. 61(2), pages 259-276, November.
    13. Friedman, Daniel & Zhao, Shuchen, 2021. "When are mixed equilibria relevant?," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 51-65.
    14. Teck H Ho & Colin Camerer & Juin-Kuan Chong, 2003. "Functional EWA: A one-parameter theory of learning in games," Levine's Working Paper Archive 506439000000000514, David K. Levine.
    15. Friedman, Daniel & Abraham, Ralph, 2009. "Bubbles and crashes: Gradient dynamics in financial markets," Journal of Economic Dynamics and Control, Elsevier, vol. 33(4), pages 922-937, April.
    16. Masiliūnas, Aidas, 2019. "Overcoming inefficient lock-in in coordination games with sophisticated and myopic players," Mathematical Social Sciences, Elsevier, vol. 100(C), pages 1-12.
    17. Matros, Alexander, 2012. "Altruistic versus egoistic behavior in a Public Good game," Journal of Economic Dynamics and Control, Elsevier, vol. 36(4), pages 642-656.
    18. Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
    19. Erhao Xie, 2019. "Monetary Payoff and Utility Function in Adaptive Learning Models," Staff Working Papers 19-50, Bank of Canada.
    20. Waters, George A., 2009. "Chaos in the cobweb model with a new learning dynamic," Journal of Economic Dynamics and Control, Elsevier, vol. 33(6), pages 1201-1216, June.
    21. He, Xue-Zhong & Li, Youwei, 2015. "Testing of a market fraction model and power-law behaviour in the DAX 30," Journal of Empirical Finance, Elsevier, vol. 31(C), pages 1-17.

    More about this item

    Keywords

    ;
    ;
    ;
    ;

    JEL classification:

    • D01 - Microeconomics - - General - - - Microeconomic Behavior: Underlying Principles
    • D03 - Microeconomics - - General - - - Behavioral Microeconomics: Underlying Principles
    • D70 - Microeconomics - - Analysis of Collective Decision-Making - - - General

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:dyncon:v:36:y:2012:i:3:p:433-454. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jedc .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.