The reinforcement heuristic in normal form games

The reinforcement heuristic in normal form games

Author

Listed:

Alós-Ferrer, Carlos
Ritschel, Alexander

Abstract

We analyze simple reinforcement-based behavioral rules in 3 × 3 games through choice data and response times. We argue that there is a large overlap between reinforcement-based heuristics (win-stay, lose-shift) and the more “rational” behavioral rule of myopic best reply. However, evidence from response times shows that choices in agreement with the common prescription of those rules are comparatively fast, and choices of the form “lose-shift” occur more frequently for larger differences with bygone payoffs. Both observations speak in favor of reinforcement processes as a cognitive shortcut for apparent myopic best reply, and advise caution when interpreting behavioral results in favor of optimizing behavior.

Suggested Citation

Alós-Ferrer, Carlos & Ritschel, Alexander, 2018. "The reinforcement heuristic in normal form games," Journal of Economic Behavior & Organization, Elsevier, vol. 152(C), pages 224-234.

Handle: RePEc:eee:jeborg:v:152:y:2018:i:c:p:224-234
DOI: 10.1016/j.jebo.2018.06.014

Download full text from publisher

As the access to this document is restricted, you may want to

for a different version of it.

References listed on IDEAS

Alos-Ferrer, Carlos & Weidenholzer, Simon, 2008. "Erratum to "Partial bandwagon effects and local interactions" [Games Econ. Behav. 61 (2007) 179-197]," Games and Economic Behavior, Elsevier, vol. 62(1), pages 324-325, January.
- Alos-Ferrer, Carlos & Weidenholzer, Simon, 2007. "Partial bandwagon effects and local interactions," Games and Economic Behavior, Elsevier, vol. 61(2), pages 179-197, November.
Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
Laslier, Jean-Francois & Topol, Richard & Walliser, Bernard, 2001. "A Behavioral Learning Process in Games," Games and Economic Behavior, Elsevier, vol. 37(2), pages 340-366, November.
- J.-F. Laslier & R. Topol & B. Walliser, 1999. "A behavioral learning process in games," Thema Working Papers 99-03, THEMA (ThÃ©orie Economique, ModÃ©lisation et Applications), CY Cergy-Paris University, ESSEC and CNRS.
- Laslier, J.-F. & Topol, R. & Walliser, B., 1999. "A Behavioral Learning Process in Games," Papers 99-03, Paris X - Nanterre, U.F.R. de Sc. Ec. Gest. Maths Infor..
Lang, Frieder R. & John, Dennis & Lüdtke, Oliver & Schupp, Jürgen & Wagner, Gert G., 2011. "Short Assessment of the Big Five: Robust Across Survey Methods Except Telephone Interviewing," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 43(2), pages 548-567.
Ben Greiner, 2015. "Subject pool recruitment procedures: organizing experiments with ORSEE," Journal of the Economic Science Association, Springer;Economic Science Association, vol. 1(1), pages 114-125, July.
Kandori Michihiro & Rob Rafael, 1995. "Evolution of Equilibria in the Long Run: A General Theory and Applications," Journal of Economic Theory, Elsevier, vol. 65(2), pages 383-414, April.
- M. Kandori & R. Rob, 2010. "Evolution of Equilibria in the Long Run: A General Theory and Applications," Levine's Working Paper Archive 502, David K. Levine.
A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2008. "Bootstrap-Based Improvements for Inference with Clustered Errors," The Review of Economics and Statistics, MIT Press, vol. 90(3), pages 414-427, August.
- Jonah B. Gelbach & Doug Miller & A. Colin Cameron, 2006. "Bootstrap-Based Improvements for Inference with Clustered Errors," Working Papers 128, University of California, Davis, Department of Economics.
- A. Colin Cameron & Jonah B. Gelbach & Douglas L. Miller, 2007. "Bootstrap-Based Improvements for Inference with Clustered Errors," NBER Technical Working Papers 0344, National Bureau of Economic Research, Inc.
Alós-Ferrer, Carlos & Strack, Fritz, 2014. "From dual processes to multiple selves: Implications for economic behavior," Journal of Economic Psychology, Elsevier, vol. 41(C), pages 1-11.
Charness, Gary & Gneezy, Uri & Halladay, Brianna, 2016. "Experimental methods: Pay one or pay all," Journal of Economic Behavior & Organization, Elsevier, vol. 131(PA), pages 141-150.
Alós-Ferrer, Carlos & Hügelschäfer, Sabine & Li, Jiahui, 2017. "Framing effects and the reinforcement heuristic," Economics Letters, Elsevier, vol. 156(C), pages 32-35.
Daniel Kahneman, 2003. "Maps of Bounded Rationality: Psychology for Behavioral Economics," American Economic Review, American Economic Association, vol. 93(5), pages 1449-1475, December.
Robin L. Dillon & Catherine H. Tinsley, 2008. "How Near-Misses Influence Decision Making Under Risk: A Missed Opportunity for Learning," Management Science, INFORMS, vol. 54(8), pages 1425-1440, August.
McKelvey Richard D. & Palfrey Thomas R., 1995. "Quantal Response Equilibria for Normal Form Games," Games and Economic Behavior, Elsevier, vol. 10(1), pages 6-38, July.
- McKelvey, Richard D. & Palfrey, Thomas R., 1994. "Quantal Response Equilibria For Normal Form Games," Working Papers 883, California Institute of Technology, Division of the Humanities and Social Sciences.
- R. McKelvey & T. Palfrey, 2010. "Quantal Response Equilibria for Normal Form Games," Levine's Working Paper Archive 510, David K. Levine.
Anja Achtziger & Carlos Alós-Ferrer, 2014. "Fast or Rational? A Response-Times Study of Bayesian Updating," Management Science, INFORMS, vol. 60(4), pages 923-938, April.
Borgers, Tilman & Sarin, Rajiv, 1997. "Learning Through Reinforcement and Replicator Dynamics," Journal of Economic Theory, Elsevier, vol. 77(1), pages 1-14, November.
- Tilman Börgers & Rajiv Sarin, "undated". "Learning Through Reinforcement and Replicator Dynamics," ELSE working papers 051, ESRC Centre on Economics Learning and Social Evolution.
- T. Borgers & R. Sarin, 2010. "Learning Through Reinforcement and Replicator Dynamics," Levine's Working Paper Archive 380, David K. Levine.
Yaron Azrieli & Christopher P. Chambers & Paul J. Healy, 2018. "Incentives in Experiments: A Theoretical Analysis," Journal of Political Economy, University of Chicago Press, vol. 126(4), pages 1472-1503.
- Paul J. Healy & Yaron Azrieli & Christopher P. Chambers, 2016. "Incentives in Experiments: A Theoretical Analysis," Working Papers 16-03, Ohio State University, Department of Economics.
Alós-Ferrer, Carlos & Weidenholzer, Simon, 2008. "Contagion and efficiency," Journal of Economic Theory, Elsevier, vol. 143(1), pages 251-274, November.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Fernando Vega-Redondo, 1997. "The Evolution of Walrasian Behavior," Econometrica, Econometric Society, vol. 65(2), pages 375-384, March.
- Fernando Vega Redondo, 1996. "The evolution of walrasian behavior," Working Papers. Serie AD 1996-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Gary Charness & Dan Levin, 2005. "When Optimal Choices Feel Wrong: A Laboratory Study of Bayesian Updating, Complexity, and Affect," American Economic Review, American Economic Association, vol. 95(4), pages 1300-1309, September.
- Charness, Gary & Levin, Dan, 2003. "When Optimal Choices Feel Wrong: A Laboratory Study of Bayesian Updating, Complexity, and Affect," University of California at Santa Barbara, Economics Working Paper Series qt7g63k28w, Department of Economics, UC Santa Barbara.
Urs Fischbacher, 2007. "z-Tree: Zurich toolbox for ready-made economic experiments," Experimental Economics, Springer;Economic Science Association, vol. 10(2), pages 171-178, June.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

Francesco Fallucchi & Jan Niederreiter & Massimo Riccaboni, 2021. "Learning and dropout in contests: an experimental approach," Theory and Decision, Springer, vol. 90(2), pages 245-278, March.
Arkady Konovalov & Ian Krajbich, 2019. "Revealed strength of preference: Inference from response times," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 14(4), pages 381-394, July.
Sawa, Ryoji, 2021. "A prospect theory Nash bargaining solution and its stochastic stability," Journal of Economic Behavior & Organization, Elsevier, vol. 184(C), pages 692-711.
Carlos Alós-Ferrer & Ernst Fehr & Nick Netzer, 2021. "Time Will Tell: Recovering Preferences When Choices Are Noisy," Journal of Political Economy, University of Chicago Press, vol. 129(6), pages 1828-1877.
- Alós-Ferrer, Carlos & Fehr, Ernst & Netzer, Nick, 2021. "Time Will Tell: Recovering Preferences When Choices Are Noisy," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 129(6), pages 1828-1877.
- Carlos Alós-Ferrer & Ernst Fehr & Nick Netzer, 2018. "Time will tell: recovering preferences when choices are noisy," ECON - Working Papers 306, Department of Economics - University of Zurich, revised Jun 2020.
- Carlos Alos-Ferrer & Ernst Fehr & Nick Netzer, 2018. "Time will tell - Recovering Preferences when Choices are Noisy," Papers 1811.02497, arXiv.org.
- Carlos Alós-Ferrer & Ernst Fehr & Nick Netzer, 2018. "Time Will Tell: Recovering Preferences when Choices Are Noisy," CESifo Working Paper Series 7333, CESifo.
- Alós-Ferrer, Carlos & Fehr, Ernst & Netzer, Nick, 2018. "Time Will Tell: Recovering Preferences When Choices Are Noisy," IZA Discussion Papers 11918, IZA Network @ LISER.
Carlos Alós-Ferrer & Jaume García-Segarra & Alexander Ritschel, 2018. "The Big Robber Game," ECON - Working Papers 291, Department of Economics - University of Zurich.
Alós-Ferrer, Carlos & Ritschel, Alexander, 2021. "Multiple behavioral rules in Cournot oligopolies," Journal of Economic Behavior & Organization, Elsevier, vol. 183(C), pages 250-267.
- Carlos Alós-Ferrer & Alexander Ritschel, 2019. "Multiple behavioral rules in Cournot oligopolies," ECON - Working Papers 331, Department of Economics - University of Zurich, revised Jul 2020.
Ayşegül Engin, 2021. "The cognitive ability and working memory framework: Interpreting cognitive reflection test results in the domain of the cognitive experiential theory," Central European Journal of Operations Research, Springer;Slovak Society for Operations Research;Hungarian Operational Research Society;Czech Society for Operations Research;Österr. Gesellschaft für Operations Research (ÖGOR);Slovenian Society Informatika - Section for Operational Research;Croatian Operational Research Society, vol. 29(1), pages 227-245, March.
Carlos Alós-Ferrer & Michele Garagnani, 2022. "Strength of preference and decisions under risk," Journal of Risk and Uncertainty, Springer, vol. 64(3), pages 309-329, June.
- Carlos Alós-Ferrer & Michele Garagnani, 2019. "Strength of preference and decisions under risk," ECON - Working Papers 330, Department of Economics - University of Zurich, revised Feb 2022.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Alós-Ferrer, Carlos & Ritschel, Alexander, 2021. "Multiple behavioral rules in Cournot oligopolies," Journal of Economic Behavior & Organization, Elsevier, vol. 183(C), pages 250-267.
- Carlos Alós-Ferrer & Alexander Ritschel, 2019. "Multiple behavioral rules in Cournot oligopolies," ECON - Working Papers 331, Department of Economics - University of Zurich, revised Jul 2020.
Anja Achtziger & Carlos Alós-Ferrer & Alexander Ritschel, 2020. "Cognitive load in economic decisions," ECON - Working Papers 354, Department of Economics - University of Zurich.
Jaromír Kovářík & Friederike Mengel & José Gabriel Romero, 2018. "Learning in network games," Quantitative Economics, Econometric Society, vol. 9(1), pages 85-139, March.
- Kovarik, Jaromir & Mengel, Friederike & Romero, José Gabriel, 2012. "Learning in Network Games," IKERLANAK http://www-fae1-eao1-ehu-, Universidad del País Vasco - Departamento de Fundamentos del Análisis Económico I.
Andreas Nicklisch, 2011. "Learning strategic environments: an experimental study of strategy formation and transfer," Theory and Decision, Springer, vol. 71(4), pages 539-558, October.
Cominetti, Roberto & Melo, Emerson & Sorin, Sylvain, 2010. "A payoff-based learning procedure and its application to traffic games," Games and Economic Behavior, Elsevier, vol. 70(1), pages 71-83, September.
Mohlin, Erik & Östling, Robert & Wang, Joseph Tao-yi, 2020. "Learning by similarity-weighted imitation in winner-takes-all games," Games and Economic Behavior, Elsevier, vol. 120(C), pages 225-245.
Beggs, A.W., 2005. "On the convergence of reinforcement learning," Journal of Economic Theory, Elsevier, vol. 122(1), pages 1-36, May.
- Alan Beggs, 2002. "On the Convergence of Reinforcement Learning," Economics Series Working Papers 96, University of Oxford, Department of Economics.
Ge Jiang & Simon Weidenholzer, 2017. "Local interactions under switching costs," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 64(3), pages 571-588, October.
Brown, Alexander L. & Viriyavipart, Ajalavat & Wang, Xiaoyuan, 2018. "Search deterrence in experimental consumer goods markets," European Economic Review, Elsevier, vol. 104(C), pages 167-184.
Wolf Ze'ev Ehrblatt & Kyle Hyndman & Erkut Y. ÄOzbay & Andrew Schotter, 2006. "Convergence: An Experimental Study," Levine's Working Paper Archive 122247000000001148, David K. Levine.
Jehiel, Philippe & Singh, Juni, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Games and Economic Behavior, Elsevier, vol. 130(C), pages 1-24.
- Philippe Jehiel & Juni Singh, 2019. "Multi-state choices with aggregate feedback on unfamiliar alternatives," PSE Working Papers halshs-02183444, HAL.
- Philippe Jehiel & Juni Singh, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Post-Print halshs-03672197, HAL.
- Philippe Jehiel & Juni Singh, 2021. "Multi-state choices with aggregate feedback on unfamiliar alternatives," PSE-Ecole d'économie de Paris (Postprint) halshs-03672197, HAL.
- Philippe Jehiel & Juni Singh, 2019. "Multi-state choices with aggregate feedback on unfamiliar alternatives," Working Papers halshs-02183444, HAL.
Barrafrem, Kinga & Hausfeld, Jan, 2020. "Tracing risky decisions for oneself and others: The role of intuition and deliberation," Journal of Economic Psychology, Elsevier, vol. 77(C).
Ed Hopkins, 2002. "Two Competing Models of How People Learn in Games," Econometrica, Econometric Society, vol. 70(6), pages 2141-2166, November.
- Ed Hopkins, 1999. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 42, Edinburgh School of Economics, University of Edinburgh, revised Dec 2000.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," NajEcon Working Paper Reviews 625018000000000226, www.najecon.org.
- Ed Hopkins, 2000. "Two Competing Models of How People Learn in Games," Edinburgh School of Economics Discussion Paper Series 51, Edinburgh School of Economics, University of Edinburgh, revised Dec 2000.
- Ed Hopkins, 2001. "Two Competing Models of How People Learn in Games," Levine's Working Paper Archive 625018000000000226, David K. Levine.
Mengel, Friederike, 2012. "Learning across games," Games and Economic Behavior, Elsevier, vol. 74(2), pages 601-619.
- Friederike Mengel, 2007. "Learning Across Games," Working Papers. Serie AD 2007-05, Instituto Valenciano de Investigaciones Económicas, S.A. (Ivie).
Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
Oyarzun, Carlos & Sarin, Rajiv, 2013. "Learning and risk aversion," Journal of Economic Theory, Elsevier, vol. 148(1), pages 196-225.
- Carlos Oyarzun & Rajiv Sarin, 2005. "Learning and Risk Aversion," Levine's Bibliography 784828000000000482, UCLA Department of Economics.
- Carlos Oyarzun & Rajiv Sarin, 2012. "Learning and Risk Aversion," Levine's Working Paper Archive 786969000000000572, David K. Levine.
Simon Weidenholzer, 2010. "Coordination Games and Local Interactions: A Survey of the Game Theoretic Literature," Games, MDPI, vol. 1(4), pages 1-35, November.
Apesteguia, Jose & Huck, Steffen & Oechssler, Jorg, 2007. "Imitation--theory and experimental evidence," Journal of Economic Theory, Elsevier, vol. 136(1), pages 217-235, September.
- Jose Apesteguia & Steffen Huck & Jorg Oechssler, 2003. "Imitation - Theory and Experimental Evidence," Experimental 0309001, University Library of Munich, Germany.
- José Apesteguía & Steffen Huck & Jorg Oechssler, 2003. "Imitation-Theory and Experimental Evidence-," Documentos de Trabajo - Lan Gaiak Departamento de Economía - Universidad Pública de Navarra 0306, Departamento de Economía - Universidad Pública de Navarra.
- Apesteguia, José & Huck, Steffen & Oechssler, Jörg, 2003. "Imitation - Theory and Experimental Evidence," Bonn Econ Discussion Papers 20/2003, University of Bonn, Bonn Graduate School of Economics (BGSE).
- Apesteguia, Jose & Huck, Steffen & Oechssler, Joerg, 2003. "Imitation - Theory and Experimental Evidence," University of California at Santa Barbara, Economics Working Paper Series qt3h0887tj, Department of Economics, UC Santa Barbara.
- Apestgeguia, Jose & Huck, Steffen & Oechssler, Jörg, 2005. "Imitation - Theory and Experimental Evidence," Discussion Paper Series of SFB/TR 15 Governance and the Efficiency of Economic Systems 54, Free University of Berlin, Humboldt University of Berlin, University of Bonn, University of Mannheim, University of Munich.
- Jose Alpesteguia & Steffen Huck & Jörg Oechssler, 2003. "Imitation - Theory and Experimental Evidence," CESifo Working Paper Series 1049, CESifo.
- Jose Apesteguia & Steffen Huck & Jorg Oechssler, 2004. "Imitation - Theory and Experimental Evidence," Levine's Bibliography 122247000000000132, UCLA Department of Economics.
Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed, 2010. "Testing the TASP: An experimental investigation of learning in games with unstable equilibria," Journal of Economic Theory, Elsevier, vol. 145(6), pages 2309-2331, November.
- Cason, Timothy N. & Friedman, Daniel & Hopkins, Ed H, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Santa Cruz Department of Economics, Working Paper Series qt8kp6c049, Department of Economics, UC Santa Cruz.
- Timothy N. Cason & Daniel Friedman & Ed Hopkins, 2010. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Purdue University Economics Working Papers 1233, Purdue University, Department of Economics.
- Cason, Timothy N. & Friedman, Daniel UC & Hopkins, Ed, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," SIRE Discussion Papers 2009-15, Scottish Institute for Research in Economics (SIRE).
- Timothy N. Cason & Daniel Friedman & Ed Hopkins, 2009. "Testing the TASP: An Experimental Investigation of Learning in Games with Unstable Equilibria," Edinburgh School of Economics Discussion Paper Series 188, Edinburgh School of Economics, University of Edinburgh.
Jakub Bielawski & Thiparat Chotibut & Fryderyk Falniowski & Michal Misiurewicz & Georgios Piliouras, 2022. "Unpredictable dynamics in congestion games: memory loss can prevent chaos," Papers 2201.10992, arXiv.org, revised Jan 2022.

More about this item

Keywords

; ; ; ;

JEL classification:

C72 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory - - - Noncooperative Games
C91 - Mathematical and Quantitative Methods - - Design of Experiments - - - Laboratory, Individual Behavior

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jeborg:v:152:y:2018:i:c:p:224-234. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jebo .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

The reinforcement heuristic in normal form games

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

JEL classification:

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data