IDEAS home Printed from https://ideas.repec.org/p/wpa/wuwpex/0310003.html
   My bibliography  Save this paper

Learning about Learning in Games through Experimental Control of Strategic Interdependence

Author

Listed:
  • Jason Shachat

    (National University of Singapore)

  • J. Todd Swarthout

    (University of Arizona)

Abstract

We conduct experiments in which humans repeatedly play one of two games against a computer decision maker that follows either Roth and Erev's reinforcement learning algorithm or Camerer and Ho's EWA algorithm. The human/algorithm interaction provides results that can't be obtained from the analysis of pure human interactions or model simulations. The learning algorithms are more sensitive than humans in calculating exploitable opponent play. Learning algorithms respond to these calculated opportunities systematically; however, the magnitude of these responses are too weak to improve the algorithm's payoffs. Human play against various decision maker types does not significantly vary. These results demonstrate that humans and currently proposed models of their behavior differ in that humans do not adjust payoff assessments by smooth transition functions and that when humans detect exploitable play they are more likely to choose the best response to this belief.

Suggested Citation

  • Jason Shachat & J. Todd Swarthout, 2003. "Learning about Learning in Games through Experimental Control of Strategic Interdependence," Experimental 0310003, University Library of Munich, Germany.
  • Handle: RePEc:wpa:wuwpex:0310003
    Note: Type of Document - pdf; pages: 39
    as

    Download full text from publisher

    File URL: https://econwpa.ub.uni-muenchen.de/econ-wp/exp/papers/0310/0310003.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Cooper, Russell & DeJong, Douglas V. & Forsythe, Robert & Ross, Thomas W., 1996. "Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games," Games and Economic Behavior, Elsevier, vol. 12(2), pages 187-218, February.
    2. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    3. Colin Camerer & Teck Ho & Kuan Chong, 2003. "Models of Thinking, Learning, and Teaching in Games," American Economic Review, American Economic Association, vol. 93(2), pages 192-195, May.
    4. Jordan J. S., 1993. "Three Problems in Learning Mixed-Strategy Nash Equilibria," Games and Economic Behavior, Elsevier, vol. 5(3), pages 368-386, July.
    5. Fehr, Ernst & Tyran, Jean-Robert, 2007. "Money illusion and coordination failure," Games and Economic Behavior, Elsevier, vol. 58(2), pages 246-268, February.
    6. Mookherjee Dilip & Sopher Barry, 1994. "Learning Behavior in an Experimental Matching Pennies Game," Games and Economic Behavior, Elsevier, vol. 7(1), pages 62-91, July.
    7. Andreoni, James A & Miller, John H, 1993. "Rational Cooperation in the Finitely Repeated Prisoner's Dilemma: Experimental Evidence," Economic Journal, Royal Economic Society, vol. 103(418), pages 570-585, May.
    8. AJ A. Bostian & Charles A. Holt & Angela M. Smith, 2008. "Newsvendor "Pull-to-Center" Effect: Adaptive Learning in a Laboratory Experiment," Manufacturing & Service Operations Management, INFORMS, vol. 10(4), pages 590-608, July.
    9. Fudenberg, Drew & Levine, David K., 1995. "Consistency and cautious fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 1065-1089.
    10. Spiliopoulos, Leonidas, 2008. "Humans versus computer algorithms in repeated mixed strategy games," MPRA Paper 6672, University Library of Munich, Germany.
    11. McKelvey Richard D. & Palfrey Thomas R., 1995. "Quantal Response Equilibria for Normal Form Games," Games and Economic Behavior, Elsevier, vol. 10(1), pages 6-38, July.
    12. Mookherjee, Dilip & Sopher, Barry, 1997. "Learning and Decision Costs in Experimental Constant Sum Games," Games and Economic Behavior, Elsevier, vol. 19(1), pages 97-132, April.
    13. Teck H. Ho & Xin Wang & Colin F. Camerer, 2008. "Individual Differences in EWA Learning with Partial Payoff Information," Economic Journal, Royal Economic Society, vol. 118(525), pages 37-59, January.
    14. James J. Choi & David Laibson & Brigitte C. Madrian & Andrew Metrick, 2009. "Reinforcement Learning and Savings Behavior," Journal of Finance, American Finance Association, vol. 64(6), pages 2515-2534, December.
    15. Hommes, Cars, 2011. "The heterogeneous expectations hypothesis: Some evidence from the lab," Journal of Economic Dynamics and Control, Elsevier, vol. 35(1), pages 1-24, January.
    16. Jason Shachat & J. Todd Swarthout & Lijia Wei, 2011. "Man versus Nash An experiment on the self-enforcing nature of mixed strategy equilibrium," Working Papers 1101, Xiamen Unversity, The Wang Yanan Institute for Studies in Economics, Finance and Economics Experimental Laboratory, revised 21 Feb 2011.
    17. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
    18. Gary E. Bolton & Elena Katok, 2008. "Learning by Doing in the Newsvendor Problem: A Laboratory Investigation of the Role of Experience and Feedback," Manufacturing & Service Operations Management, INFORMS, vol. 10(3), pages 519-538, September.
    19. Gjerstad, Steven, 1996. "The Rate of Convergence of Continuous Fictitious Play," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 7(1), pages 161-177, January.
    20. Ho, Teck-Hua & Camerer, Colin & Weigelt, Keith, 1998. "Iterated Dominance and Iterated Best Response in Experimental "p-Beauty Contests."," American Economic Review, American Economic Association, vol. 88(4), pages 947-969, September.
    21. Reinhard Selten & Thorsten Chmura, 2008. "Stationary Concepts for Experimental 2x2-Games," American Economic Review, American Economic Association, vol. 98(3), pages 938-966, June.
    22. Yan Chen & Fang-Fang Tang, 1998. "Learning and Incentive-Compatible Mechanisms for Public Goods Provision: An Experimental Study," Journal of Political Economy, University of Chicago Press, vol. 106(3), pages 633-662, June.
    23. Jason Shachat & J. Todd Swarthout, 2004. "Do we detect and exploit mixed strategy play by opponents?," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 59(3), pages 359-373, July.
    24. Ulrike Malmendier & Stefan Nagel, 2011. "Depression Babies: Do Macroeconomic Experiences Affect Risk Taking?," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 126(1), pages 373-416.
    25. Robert Slonim & Alvin E. Roth, 1998. "Learning in High Stakes Ultimatum Games: An Experiment in the Slovak Republic," Econometrica, Econometric Society, vol. 66(3), pages 569-596, May.
    26. Brit Grosskopf, 2003. "Reinforcement and Directional Learning in the Ultimatum Game with Responder Competition," Experimental Economics, Springer;Economic Science Association, vol. 6(2), pages 141-158, October.
    27. Walker, James M. & Smith, Vernon L. & Cox, James C., 1987. "Bidding behavior in first price sealed bid auctions : Use of computerized Nash competitors," Economics Letters, Elsevier, vol. 23(3), pages 239-244.
    28. Robert W. Rosenthal & Jason Shachat & Mark Walker, 2003. "Hide and seek in Arizona," International Journal of Game Theory, Springer;Game Theory Society, vol. 32(2), pages 273-293, December.
    29. Morgan, John & Sefton, Martin, 2002. "An Experimental Investigation of Unprofitable Games," Games and Economic Behavior, Elsevier, vol. 40(1), pages 123-146, July.
    30. Eckel, Catherine C. & Grossman, Philip J., 1996. "Altruism in Anonymous Dictator Games," Games and Economic Behavior, Elsevier, vol. 16(2), pages 181-191, October.
    31. McCabe, Kevin & Houser, Daniel & Ryan, Lee & Smith, Vernon & Trouard, Ted, 2001. "A Functional Imaging Study of Cooperation in Two-Person reciprocal Exchange," MPRA Paper 5172, University Library of Munich, Germany.
    32. James J. Choi & David Laibson & Brigitte C. Madrian & Andrew Metrick, 2009. "Reinforcement Learning and Savings Behavior," Journal of Finance, American Finance Association, vol. 64(6), pages 2515-2534, December.
    33. Timothy C. Salmon, 2001. "An Evaluation of Econometric Models of Adaptive Learning," Econometrica, Econometric Society, vol. 69(6), pages 1597-1628, November.
    34. Heemeijer, Peter & Hommes, Cars & Sonnemans, Joep & Tuinstra, Jan, 2009. "Price stability and volatility in markets with positive and negative expectations feedback: An experimental investigation," Journal of Economic Dynamics and Control, Elsevier, vol. 33(5), pages 1052-1072, May.
    35. Yaw Nyarko & Andrew Schotter, 2002. "An Experimental Study of Belief Learning Using Elicited Beliefs," Econometrica, Econometric Society, vol. 70(3), pages 971-1005, May.
    36. Andreoni James & Miller John H., 1995. "Auctions with Artificial Adaptive Agents," Games and Economic Behavior, Elsevier, vol. 10(1), pages 39-64, July.
    37. Daniel Houser & Robert Kurzban, 2002. "Revisiting Kindness and Confusion in Public Goods Experiments," American Economic Review, American Economic Association, vol. 92(4), pages 1062-1069, September.
    38. Sonsino, Doron & Sirota, Julia, 2003. "Strategic pattern recognition--experimental evidence," Games and Economic Behavior, Elsevier, vol. 44(2), pages 390-411, August.
    39. Duffy, John, 2001. "Learning to speculate: Experiments with artificial and real agents," Journal of Economic Dynamics and Control, Elsevier, vol. 25(3-4), pages 295-319, March.
    40. Dürsch, Peter & Kolb, Albert & Oechssler, Jörg & Schipper, Burkhard, 2005. "Rage against the machines : how subjects learn to play against computers," Papers 05-36, Sonderforschungsbreich 504.
    41. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    42. Cheung, Yin-Wong & Friedman, Daniel, 1997. "Individual Learning in Normal Form Games: Some Laboratory Results," Games and Economic Behavior, Elsevier, vol. 19(1), pages 46-76, April.
    43. Pedro Dal Bo & Guillaume R. Frochette, 2011. "The Evolution of Cooperation in Infinitely Repeated Games: Experimental Evidence," American Economic Review, American Economic Association, vol. 101(1), pages 411-429, February.
    44. Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
    45. Markose, Sheri & Arifovic, Jasmina & Sunder, Shyam, 2007. "Advances in experimental and agent-based modelling: Asset markets, economic networks, computational mechanism design and evolutionary game dynamics," Journal of Economic Dynamics and Control, Elsevier, vol. 31(6), pages 1801-1807, June.
    46. Eyal Winter & Shmuel Zamir, 2005. "An Experiment With Ultimatum Bargaining In A Changing Environment," The Japanese Economic Review, Japanese Economic Association, vol. 56(3), pages 363-385, September.
    47. Roth, Alvin E & Schoumaker, Francoise, 1983. "Expectations and Reputations in Bargaining: An Experimental Study," American Economic Review, American Economic Association, vol. 73(3), pages 362-372, June.
    48. Juanjuan Zhang, 2010. "The Sound of Silence: Observational Learning in the U.S. Kidney Market," Marketing Science, INFORMS, vol. 29(2), pages 315-335, 03-04.
    49. Bruno Contini & Roberto Leombruni & Matteo Richiardi, 2006. "Exploring a New ExpAce: The Complementarities between Experimental Economics and Agent-based Computational Economics," LABORatorio R. Revelli Working Papers Series 45, LABORatorio R. Revelli, Centre for Employment Studies.
    50. Shachat, Jason M., 2002. "Mixed Strategy Play and the Minimax Hypothesis," Journal of Economic Theory, Elsevier, vol. 104(1), pages 189-226, May.
    51. Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
    52. Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
    53. TeckH. Ho & Xin Wang & ColinF. Camerer, 2008. "Individual Differences in EWA Learning with Partial Payoff Information," Economic Journal, Royal Economic Society, vol. 118(525), pages 37-59, January.
    54. Arijit Mukherji & David E. Runkle, 2000. "Learning to be unpredictable : an experimental study," Quarterly Review, Federal Reserve Bank of Minneapolis, vol. 24(Spr), pages 14-20.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Dürsch, Peter & Kolb, Albert & Oechssler, Jörg & Schipper, Burkhard C., 2005. "Rage Against the Machines: How Subjects Learn to Play Against Computers," Bonn Econ Discussion Papers 31/2005, University of Bonn, Bonn Graduate School of Economics (BGSE).
    2. Sean Duffy & J. J. Naddeo & David Owens & John Smith, 2024. "Cognitive Load and Mixed Strategies: On Brains and Minimax," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 26(03), pages 1-34, September.
    3. Dürsch, Peter & Kolb, Albert & Oechssler, Jörg & Schipper, Burkhard, 2005. "Rage against the machines : how subjects learn to play against computers," Papers 05-36, Sonderforschungsbreich 504.
    4. repec:wyi:journl:002151 is not listed on IDEAS
    5. Spiliopoulos, Leonidas, 2008. "Humans versus computer algorithms in repeated mixed strategy games," MPRA Paper 6672, University Library of Munich, Germany.
    6. Feng, Jun & Qin, Xiangdong & Wang, Xiaoyuan, 2021. "A Bayesian cognitive hierarchy model with fixed reasoning levels," Journal of Economic Behavior & Organization, Elsevier, vol. 192(C), pages 704-723.
    7. March, Christoph, 2021. "Strategic interactions between humans and artificial intelligence: Lessons from experiments with computer players," Journal of Economic Psychology, Elsevier, vol. 87(C).
    8. Frederic Moisan & Cleotilde Gonzalez, 2017. "Security under Uncertainty : Adaptive Attackers Are More Challenging to Human Defenders than Random Attackers," Post-Print hal-03188217, HAL.
    9. Jason Shachat & J. Todd Swarthout & Lijia Wei, 2011. "Man versus Nash An experiment on the self-enforcing nature of mixed strategy equilibrium," Working Papers 1101, Xiamen Unversity, The Wang Yanan Institute for Studies in Economics, Finance and Economics Experimental Laboratory, revised 21 Feb 2011.
    10. Peter Duersch & Albert Kolb & Jörg Oechssler & Burkhard Schipper, 2010. "Rage against the machines: how subjects play against learning algorithms," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 43(3), pages 407-430, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. repec:wyi:journl:002151 is not listed on IDEAS
    2. Jason Shachat & J. Todd Swarthout & Lijia Wei, 2011. "Man versus Nash An experiment on the self-enforcing nature of mixed strategy equilibrium," Working Papers 1101, Xiamen Unversity, The Wang Yanan Institute for Studies in Economics, Finance and Economics Experimental Laboratory, revised 21 Feb 2011.
    3. Camerer, Colin F. & Ho, Teck-Hua, 2015. "Behavioral Game Theory Experiments and Modeling," Handbook of Game Theory with Economic Applications,, Elsevier.
    4. Erhao Xie, 2019. "Monetary Payoff and Utility Function in Adaptive Learning Models," Staff Working Papers 19-50, Bank of Canada.
    5. Xie, Erhao, 2021. "Empirical properties and identification of adaptive learning models in behavioral game theory," Journal of Economic Behavior & Organization, Elsevier, vol. 191(C), pages 798-821.
    6. Asim Ansari & Ricardo Montoya & Oded Netzer, 2012. "Dynamic learning in behavioral games: A hidden Markov mixture of experts approach," Quantitative Marketing and Economics (QME), Springer, vol. 10(4), pages 475-503, December.
    7. Wu, Hang & Bayer, Ralph-C, 2015. "Learning from inferred foregone payoffs," Journal of Economic Dynamics and Control, Elsevier, vol. 51(C), pages 445-458.
    8. Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
    9. March, Christoph, 2021. "Strategic interactions between humans and artificial intelligence: Lessons from experiments with computer players," Journal of Economic Psychology, Elsevier, vol. 87(C).
    10. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
    11. Chen, Shu-Heng, 2012. "Varieties of agents in agent-based computational economics: A historical and an interdisciplinary perspective," Journal of Economic Dynamics and Control, Elsevier, vol. 36(1), pages 1-25.
    12. Wen, Yuanji, 2018. "Voluntary information acquisition in an asymmetric-Information game:comparing learning theories in the laboratory," Journal of Economic Behavior & Organization, Elsevier, vol. 150(C), pages 202-219.
    13. Teck H. Ho & Xin Wang & Colin F. Camerer, 2008. "Individual Differences in EWA Learning with Partial Payoff Information," Economic Journal, Royal Economic Society, vol. 118(525), pages 37-59, January.
    14. Teck H Ho & Colin Camerer & Juin-Kuan Chong, 2003. "Functional EWA: A one-parameter theory of learning in games," Levine's Working Paper Archive 506439000000000514, David K. Levine.
    15. Terracol, Antoine & Vaksmann, Jonathan, 2009. "Dumbing down rational players: Learning and teaching in an experimental game," Journal of Economic Behavior & Organization, Elsevier, vol. 70(1-2), pages 54-71, May.
    16. Osili, Una Okonkwo & Paulson, Anna, 2014. "Crises and confidence: Systemic banking crises and depositor behavior," Journal of Financial Economics, Elsevier, vol. 111(3), pages 646-660.
    17. Spiliopoulos, Leonidas, 2012. "Pattern recognition and subjective belief learning in a repeated constant-sum game," Games and Economic Behavior, Elsevier, vol. 75(2), pages 921-935.
    18. Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
    19. Ioannou, Christos A. & Romero, Julian, 2014. "A generalized approach to belief learning in repeated games," Games and Economic Behavior, Elsevier, vol. 87(C), pages 178-203.
    20. Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
    21. Chmura, Thorsten & Goerg, Sebastian J. & Selten, Reinhard, 2012. "Learning in experimental 2×2 games," Games and Economic Behavior, Elsevier, vol. 76(1), pages 44-73.

    More about this item

    JEL classification:

    • C7 - Mathematical and Quantitative Methods - - Game Theory and Bargaining Theory

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wpa:wuwpex:0310003. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: EconWPA (email available below). General contact details of provider: https://econwpa.ub.uni-muenchen.de .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.