IDEAS home Printed from https://ideas.repec.org/
MyIDEAS: Log in (now much improved!) to save this paper

Learning about Learning in Games through Experimental Control of Strategic Interdependence

Listed author(s):
  • Jason Shachat
  • J. Todd Swarthout

We conduct experiments in which humans repeatedly play one of two games against a computer decision maker that follows either a reinforcement learning or an Experience Weighted Attraction algorithm. Our experiments show these learning algorithms more sensitively detect exploitable opportunities than humans. Also, learning algorithms respond to detected payoff increasing opportunities systematically; however, the responses are too weak to improve the algorithms payoffs. Human play against various decision maker types doesn't significantly vary. These factors lead to a strong linear relationship between the humans and algorithms action choice proportions that is suggestive of the algorithm's best response correspondence.

If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://excen.gsu.edu/workingpapers/GSU_EXCEN_WP_2006-17.pdf
File Function: First version, 2006
Download Restriction: no

File URL: http://excen.gsu.edu/workingpapers/GSU_EXCEN_WP_2008-06.pdf
File Function: Revised version, 2008
Download Restriction: no

Paper provided by Experimental Economics Center, Andrew Young School of Policy Studies, Georgia State University in its series Experimental Economics Center Working Paper Series with number 2006-17.

as
in new window

Length: 1
Date of creation: Jun 2002
Date of revision: Aug 2008
Handle: RePEc:exc:wpaper:2006-17
Contact details of provider: Postal:
(404) 651-3990

Phone: (404) 651-3990
Fax: (404) 651-3996
Web page: http://excen.gsu.edu/

More information through EDIRC

References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:

as
in new window

  1. Steven Gjerstad, 1995. "The rate of convergence of continuous fictitious play," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 7(1), pages 161-178.
  2. McKelvey, Richard D. & Palfrey, Thomas R., 1994. "Quantal Response Equilibria For Normal Form Games," Working Papers 883, California Institute of Technology, Division of the Humanities and Social Sciences.
  3. McCabe, Kevin & Houser, Daniel & Ryan, Lee & Smith, Vernon & Trouard, Ted, 2001. "A Functional Imaging Study of Cooperation in Two-Person reciprocal Exchange," MPRA Paper 5172, University Library of Munich, Germany.
  4. AJ A. Bostian & Charles A. Holt & Angela M. Smith, 2008. "Newsvendor "Pull-to-Center" Effect: Adaptive Learning in a Laboratory Experiment," Manufacturing & Service Operations Management, INFORMS, vol. 10(4), pages 590-608, July.
  5. Peter Dürsch & Albert Kolb & Jörg Oechssler & Burkhard C. Schipper, 2005. "Rage Against the Machines: How Subjects Learn to Play Against Computers," Working Papers 0423, University of Heidelberg, Department of Economics, revised Oct 2005.
  6. Andreoni, J. & Miller, J.H., 1991. "Rational Cooperative in the Finitely Repeated Prisoner's Dilemma: Experimental Evidence," Working papers 9102, Wisconsin Madison - Social Systems.
  7. Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011 Elsevier.
  8. Timothy C. Salmon, 2001. "An Evaluation of Econometric Models of Adaptive Learning," Econometrica, Econometric Society, vol. 69(6), pages 1597-1628, November.
  9. Brit Grosskopf, 2003. "Reinforcement and Directional Learning in the Ultimatum Game with Responder Competition," Experimental Economics, Springer;Economic Science Association, vol. 6(2), pages 141-158, October.
  10. Bruno Contini & Roberto Leombruni & Matteo Richiardi, 2006. "Exploring a New ExpAce: The Complementarities between Experimental Economics and Agent-based Computational Economics," LABORatorio R. Revelli Working Papers Series 45, LABORatorio R. Revelli, Centre for Employment Studies.
  11. Yaw Nyarko & Andrew Schotter, 2002. "An Experimental Study of Belief Learning Using Elicited Beliefs," Econometrica, Econometric Society, vol. 70(3), pages 971-1005, May.
  12. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
  13. James Choi & David Laibson & Brigitte Madrian & Andrew Metrick, 2007. "Reinforcement Learning and Savings Behavior," Yale School of Management Working Papers amz2657, Yale School of Management, revised 01 Mar 2009.
  14. Walker, James M. & Smith, Vernon L. & Cox, James C., 1987. "Bidding behavior in first price sealed bid auctions : Use of computerized Nash competitors," Economics Letters, Elsevier, vol. 23(3), pages 239-244.
  15. Russell Cooper & Douglas V. DeJong & Thomas W. Ross, 1992. "Cooperation without Reputation: Experimental Evidence from Prisoner's Dilemma Games," Papers 0036, Boston University - Industry Studies Programme.
  16. Jason Shachat & J. Todd Swarthouty & Lijia Wei, 2013. "Man Versus Nash: An Experiment on the Self-enforcing Nature of Mixed Strategy Equilibrium," WISE Working Papers 2013-10-14, Wang Yanan Institute for Studies in Economics (WISE), Xiamen University.
  17. TeckH. Ho & Xin Wang & ColinF. Camerer, 2008. "Individual Differences in EWA Learning with Partial Payoff Information," Economic Journal, Royal Economic Society, vol. 118(525), pages 37-59, 01.
  18. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
  19. Fudenberg, Drew & Levine, David K., 1995. "Consistency and cautious fictitious play," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 1065-1089.
  20. Roth, Alvin E & Schoumaker, Francoise, 1983. "Expectations and Reputations in Bargaining: An Experimental Study," American Economic Review, American Economic Association, vol. 73(3), pages 362-372, June.
  21. Reinhard Selten & Thorsten Chmura, 2005. "Stationary Concepts for Experimental 2x2 Games," Bonn Econ Discussion Papers bgse33_2005, University of Bonn, Germany.
  22. Spiliopoulos, Leonidas, 2008. "Humans versus computer algorithms in repeated mixed strategy games," MPRA Paper 6672, University Library of Munich, Germany.
  23. Jan Tuinstra & Joep Sonnemans & Cars Hommes & Peter Heemeijer, 2006. "Price Stability and Volatility in Markets with Positive and Negative Expectations Feedback: An Experimental Investigation," Working Papers wp06-18, Warwick Business School, Finance Group.
  24. Duffy, John, 2001. "Learning to speculate: Experiments with artificial and real agents," Journal of Economic Dynamics and Control, Elsevier, vol. 25(3-4), pages 295-319, March.
  25. Fehr, Ernst & Tyran, Jean-Robert, 2004. "Money Illusion and Coordination Failure," IZA Discussion Papers 1013, Institute for the Study of Labor (IZA).
  26. Andreoni James & Miller John H., 1995. "Auctions with Artificial Adaptive Agents," Games and Economic Behavior, Elsevier, vol. 10(1), pages 39-64, July.
  27. Gary E. Bolton & Elena Katok, 2008. "Learning by Doing in the Newsvendor Problem: A Laboratory Investigation of the Role of Experience and Feedback," Manufacturing & Service Operations Management, INFORMS, vol. 10(3), pages 519-538, September.
  28. Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
  29. Yan Chen & Fang-Fang Tang, 1998. "Learning and Incentive-Compatible Mechanisms for Public Goods Provision: An Experimental Study," Journal of Political Economy, University of Chicago Press, vol. 106(3), pages 633-662, June.
  30. Pedro Dal Bo & Guillaume R. Frechette, 2007. "The Evolution of Cooperation in Infinitely Repeated Games: Experimental Evidence," Working Papers 2007-7, Brown University, Department of Economics.
  31. Robert Slonim & Alvin E. Roth, 1998. "Learning in High Stakes Ultimatum Games: An Experiment in the Slovak Republic," Econometrica, Econometric Society, vol. 66(3), pages 569-596, May.
  32. Shachat, Jason M., 2002. "Mixed Strategy Play and the Minimax Hypothesis," Journal of Economic Theory, Elsevier, vol. 104(1), pages 189-226, May.
  33. Ulrike Malmendier & Stefan Nagel, 2009. "Depression Babies: Do Macroeconomic Experiences Affect Risk-Taking?," NBER Working Papers 14813, National Bureau of Economic Research, Inc.
  34. Ho, Teck-Hua & Camerer, Colin & Weigelt, Keith, 1998. "Iterated Dominance and Iterated Best Response in Experimental "p-Beauty Contests."," American Economic Review, American Economic Association, vol. 88(4), pages 947-969, September.
  35. Markose, Sheri & Arifovic, Jasmina & Sunder, Shyam, 2007. "Advances in experimental and agent-based modelling: Asset markets, economic networks, computational mechanism design and evolutionary game dynamics," Journal of Economic Dynamics and Control, Elsevier, vol. 31(6), pages 1801-1807, June.
  36. Cars Hommes, 2010. "The heterogeneous expectations hypothesis: some evidence from the lab," Post-Print hal-00753041, HAL.
  37. Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
  38. Eckel, Catherine C. & Grossman, Philip J., 1996. "Altruism in Anonymous Dictator Games," Games and Economic Behavior, Elsevier, vol. 16(2), pages 181-191, October.
  39. Mookherjee Dilip & Sopher Barry, 1994. "Learning Behavior in an Experimental Matching Pennies Game," Games and Economic Behavior, Elsevier, vol. 7(1), pages 62-91, July.
  40. Cheung, Yin-Wong & Friedman, Daniel, 1997. "Individual Learning in Normal Form Games: Some Laboratory Results," Games and Economic Behavior, Elsevier, vol. 19(1), pages 46-76, April.
  41. Mookherjee, Dilip & Sopher, Barry, 1997. "Learning and Decision Costs in Experimental Constant Sum Games," Games and Economic Behavior, Elsevier, vol. 19(1), pages 97-132, April.
  42. Jason Shachat & J. Todd Swarthout, 2003. "Do We Detect and Exploit Mixed Strategy Play by Opponents?," Experimental 0310001, EconWPA.
  43. Robert W. Rosenthal & Jason Shachat & Mark Walker, 2003. "Hide and seek in Arizona," International Journal of Game Theory, Springer;Game Theory Society, vol. 32(2), pages 273-293, December.
  44. Morgan, John & Sefton, Martin, 2002. "An Experimental Investigation of Unprofitable Games," Games and Economic Behavior, Elsevier, vol. 40(1), pages 123-146, July.
  45. Jordan J. S., 1993. "Three Problems in Learning Mixed-Strategy Nash Equilibria," Games and Economic Behavior, Elsevier, vol. 5(3), pages 368-386, July.
  46. Jason Shachat & J. Todd Swarthout, 2004. "Do we detect and exploit mixed strategy play by opponents?," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 59(3), pages 359-373, 07.
  47. Daniel Houser & Robert Kurzban, 2002. "Revisiting Kindness and Confusion in Public Goods Experiments," American Economic Review, American Economic Association, vol. 92(4), pages 1062-1069, September.
  48. Robert W. Rosenthal & Jason Shachat & Mark Walker, 2003. "Hide and Seek in Arizona," Experimental 0312001, EconWPA.
  49. Camerer, Colin F. & Ho, Teck-Hua & Chong, Juin-Kuan, 2002. "Sophisticated Experience-Weighted Attraction Learning and Strategic Teaching in Repeated Games," Journal of Economic Theory, Elsevier, vol. 104(1), pages 137-188, May.
  50. Eyal Winter & Shmuel Zamir, 2005. "An Experiment With Ultimatum Bargaining In A Changing Environment," The Japanese Economic Review, Japanese Economic Association, vol. 56(3), pages 363-385.
  51. Colin Camerer & Teck Ho & Kuan Chong, 2003. "Models of Thinking, Learning, and Teaching in Games," American Economic Review, American Economic Association, vol. 93(2), pages 192-195, May.
  52. Arijit Mukherji & David E. Runkle, 2000. "Learning to be unpredictable : an experimental study," Quarterly Review, Federal Reserve Bank of Minneapolis, issue Spr, pages 14-20.
  53. repec:spr:compst:v:59:y:2004:i:3:p:359-373 is not listed on IDEAS
  54. Sonsino, Doron & Sirota, Julia, 2003. "Strategic pattern recognition--experimental evidence," Games and Economic Behavior, Elsevier, vol. 44(2), pages 390-411, August.
  55. Juanjuan Zhang, 2010. "The Sound of Silence: Observational Learning in the U.S. Kidney Market," Marketing Science, INFORMS, vol. 29(2), pages 315-335, 03-04.
Full references (including those not matched with items on IDEAS)

This item is not listed on Wikipedia, on a reading list or among the top items on IDEAS.

When requesting a correction, please mention this item's handle: RePEc:exc:wpaper:2006-17. See general information about how to correct material in RePEc.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (J. Todd Swarthout)

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If references are entirely missing, you can add them using this form.

If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

This information is provided to you by IDEAS at the Research Division of the Federal Reserve Bank of St. Louis using RePEc data.