IDEAS home Printed from https://ideas.repec.org/
MyIDEAS: Login to save this paper or follow this series

Nonparametric learning rules from bandit experiments: the eyes have it!

  • Yingyao Hu

    (Institute for Fiscal Studies and Johns Hopkins University)

  • Yutaka Kayaba

    (Institute for Fiscal Studies)

  • Matt Shum

    (Institute for Fiscal Studies)

We estimate nonparametric learning rules using data from dynamic two-armed bandit (probabilistic reversal learning) experiments, supplemented with auxiliary eye-movement measures of subjects' beliefs. We apply recent econometric developments in the estimation of dynamic models. The direct estimation of learning rules differs from the usual modus operandi of the experimental literature. The estimated choice probabilities and learning rules from our nonparametric models have some distinctive features; notably that subjects tend to update in a non-smooth manner following positive 'exploitative' choices (those made in accordance with current beliefs). Simulation results show how the estimated nonparametric learning rules fit aspects of subjects' observed choice sequences better than alternative parameterized learning rules from Bayesian and reinforcement learning models.

If you experience problems downloading a file, check if you have the proper application to view it first. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://cemmap.ifs.org.uk/wps/cwp1510.pdf
Download Restriction: no

Paper provided by Centre for Microdata Methods and Practice, Institute for Fiscal Studies in its series CeMMAP working papers with number CWP15/10.

as
in new window

Length:
Date of creation: Jun 2010
Date of revision:
Handle: RePEc:ifs:cemmap:15/10
Contact details of provider: Postal: The Institute for Fiscal Studies 7 Ridgmount Street LONDON WC1E 7AE
Phone: (+44) 020 7291 4800
Fax: (+44) 020 7323 4780
Web page: http://cemmap.ifs.org.uk
Email:


More information through EDIRC

Order Information: Postal: The Institute for Fiscal Studies 7 Ridgmount Street LONDON WC1E 7AE
Email:


References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:

as in new window
  1. Weitzman, Martin L, 1979. "Optimal Search for the Best Alternative," Econometrica, Econometric Society, vol. 47(3), pages 641-54, May.
  2. Hu, Yingyao & Shum, Matthew, 2012. "Nonparametric identification of dynamic models with unobserved state variables," Journal of Econometrics, Elsevier, vol. 171(1), pages 32-44.
  3. Susumu Imai & Neelam Jain & Andrew Ching, 2009. "Bayesian Estimation of Dynamic Discrete Choice Models," Econometrica, Econometric Society, vol. 77(6), pages 1865-1899, November.
  4. Broseta, Bruno & Costa-Gomes, Miguel & Crawford, Vincent P., 2000. "Cognition and Behavior in Normal-Form Games: An Experimental Study," University of California at San Diego, Economics Working Paper Series qt0fp8278k, Department of Economics, UC San Diego.
  5. Miguel A. Costa-Gomes & Vincent P. Crawford, 2004. "Cognition and Behavior in Two-Person Guessing Games: An Experimental Study," Levine's Bibliography 122247000000000113, UCLA Department of Economics.
  6. Tat Y. Chan & Barton H. Hamilton, 2006. "Learning, Private Information, and the Economic Evaluation of Randomized Experiments," Journal of Political Economy, University of Chicago Press, vol. 114(6), pages 997-1040, December.
  7. Vincent P. Crawford & Nagore Iriberri, 2005. "Level-k Auctions: Can a Non-Equilibrium Model of Strategic Thinking Explain the Winner's Curse and Overbidding in Private-Value Auctions?," Levine's Bibliography 784828000000000604, UCLA Department of Economics.
  8. Charness, Gary & Levin, Dan, 2003. "When Optimal Choices Feel Wrong: A Laboratory Study of Bayesian Updating, Complexity, and Affect," University of California at Santa Barbara, Economics Working Paper Series qt7g63k28w, Department of Economics, UC Santa Barbara.
  9. Xavier Gabaix & David Laibson & Guillermo Moloche & Stephen Weinberg, 2006. "Costly Information Acquisition: Experimental Analysis of a Boundedly Rational Model," American Economic Review, American Economic Association, vol. 96(4), pages 1043-1068, September.
  10. Avi Goldfarb & Mo Xiao, 2011. "Who Thinks about the Competition? Managerial Ability and Strategic Entry in US Local Telephone Markets," American Economic Review, American Economic Association, vol. 101(7), pages 3130-61, December.
  11. Andrew Caplin & Mark Dean & Paul W. Glimcher & Robb B. Rutledge, 2010. "Measuring Beliefs and Rewards: A Neuroeconomic Approach," The Quarterly Journal of Economics, MIT Press, vol. 125(3), pages 923-960, August.
  12. Tülin Erdem & Michael P. Keane, 1996. "Decision-Making Under Uncertainty: Capturing Dynamic Brand Choice Processes in Turbulent Consumer Goods Markets," Marketing Science, INFORMS, vol. 15(1), pages 1-20.
  13. Joseph Tao-yi Wang & Michael Spezio & Colin F. Camerer, 2010. "Pinocchio's Pupil: Using Eyetracking and Pupil Dilation to Understand Truth Telling and Deception in Sender-Receiver Games," American Economic Review, American Economic Association, vol. 100(3), pages 984-1007, June.
  14. Jovanovic, Boyan, 1979. "Job Matching and the Theory of Turnover," Journal of Political Economy, University of Chicago Press, vol. 87(5), pages 972-90, October.
  15. James Choi & David Laibson & Brigitte Madrian & Andrew Metrick, 2007. "Reinforcement Learning and Savings Behavior," Yale School of Management Working Papers amz2657, Yale School of Management, revised 01 Mar 2009.
  16. Daniel A. Ackerberg, 2003. "Advertising, learning, and consumer choice in experience good markets: an empirical examination," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 44(3), pages 1007-1040, 08.
  17. Gregory S. Crawford & Matthew Shum, 2005. "Uncertainty and Learning in Pharmaceutical Demand," Econometrica, Econometric Society, vol. 73(4), pages 1137-1173, 07.
  18. K. Carrie Armel & Aurelie Beaumel & Antonio Rangel, 2008. "Biasing simple choices by manipulating relative visual attention," Judgment and Decision Making, Society for Judgment and Decision Making, vol. 3, pages 396-403, June.
  19. Bergemann, Dirk & Hege, Ulrich, 1997. "Venture Capital Financing, Moral Hazard and Learning," CEPR Discussion Papers 1738, C.E.P.R. Discussion Papers.
  20. Christopher Anderson, 2012. "Ambiguity aversion in multi-armed bandit problems," Theory and Decision, Springer, vol. 72(1), pages 15-33, January.
  21. Noah Gans & George Knox & Rachel Croson, 2007. "Simple Models of Discrete Choice and Their Performance in Bandit Experiments," Manufacturing & Service Operations Management, INFORMS, vol. 9(4), pages 383-408, December.
  22. Robert J. Meyer & Yong Shi, 1995. "Sequential Choice Under Ambiguity: Intuitive Solutions to the Armed-Bandit Problem," Management Science, INFORMS, vol. 41(5), pages 817-834, May.
  23. Hu, Yingyao, 2008. "Identification and estimation of nonlinear models with misclassification error using instrumental variables: A general solution," Journal of Econometrics, Elsevier, vol. 144(1), pages 27-61, May.
  24. Miller, Robert A, 1984. "Job Matching and Occupational Choice," Journal of Political Economy, University of Chicago Press, vol. 92(6), pages 1086-120, December.
  25. Daniel T. Knoepfle & Joseph Tao-yi Wang & Colin F. Camerer, 2009. "Studying Learning in Games Using Eye-Tracking," Journal of the European Economic Association, MIT Press, vol. 7(2-3), pages 388-398, 04-05.
  26. Pakes, Ariel & McGuire, Paul, 2001. "Stochastic Algorithms, Symmetric Markov Perfect Equilibrium, and the 'Curse' of Dimensionality," Econometrica, Econometric Society, vol. 69(5), pages 1261-81, September.
  27. Patrick Bajari & Ali Hortacsu, 2005. "Are Structural Estimates of Auction Models Reasonable? Evidence from Experimental Data," Journal of Political Economy, University of Chicago Press, vol. 113(4), pages 703-741, August.
  28. Rothschild, Michael, 1974. "A two-armed bandit theory of market pricing," Journal of Economic Theory, Elsevier, vol. 9(2), pages 185-202, October.
  29. Yaw Nyarko & Andrew Schotter, 2002. "An Experimental Study of Belief Learning Using Elicited Beliefs," Econometrica, Econometric Society, vol. 70(3), pages 971-1005, May.
  30. Elena Reutskaja & Rosemarie Nagel & Colin F. Camerer & Antonio Rangel, 2011. "Search Dynamics in Consumer Choice under Time Pressure: An Eye-Tracking Study," American Economic Review, American Economic Association, vol. 101(2), pages 900-926, April.
  31. Brocas, Isabelle & Camerer, Colin & Carrillo, Juan D & Wang, Stephanie W., 2009. "Measuring attention and strategic behavior in games with private information," CEPR Discussion Papers 7529, C.E.P.R. Discussion Papers.
  32. Jeffrey Banks & David Porter & Mark Olson, 1997. "An experimental analysis of the bandit problem," Economic Theory, Springer, vol. 10(1), pages 55-77.
  33. K. Carrie Armel & Antonio Rangel, 2008. "The Impact of Computation Time and Experience on Decision Values," American Economic Review, American Economic Association, vol. 98(2), pages 163-68, May.
  34. Johnson, Eric J. & Camerer, Colin & Sen, Sankar & Rymon, Talia, 2002. "Detecting Failures of Backward Induction: Monitoring Information Search in Sequential Bargaining," Journal of Economic Theory, Elsevier, vol. 104(1), pages 16-47, May.
  35. Alexander L. Brown & Colin F. Camerer & Dan Lovallo, 2012. "To Review or Not to Review? Limited Strategic Thinking at the Movie Box Office," American Economic Journal: Microeconomics, American Economic Association, vol. 4(2), pages 1-26, May.
  36. Nathaniel T Wilcox, 2006. "Theories of Learning in Games and Heterogeneity Bias," Econometrica, Econometric Society, vol. 74(5), pages 1271-1292, 09.
  37. Kuhnen, Camelia M. & Knutson, Brian, 2011. "The Influence of Affect on Beliefs, Preferences, and Financial Decisions," Journal of Financial and Quantitative Analysis, Cambridge University Press, vol. 46(03), pages 605-626, June.
Full references (including those not matched with items on IDEAS)

This item is not listed on Wikipedia, on a reading list or among the top items on IDEAS.

When requesting a correction, please mention this item's handle: RePEc:ifs:cemmap:15/10. See general information about how to correct material in RePEc.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Benita Rajania)

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If references are entirely missing, you can add them using this form.

If the full references list an item that is present in RePEc, but the system did not link to it, you can help with this form.

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your profile, as there may be some citations waiting for confirmation.

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

This information is provided to you by IDEAS at the Research Division of the Federal Reserve Bank of St. Louis using RePEc data.