IDEAS home Printed from https://ideas.repec.org/p/ces/ceswps/_6029.html
   My bibliography  Save this paper

Counter Intuitive Learning: An Exploratory Study

Author

Listed:
  • Nobuyuki Hanaki
  • Alan P. Kirman
  • Paul Pezanis-Christou

Abstract

The literature on learning in unknown environments emphasises reinforcing on actions which produce positive results. But, in some cases, success requires shifting from a currently successful actions to others. We examine, experimentally and theoretically in a very simple framework, how individuals initially learn by exploiting information from the pay-offs of actions taken but also from exploring new actions. We analyse if and how they learn that pay-offs are inter-temporally dependent. We then ran the same experiments but where individuals could observe the actions taken or the pay-offs obtained by others or both. Such observations improved pay-offs if one of the pair had learned to obtain the maximum pay-off.

Suggested Citation

  • Nobuyuki Hanaki & Alan P. Kirman & Paul Pezanis-Christou, 2016. "Counter Intuitive Learning: An Exploratory Study," CESifo Working Paper Series 6029, CESifo.
  • Handle: RePEc:ces:ceswps:_6029
    as

    Download full text from publisher

    File URL: https://www.cesifo.org/DocDL/cesifo1_wp6029.pdf
    Download Restriction: no
    ---><---

    Other versions of this item:

    References listed on IDEAS

    as
    1. Woodford, Michael, 1990. "Learning to Believe in Sunspots," Econometrica, Econometric Society, vol. 58(2), pages 277-307, March.
    2. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    3. James G. March, 1991. "Exploration and Exploitation in Organizational Learning," Organization Science, INFORMS, vol. 2(1), pages 71-87, February.
    4. Ralph-C. Bayer & Hang Wu, 2013. "Do We Learn from Our Own Experience or from Observing Others?," School of Economics and Public Policy Working Papers 2013-21, University of Adelaide, School of Economics and Public Policy.
    5. Sonsino, Doron, 1997. "Learning to Learn, Pattern Recognition, and Nash Equilibrium," Games and Economic Behavior, Elsevier, vol. 18(2), pages 286-331, February.
    6. Bossan, Benjamin & Jann, Ole & Hammerstein, Peter, 2015. "The evolution of social learning and its economic consequences," Journal of Economic Behavior & Organization, Elsevier, vol. 112(C), pages 266-288.
    7. Spiliopoulos, Leonidas, 2012. "Pattern recognition and subjective belief learning in a repeated constant-sum game," Games and Economic Behavior, Elsevier, vol. 75(2), pages 921-935.
    8. Jeffrey Banks & David Porter & Mark Olson, 1997. "An experimental analysis of the bandit problem," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 10(1), pages 55-77.
    9. Bray, Margaret, 1982. "Learning, estimation, and the stability of rational expectations," Journal of Economic Theory, Elsevier, vol. 26(2), pages 318-339, April.
    10. Hu, Yingyao & Kayaba, Yutaka & Shum, Matthew, 2013. "Nonparametric learning rules from bandit experiments: The eyes have it!," Games and Economic Behavior, Elsevier, vol. 81(C), pages 215-231.
    11. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
    12. Urs Fischbacher, 2007. "z-Tree: Zurich toolbox for ready-made economic experiments," Experimental Economics, Springer;Economic Science Association, vol. 10(2), pages 171-178, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Hanaki, Nobuyuki & Kirman, Alan & Pezanis-Christou, Paul, 2018. "Observational and reinforcement pattern-learning: An exploratory study," European Economic Review, Elsevier, vol. 104(C), pages 1-21.
    2. Eric Guerci & Nobuyuki Hanaki & Naoki Watanabe, 2017. "Meaningful learning in weighted voting games: an experiment," Theory and Decision, Springer, vol. 83(1), pages 131-153, June.
    3. Eric Guerci & Nobuyuki Hanaki & Naoki Watanabe, 2015. "Meaningful Learning in Weighted Voting Games: An Experiment," Working Papers halshs-01216244, HAL.
    4. Oyarzun, Carlos & Sanjurjo, Adam & Nguyen, Hien, 2017. "Response functions," European Economic Review, Elsevier, vol. 98(C), pages 1-31.
    5. Ioannou, Christos A. & Romero, Julian, 2014. "A generalized approach to belief learning in repeated games," Games and Economic Behavior, Elsevier, vol. 87(C), pages 178-203.
    6. Wen, Yuanji, 2018. "Voluntary information acquisition in an asymmetric-Information game:comparing learning theories in the laboratory," Journal of Economic Behavior & Organization, Elsevier, vol. 150(C), pages 202-219.
    7. Naoki Watanabe, 2022. "Reconsidering Meaningful Learning in a Bandit Experiment on Weighted Voting: Subjects’ Search Behavior," The Review of Socionetwork Strategies, Springer, vol. 16(1), pages 81-107, April.
    8. Hommes, Cars, 2018. "Behavioral & experimental macroeconomics and policy analysis: a complex systems approach," Working Paper Series 2201, European Central Bank.
    9. Alina Ferecatu & Arnaud De Bruyn, 2022. "Understanding Managers’ Trade-Offs Between Exploration and Exploitation," Marketing Science, INFORMS, vol. 41(1), pages 139-165, January.
    10. Andreas Nicklisch, 2011. "Learning strategic environments: an experimental study of strategy formation and transfer," Theory and Decision, Springer, vol. 71(4), pages 539-558, October.
    11. Elmaghraby, Wedad J. & Larson, Nathan, 2012. "Explaining deviations from equilibrium in auctions with avoidable fixed costs," Games and Economic Behavior, Elsevier, vol. 76(1), pages 131-159.
    12. Rick, Scott & Weber, Roberto A., 2010. "Meaningful learning and transfer of learning in games played repeatedly without feedback," Games and Economic Behavior, Elsevier, vol. 68(2), pages 716-730, March.
    13. Zhang, Yang & Du, Xiaomin, 2017. "Network effects on strategic interactions: A laboratory approach," Journal of Economic Behavior & Organization, Elsevier, vol. 143(C), pages 133-146.
    14. Dennis A. V. Dittrich & Werner Güth & Martin G. Kocher & Paul Pezanis‐Christou, 2012. "Loss Aversion and Learning to Bid," Economica, London School of Economics and Political Science, vol. 79(314), pages 226-257, April.
    15. Claudia Neri, 2015. "Eliciting beliefs in continuous-choice games: a double auction experiment," Experimental Economics, Springer;Economic Science Association, vol. 18(4), pages 569-608, December.
    16. Wolf Ze'ev Ehrblatt & Kyle Hyndman & Erkut Y. ÄOzbay & Andrew Schotter, 2006. "Convergence: An Experimental Study," Levine's Working Paper Archive 122247000000001148, David K. Levine.
    17. Spiliopoulos, Leonidas, 2013. "Beyond fictitious play beliefs: Incorporating pattern recognition and similarity matching," Games and Economic Behavior, Elsevier, vol. 81(C), pages 69-85.
    18. Masiliūnas, Aidas, 2023. "Learning in rent-seeking contests with payoff risk and foregone payoff information," Games and Economic Behavior, Elsevier, vol. 140(C), pages 50-72.
    19. Shu-Heng Chen & Bin-Tzong Chie & Ying-Fang Kao & Ragupathy Venkatachalam, 2019. "Agent-Based Modeling of a Non-tâtonnement Process for the Scarf Economy: The Role of Learning," Computational Economics, Springer;Society for Computational Economics, vol. 54(1), pages 305-341, June.
    20. Ralph-C. Bayer & Elke Renner & Rupert Sausgruber, 2013. "Confusion and learning in the voluntary contributions game," Experimental Economics, Springer;Economic Science Association, vol. 16(4), pages 478-496, December.

    More about this item

    Keywords

    multi-armed bandit; reinforcement learning; eureka moment; pay-off patterns; observational learning;
    All these keywords.

    JEL classification:

    • D81 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Criteria for Decision-Making under Risk and Uncertainty
    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:ces:ceswps:_6029. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Klaus Wohlrabe (email available below). General contact details of provider: https://edirc.repec.org/data/cesifde.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.