IDEAS home Printed from https://ideas.repec.org/p/sfu/sfudps/dp07-24.html
   My bibliography  Save this paper

Learning by Doing vs. Learning from Others in a Principal-Agent Model

Author

Abstract

We introduce learning in a principal-agent model of stochastic output sharing under moral hazard. Without knowing the agents' preferences and technology the principal tries to learn the optimal agency contract. We implement two learning paradigms - social (learning from others) and individual (learning by doing). We use a social evolutionary learning algorithm (SEL) to represent social learning. Within the individual learning paradigm, we investigate the performance of reinforcement learning (RL), experience-weighted attraction learning (EWA), and individual evolutionary learning (IEL). Overall, our results show that learning in the principal-agent environment is very difficult. This is due to three main reasons: (1) the stochastic environment, (2) a discontinuity in the payoff space in a neighborhood of the optimal contract due to the participation constraint and (3) incorrect evaluation of foregone payoffs in the sequential game principal-agent setting. The first two factors apply to all learning algorithms we study while the third is the main contributor for the failure of the EWA and IEL models. Social learning (SEL), especially combined with selective replication, is much more successful in achieving convergence to the optimal contract than the canonical versions of individual learning from the literature. A modified version of the IEL algorithm using realized payoff evaluation performs better than the other individual learning models; however, it still falls short of the social learning's ability to converge to the optimal contract.

Suggested Citation

  • Jasmina Arifovic & Alexander Karaivanov, 2007. "Learning by Doing vs. Learning from Others in a Principal-Agent Model," Discussion Papers dp07-24, Department of Economics, Simon Fraser University.
  • Handle: RePEc:sfu:sfudps:dp07-24
    as

    Download full text from publisher

    File URL: http://www.sfu.ca/econ-research/RePEc/sfu/sfudps/dp07-24.pdf
    Download Restriction: no

    Other versions of this item:

    References listed on IDEAS

    as
    1. Romer, Paul M, 1986. "Increasing Returns and Long-run Growth," Journal of Political Economy, University of Chicago Press, vol. 94(5), pages 1002-1037, October.
    2. Joseph E. Stiglitz, 1974. "Incentives and Risk Sharing in Sharecropping," Review of Economic Studies, Oxford University Press, vol. 41(2), pages 219-255.
    3. Patrick Bolton & Mathias Dewatripont, 2005. "Contract Theory," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262025760, January.
    4. Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
    5. Rogerson, William P, 1985. "The First-Order Approach to Principal-Agent Problems," Econometrica, Econometric Society, vol. 53(6), pages 1357-1367, November.
    6. Hommes, Cars & Lux, Thomas, 2013. "Individual Expectations And Aggregate Behavior In Learning-To-Forecast Experiments," Macroeconomic Dynamics, Cambridge University Press, vol. 17(02), pages 373-401, March.
    7. Marks, Robert, 1998. "Evolved perception and behaviour in oligopolies," Journal of Economic Dynamics and Control, Elsevier, vol. 22(8-9), pages 1209-1233, August.
    8. Lettau, Martin, 1997. "Explaining the facts with adaptive agents: The case of mutual fund flows," Journal of Economic Dynamics and Control, Elsevier, vol. 21(7), pages 1117-1147, June.
    9. Arifovic, Jasmina, 1994. "Genetic algorithm learning and the cobweb model," Journal of Economic Dynamics and Control, Elsevier, vol. 18(1), pages 3-28, January.
    10. Timothy G. Conley & Christopher R. Udry, 2010. "Learning about a New Technology: Pineapple in Ghana," American Economic Review, American Economic Association, vol. 100(1), pages 35-69, March.
    11. Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
    12. Arifovic, Jasmina & Ledyard, John, 2007. "Call market book information and efficiency," Journal of Economic Dynamics and Control, Elsevier, vol. 31(6), pages 1971-2000, June.
    13. Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
    14. Jasmina Arifovic & Michael Maschek, 2006. "Revisiting Individual Evolutionary Learning in the Cobweb Model – An Illustration of the Virtual Spite-Effect," Computational Economics, Springer;Society for Computational Economics, vol. 28(4), pages 333-354, November.
    15. Lux, Thomas & Schornstein, Sascha, 2005. "Genetic learning as an explanation of stylized facts of foreign exchange markets," Journal of Mathematical Economics, Elsevier, vol. 41(1-2), pages 169-196, February.
    16. Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, January.
    17. Rose Cunningham, 2004. "Investment, Private Information and Social Learning: A Case Study of the Semiconductor Industry," Macroeconomics 0409021, EconWPA.
    18. Stokey, Nancy L, 1988. "Learning by Doing and the Introduction of New Goods," Journal of Political Economy, University of Chicago Press, vol. 96(4), pages 701-717, August.
    19. Holmstrom, Bengt & Milgrom, Paul, 1987. "Aggregation and Linearity in the Provision of Intertemporal Incentives," Econometrica, Econometric Society, vol. 55(2), pages 303-328, March.
    20. Rebecca Achee Thornton & Peter Thompson, 2001. "Learning from Experience and Learning from Others: An Exploration of Learning and Spillovers in Wartime Shipbuilding," American Economic Review, American Economic Association, vol. 91(5), pages 1350-1368, December.
    21. Xiaobo Zhang & Shenggen Fan & Ximing Cai, 2002. "The Path Of Technology Diffusion: Which Neighbors To Learn From?," Contemporary Economic Policy, Western Economic Association International, vol. 20(4), pages 470-478, October.
    22. Harald Uhlig & Martin Lettau, 1999. "Rules of Thumb versus Dynamic Programming," American Economic Review, American Economic Association, vol. 89(1), pages 148-174, March.
    23. Masten, Scott E & Snyder, Edward A, 1993. "United States versus United Shoe Machinery Corporation: On the Merits," Journal of Law and Economics, University of Chicago Press, vol. 36(1), pages 33-70, April.
    24. Arifovic, Jasmina, 1996. "The Behavior of the Exchange Rate in the Genetic Algorithm and Experimental Economies," Journal of Political Economy, University of Chicago Press, vol. 104(3), pages 510-541, June.
    25. Sunil Dutta, 2002. "Revenue Recognition in a Multiperiod Agency Setting," Journal of Accounting Research, Wiley Blackwell, vol. 40(1), pages 67-83, March.
    26. Jasmina Arifovic & John Ledyard, 2004. "Scaling Up Learning Models in Public Good Games," Journal of Public Economic Theory, Association for Public Economic Theory, vol. 6(2), pages 203-238, May.
    27. Vriend, Nicolaas J., 2000. "An illustration of the essential difference between individual and social learning, and its consequences for computational analyses," Journal of Economic Dynamics and Control, Elsevier, vol. 24(1), pages 1-19, January.
    28. Chao, Kang, 1983. "Tenure Systems in Traditional China," Economic Development and Cultural Change, University of Chicago Press, vol. 31(2), pages 295-314, January.
    29. Alexander Karaivanov & Robert M. Townsend, 2014. "Dynamic Financial Constraints: Distinguishing Mechanism Design From Exogenously Incomplete Regimes," Econometrica, Econometric Society, vol. 82(3), pages 887-959, May.
    30. Caves, Richard E & Crookell, Harold & Killing, J Peter, 1983. "The Imperfect Market for Technology Licenses," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 45(3), pages 249-267, August.
    31. Lucas, Robert Jr., 1988. "On the mechanics of economic development," Journal of Monetary Economics, Elsevier, vol. 22(1), pages 3-42, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Luba Petersen & Jasmina Arifovic, 2015. "Escaping Expectations-Driven Liquidity Traps: Experimental Evidence," Discussion Papers dp15-03, Department of Economics, Simon Fraser University.
    2. repec:eee:jeborg:v:141:y:2017:i:c:p:64-82 is not listed on IDEAS

    More about this item

    Keywords

    learning; principal-agent model; moral hazard;

    JEL classification:

    • D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
    • D86 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Economics of Contract Law
    • C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sfu:sfudps:dp07-24. See general information about how to correct material in RePEc.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: (Working Paper Coordinator). General contact details of provider: http://edirc.repec.org/data/desfuca.html .

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service hosted by the Research Division of the Federal Reserve Bank of St. Louis . RePEc uses bibliographic data supplied by the respective publishers.