Learning by Doing vs. Learning from Others in a Principal-Agent Model

Learning by Doing vs. Learning from Others in a Principal-Agent Model

Author

Listed:

Abstract

We introduce learning in a principal-agent model of stochastic output sharing under moral hazard. Without knowing the agents' preferences and technology the principal tries to learn the optimal agency contract. We implement two learning paradigms - social (learning from others) and individual (learning by doing). We use a social evolutionary learning algorithm (SEL) to represent social learning. Within the individual learning paradigm, we investigate the performance of reinforcement learning (RL), experience-weighted attraction learning (EWA), and individual evolutionary learning (IEL). Overall, our results show that learning in the principal-agent environment is very difficult. This is due to three main reasons: (1) the stochastic environment, (2) a discontinuity in the payoff space in a neighborhood of the optimal contract due to the participation constraint and (3) incorrect evaluation of foregone payoffs in the sequential game principal-agent setting. The first two factors apply to all learning algorithms we study while the third is the main contributor for the failure of the EWA and IEL models. Social learning (SEL), especially combined with selective replication, is much more successful in achieving convergence to the optimal contract than the canonical versions of individual learning from the literature. A modified version of the IEL algorithm using realized payoff evaluation performs better than the other individual learning models; however, it still falls short of the social learning's ability to converge to the optimal contract.

Suggested Citation

Jasmina Arifovic & Alexander Karaivanov, 2007. "Learning by Doing vs. Learning from Others in a Principal-Agent Model," Discussion Papers dp07-24, Department of Economics, Simon Fraser University.

Handle: RePEc:sfu:sfudps:dp07-24

Download full text from publisher

Other versions of this item:

Arifovic, Jasmina & Karaivanov, Alexander, 2010. "Learning by doing vs. learning from others in a principal-agent model," Journal of Economic Dynamics and Control, Elsevier, vol. 34(10), pages 1967-1992, October.

References listed on IDEAS

Romer, Paul M, 1986. "Increasing Returns and Long-run Growth," Journal of Political Economy, University of Chicago Press, vol. 94(5), pages 1002-1037, October.
- Paul M Romer, 1999. "Increasing Returns and Long-Run Growth," Levine's Working Paper Archive 2232, David K. Levine.
Joseph E. Stiglitz, 1974. "Incentives and Risk Sharing in Sharecropping," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 41(2), pages 219-255.
- Joseph E. Stiglitz, 1973. "Incentives and Risk-Sharing in Sharecropping," Cowles Foundation Discussion Papers 353, Cowles Foundation for Research in Economics, Yale University.
Caplin, Andrew & Leahy, John, 1994. "Business as Usual, Market Crashes, and Wisdom after the Fact," American Economic Review, American Economic Association, vol. 84(3), pages 548-565, June.
- Caplin, A. & Leahy, J., 1992. "Business as Usual, Market Crashes, and Wisdom after the Fact," Harvard Institute of Economic Research Working Papers 1594, Harvard - Institute of Economic Research.
Patrick Bolton & Mathias Dewatripont, 2005. "Contract Theory," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262025760, December.
- Mathias Dewatripont & Patrick Bolton, 2005. "Contract theory," ULB Institutional Repository 2013/9543, ULB -- Universite Libre de Bruxelles.
Fudenberg, Drew & Levine, David, 1998. "Learning in games," European Economic Review, Elsevier, vol. 42(3-5), pages 631-639, May.
- Drew Fudenberg & David K. Levine, 1998. "Learning in Games," Levine's Working Paper Archive 2222, David K. Levine.
Erev, Ido & Roth, Alvin E, 1998. "Predicting How People Play Games: Reinforcement Learning in Experimental Games with Unique, Mixed Strategy Equilibria," American Economic Review, American Economic Association, vol. 88(4), pages 848-881, September.
Stokey, Nancy L, 1988. "Learning by Doing and the Introduction of New Goods," Journal of Political Economy, University of Chicago Press, vol. 96(4), pages 701-717, August.
- Nancy L Stokey, 1986. "Learning-by-Doing and the Introduction of New Goods," Discussion Papers 699, Northwestern University, Center for Mathematical Studies in Economics and Management Science, revised May 1987.
Rogerson, William P, 1985. "The First-Order Approach to Principal-Agent Problems," Econometrica, Econometric Society, vol. 53(6), pages 1357-1367, November.
Marks, Robert, 1998. "Evolved perception and behaviour in oligopolies," Journal of Economic Dynamics and Control, Elsevier, vol. 22(8-9), pages 1209-1233, August.
- Robert E. Marks, "undated". "Evolved Perception and Behaviour in Oligopolies," Computing in Economics and Finance 1996 _038, Society for Computational Economics.
Lux, Thomas & Schornstein, Sascha, 2005. "Genetic learning as an explanation of stylized facts of foreign exchange markets," Journal of Mathematical Economics, Elsevier, vol. 41(1-2), pages 169-196, February.
- Lux, Thomas & Schornstein, Sascha, 2002. "Genetic learning as an explanation of stylized facts of foreign exchange markets," Discussion Paper Series 1: Economic Studies 2002,29, Deutsche Bundesbank.
- Lux, Thomas & Schornstein, Sascha, 2003. "Genetic learning as an explanation of stylized facts of foreign exchange markets," Economics Working Papers 2003-12, Christian-Albrechts-University of Kiel, Department of Economics.
Lettau, Martin, 1997. "Explaining the facts with adaptive agents: The case of mutual fund flows," Journal of Economic Dynamics and Control, Elsevier, vol. 21(7), pages 1117-1147, June.
Rose Cunningham, 2004. "Investment, Private Information and Social Learning: A Case Study of the Semiconductor Industry," Macroeconomics 0409021, University Library of Munich, Germany.
- Rose Cunningham, 2004. "Investment, Private Information, and Social Learning: A Case Study of the Semiconductor Industry," Staff Working Papers 04-32, Bank of Canada.
Arifovic, Jasmina, 1994. "Genetic algorithm learning and the cobweb model," Journal of Economic Dynamics and Control, Elsevier, vol. 18(1), pages 3-28, January.
Hommes, Cars & Lux, Thomas, 2013. "Individual Expectations And Aggregate Behavior In Learning-To-Forecast Experiments," Macroeconomic Dynamics, Cambridge University Press, vol. 17(2), pages 373-401, March.
- Hommes, Cars & Lux, Thomas, 2008. "Individual expectations and aggregate behavior in learning to forecast experiments," Kiel Working Papers 1466, Kiel Institute for the World Economy.
- Hommes, C.H. & Lux, T., 2009. "Individual Expectations and Aggregate Behavior in Learning to Forcast Experiments," CeNDEF Working Papers 09-03, Universiteit van Amsterdam, Center for Nonlinear Dynamics in Economics and Finance.
Holmstrom, Bengt & Milgrom, Paul, 1987. "Aggregation and Linearity in the Provision of Intertemporal Incentives," Econometrica, Econometric Society, vol. 55(2), pages 303-328, March.
- Bengt Holmstrom & Paul R. Milgrom, 1985. "Aggregation and Linearity in the Provision of Intertemporal Incentives," Cowles Foundation Discussion Papers 742, Cowles Foundation for Research in Economics, Yale University.
Bengt Holmstrom, 1979. "Moral Hazard and Observability," Bell Journal of Economics, The RAND Corporation, vol. 10(1), pages 74-91, Spring.
- HOLMSTROM, Bengt, 1979. "Moral hazard and observability," LIDAM Reprints CORE 379, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
- Bengt Holmstrom, 1997. "Moral Hazard and Observability," Levine's Working Paper Archive 1205, David K. Levine.
Rebecca Achee Thornton & Peter Thompson, 2001. "Learning from Experience and Learning from Others: An Exploration of Learning and Spillovers in Wartime Shipbuilding," American Economic Review, American Economic Association, vol. 91(5), pages 1350-1368, December.
Xiaobo Zhang & Shenggen Fan & Ximing Cai, 2002. "The Path Of Technology Diffusion: Which Neighbors To Learn From?," Contemporary Economic Policy, Western Economic Association International, vol. 20(4), pages 470-478, October.
Harald Uhlig & Martin Lettau, 1999. "Rules of Thumb versus Dynamic Programming," American Economic Review, American Economic Association, vol. 89(1), pages 148-174, March.
Holmstrom, Bengt & Milgrom, Paul, 1991. "Multitask Principal-Agent Analyses: Incentive Contracts, Asset Ownership, and Job Design," The Journal of Law, Economics, and Organization, Oxford University Press, vol. 7(0), pages 24-52, Special I.
K. J. Arrow, 1971. "The Economic Implications of Learning by Doing," Palgrave Macmillan Books, in: F. H. Hahn (ed.), Readings in the Theory of Growth, chapter 11, pages 131-149, Palgrave Macmillan.
- Kenneth J. Arrow, 1962. "The Economic Implications of Learning by Doing," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 29(3), pages 155-173.
Timothy G. Conley & Christopher R. Udry, 2010. "Learning about a New Technology: Pineapple in Ghana," American Economic Review, American Economic Association, vol. 100(1), pages 35-69, March.
- Conley, Timothy G. & Udry, Christopher R., 2000. "Learning About a New Technology: Pineapple In Ghana," Center Discussion Papers 28400, Yale University, Economic Growth Center.
- Conley, T.G. & Udry, C.R., 2000. "Learning about a New Technology: Pineapple in Ghana," Papers 817, Yale - Economic Growth Center.
- Timothy G. Conley & Christopher R. Udry, 2000. "Learning About a New Technology: Pineapple in Ghana," Working Papers 817, Economic Growth Center, Yale University, revised May 2004.
Roth, Alvin E. & Erev, Ido, 1995. "Learning in extensive-form games: Experimental data and simple dynamic models in the intermediate term," Games and Economic Behavior, Elsevier, vol. 8(1), pages 164-212.
Masten, Scott E & Snyder, Edward A, 1993. "United States versus United Shoe Machinery Corporation: On the Merits," Journal of Law and Economics, University of Chicago Press, vol. 36(1), pages 33-70, April.
Arifovic, Jasmina, 1996. "The Behavior of the Exchange Rate in the Genetic Algorithm and Experimental Economies," Journal of Political Economy, University of Chicago Press, vol. 104(3), pages 510-541, June.
Arifovic, Jasmina & Ledyard, John, 2007. "Call market book information and efficiency," Journal of Economic Dynamics and Control, Elsevier, vol. 31(6), pages 1971-2000, June.
Sunil Dutta & Xiao‐jun Zhang, 2002. "Revenue Recognition in a Multiperiod Agency Setting," Journal of Accounting Research, John Wiley & Sons, Ltd., vol. 40(1), pages 67-83, March.
Jasmina Arifovic & John Ledyard, 2004. "Scaling Up Learning Models in Public Good Games," Journal of Public Economic Theory, Association for Public Economic Theory, vol. 6(2), pages 203-238, May.
Colin Camerer & Teck-Hua Ho, 1999. "Experience-weighted Attraction Learning in Normal Form Games," Econometrica, Econometric Society, vol. 67(4), pages 827-874, July.
Vriend, Nicolaas J., 2000. "An illustration of the essential difference between individual and social learning, and its consequences for computational analyses," Journal of Economic Dynamics and Control, Elsevier, vol. 24(1), pages 1-19, January.
- Nicolaas J. Vriend, 1998. "An Illustration of the Essential Difference between Individual and Social Learning, and its Consequences for Computational Analyses," Working Papers 387, Queen Mary University of London, School of Economics and Finance.
Jasmina Arifovic & Michael Maschek, 2006. "Revisiting Individual Evolutionary Learning in the Cobweb Model – An Illustration of the Virtual Spite-Effect," Computational Economics, Springer;Society for Computational Economics, vol. 28(4), pages 333-354, November.
Chao, Kang, 1983. "Tenure Systems in Traditional China," Economic Development and Cultural Change, University of Chicago Press, vol. 31(2), pages 295-314, January.
Townsend, Robert M, 1982. "Optimal Multiperiod Contracts and the Gain from Enduring Relationships under Private Information," Journal of Political Economy, University of Chicago Press, vol. 90(6), pages 1166-1186, December.
Marimon, Ramon & McGrattan, Ellen & Sargent, Thomas J., 1990. "Money as a medium of exchange in an economy with artificially intelligent agents," Journal of Economic Dynamics and Control, Elsevier, vol. 14(2), pages 329-373, May.
Drew Fudenberg & David K. Levine, 1998. "The Theory of Learning in Games," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262061945, December.
- Drew Fudenberg & David K. Levine, 1996. "The Theory of Learning in Games," Levine's Working Paper Archive 624, David K. Levine.
Caves, Richard E & Crookell, Harold & Killing, J Peter, 1983. "The Imperfect Market for Technology Licenses," Oxford Bulletin of Economics and Statistics, Department of Economics, University of Oxford, vol. 45(3), pages 249-267, August.
Alexander Karaivanov & Robert M. Townsend, 2014. "Dynamic Financial Constraints: Distinguishing Mechanism Design From Exogenously Incomplete Regimes," Econometrica, Econometric Society, vol. 82(3), pages 887-959, May.
- Alexander Karaivanov & Robert M. Townsend, 2013. "Dynamic Financial Constraints: Distinguishing Mechanism Design from Exogenously Incomplete Regimes," NBER Working Papers 19617, National Bureau of Economic Research, Inc.
Lucas, Robert Jr., 1988. "On the mechanics of economic development," Journal of Monetary Economics, Elsevier, vol. 22(1), pages 3-42, July.

Full references (including those not matched with items on IDEAS)

Citations

Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.

Cited by:

is not listed on IDEAS
Anufriev, Mikhail & Arifovic, Jasmina & Donmez, Anil & Ledyard, John & Panchenko, Valentyn, 2025. "IEL-CDA model: A more accurate theory of behavior in continuous double auctions," Journal of Economic Dynamics and Control, Elsevier, vol. 172(C).
Luba Petersen & Jasmina Arifovic, 2015. "Escaping Expectations-Driven Liquidity Traps: Experimental Evidence," Discussion Papers dp15-03, Department of Economics, Simon Fraser University.
Chernomaz, K. & Goertz, J.M.M., 2023. "(A)symmetric equilibria and adaptive learning dynamics in small-committee voting," Journal of Economic Dynamics and Control, Elsevier, vol. 147(C).
Arifovic, Jasmina & Dawid, Herbert & Nanumyan, Mariam, 2025. "Efficiency gains through social influence in a minimum effort game," Journal of Economic Dynamics and Control, Elsevier, vol. 172(C).
Salle, Isabelle & Yildizoglu, Murat & Zumpe, Martin & Sénégas, Marc-Alexandre, 2017. "Coordination through social learning in a general equilibrium model," Journal of Economic Behavior & Organization, Elsevier, vol. 141(C), pages 64-82.
- Isabelle Salle & Murat Yildizoglu & Martin Zumpe & Marc-Alexandre Sénégas, 2017. "Coordination through social learning in a general equilibrium model," Post-Print hal-01848386, HAL.
Anufriev, Mikhail & Duffy, John & Panchenko, Valentyn, 2024. "Individual evolutionary learning in repeated beauty contest games," Journal of Economic Behavior & Organization, Elsevier, vol. 218(C), pages 550-567.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Duffy, John, 2006. "Agent-Based Models and Human Subject Experiments," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 19, pages 949-1011, Elsevier.
- John Duffy, 2004. "Agent-Based Models and Human Subject Experiments," Computational Economics 0412001, University Library of Munich, Germany.
Chen, Shu-Heng, 2012. "Varieties of agents in agent-based computational economics: A historical and an interdisciplinary perspective," Journal of Economic Dynamics and Control, Elsevier, vol. 36(1), pages 1-25.
Waltman, Ludo & Kaymak, Uzay, 2008. "Q-learning agents in a Cournot oligopoly model," Journal of Economic Dynamics and Control, Elsevier, vol. 32(10), pages 3275-3293, October.
Leigh Tesfatsion, 2002. "Agent-Based Computational Economics," Computational Economics 0203001, University Library of Munich, Germany, revised 15 Aug 2002.
- Tesfatsion, Leigh, 2007. "Agent-based computational economics," ISU General Staff Papers 200701010800001423, Iowa State University, Department of Economics.
- Tesfatsion, Leigh, 2003. "Agent-Based Computational Economics," ISU General Staff Papers 200301010800001248, Iowa State University, Department of Economics.
Georges, Christophre, 2006. "Learning with misspecification in an artificial currency market," Journal of Economic Behavior & Organization, Elsevier, vol. 60(1), pages 70-84, May.
LeBaron, Blake, 2006. "Agent-based Computational Finance," Handbook of Computational Economics, in: Leigh Tesfatsion & Kenneth L. Judd (ed.), Handbook of Computational Economics, edition 1, volume 2, chapter 24, pages 1187-1233, Elsevier.
Chernov, G. & Susin, I., 2019. "Models of learning in games: An overview," Journal of the New Economic Association, New Economic Association, vol. 44(4), pages 77-125.
Waltman, L. & van Eck, N.J.P., 2009. "A Mathematical Analysis of the Long-run Behavior of Genetic Algorithms for Social Modeling," ERIM Report Series Research in Management ERS-2009-011-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
Troy Tassier, 2013. "Handbook of Research on Complexity, by J. Barkley Rosser, Jr. and Edward Elgar," Eastern Economic Journal, Palgrave Macmillan;Eastern Economic Association, vol. 39(1), pages 132-133.
Antonio Doria, Francisco, 2011. "J.B. Rosser Jr. , Handbook of Research on Complexity, Edward Elgar, Cheltenham, UK--Northampton, MA, USA (2009) 436 + viii pp., index, ISBN 978 1 84542 089 5 (cased)," Journal of Economic Behavior & Organization, Elsevier, vol. 78(1-2), pages 196-204, April.
Ianni, A., 2002. "Reinforcement learning and the power law of practice: some analytical results," Discussion Paper Series In Economics And Econometrics 203, Economics Division, School of Social Sciences, University of Southampton.
Mikhail Anufriev & Jasmina Arifovic & John Ledyard & Valentyn Panchenko, 2013. "Efficiency of continuous double auctions under individual evolutionary learning with full or limited information," Journal of Evolutionary Economics, Springer, vol. 23(3), pages 539-573, July.
- Anufriev, M. & Arifovic, J. & Ledyard, D. & Panchenko, V., 2010. "Efficiency of Continuous Double Auctions under Individual Evolutionary Learning with Full or Limited Information," CeNDEF Working Papers 10-01, Universiteit van Amsterdam, Center for Nonlinear Dynamics in Economics and Finance.
Inés Macho-Stadler & David Pérez-Castrillo, 2018. "Moral hazard: Base models and two extensions," Chapters, in: Luis C. Corchón & Marco A. Marini (ed.), Handbook of Game Theory and Industrial Organization, Volume I, chapter 16, pages 453-485, Edward Elgar Publishing.
- Ines Macho-Stadler & David Pérez-Castrillo, 2016. "Moral Hazard: Base Models and Two Extensions," CESifo Working Paper Series 5851, CESifo.
- Inés Macho-Stadler & David Pérez-Castrillo, 2016. "Moral Hazard: Base Models and Two Extensions," Working Papers 883, Barcelona School of Economics.
Chernomaz, K. & Goertz, J.M.M., 2023. "(A)symmetric equilibria and adaptive learning dynamics in small-committee voting," Journal of Economic Dynamics and Control, Elsevier, vol. 147(C).
Floortje Alkemade & Han Poutré & Hans Amman, 2009. "Robust Evolutionary Algorithm Design for Socio-Economic Simulation: A Correction," Computational Economics, Springer;Society for Computational Economics, vol. 33(1), pages 99-101, February.
- Floortje Alkemade & Han Poutré & Hans Amman, 2006. "Robust Evolutionary Algorithm Design for Socio-economic Simulation," Computational Economics, Springer;Society for Computational Economics, vol. 28(4), pages 355-370, November.
Nick Feltovich, 2000. "Reinforcement-Based vs. Belief-Based Learning Models in Experimental Asymmetric-Information," Econometrica, Econometric Society, vol. 68(3), pages 605-642, May.
Ying-Fang Kao & Ragupathy Venkatachalam, 2021. "Human and Machine Learning," Computational Economics, Springer;Society for Computational Economics, vol. 57(3), pages 889-909, March.
Mauersberger, Felix, 2019. "Thompson Sampling: Endogenously Random Behavior in Games and Markets," VfS Annual Conference 2019 (Leipzig): 30 Years after the Fall of the Berlin Wall - Democracy and Market Economy 203600, Verein für Socialpolitik / German Economic Association.
Ho, Teck H. & Camerer, Colin F. & Chong, Juin-Kuan, 2007. "Self-tuning experience weighted attraction learning in games," Journal of Economic Theory, Elsevier, vol. 133(1), pages 177-198, March.
Hooper, Louise, 2008. "Paying for performance: Uncertainty, asymmetric information and the payment model," Research in Transportation Economics, Elsevier, vol. 22(1), pages 157-163, January.

More about this item

Keywords

; ; ;

JEL classification:

D83 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Search; Learning; Information and Knowledge; Communication; Belief; Unawareness
D86 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Economics of Contract Law
C63 - Mathematical and Quantitative Methods - - Mathematical Methods; Programming Models; Mathematical and Simulation Modeling - - - Computational Techniques

NEP fields

This paper has been announced in the following NEP Reports:

NEP-BEC-2008-04-21 (Business Economics)
NEP-CMP-2008-04-21 (Computational Economics)
NEP-CTA-2008-04-21 (Contract Theory and Applications)
NEP-DGE-2008-04-21 (Dynamic General Equilibrium)
NEP-EVO-2008-04-21 (Evolutionary Economics)
NEP-EXP-2008-04-21 (Experimental Economics)

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sfu:sfudps:dp07-24. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Working Paper Coordinator (email available below). General contact details of provider: https://edirc.repec.org/data/desfuca.html .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Learning by Doing vs. Learning from Others in a Principal-Agent Model

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Citations

Most related items

More about this item

Keywords

JEL classification:

NEP fields

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data