Learning by doing and the value of optimal experimentation

My bibliography Save this article

Learning by doing and the value of optimal experimentation

Author

Listed:

Wieland, Volker

Registered:

Volker Wieland

Abstract

Research on learning-by-doing has typically been restricted to cases where estimation and control can be treated separately. Recent work has provided convergence results for more general learning problems where experimentation is an important aspect of optimal control. However the associated optimal policy cannot be derived analytically because Bayesian learning introduces a nonlinearity in the dynamic programming problem. This paper characterizes the optimal policy numerically and shows that it incorporates a substantial degree of experimentation. Dynamic simulations indicate that optimal experimentation dramatically improves the speed of learning, while separating control and estimation frequently induces a long-lasting bias in the control and target variables.
(This abstract was borrowed from another version of this item.)

Suggested Citation

Wieland, Volker, 2000. "Learning by doing and the value of optimal experimentation," Journal of Economic Dynamics and Control, Elsevier, vol. 24(4), pages 501-534, April.

Handle: RePEc:eee:dyncon:v:24:y:2000:i:4:p:501-534

Download full text from publisher

As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

Other versions of this item:

Volker W. Wieland, 1996. "Learning by doing and the value of optimal experimentation," Finance and Economics Discussion Series 96-5, Board of Governors of the Federal Reserve System (U.S.).

References listed on IDEAS

Philippe Aghion & Patrick Bolton & Christopher Harris & Bruno Jullien, 1991. "Optimal Learning by Experimentation," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 58(4), pages 621-654.
- Aghion, P. & Bolton, P. & Harris, C. & Jullien, B., 1990. "Optimal Learning By Experimentation," DELTA Working Papers 90-10, DELTA (Ecole normale supérieure).
- Aghion Philippe & Bolton, Patrick & Harris Christopher & Jullien Bruno, 1991. "Optimal learning by experimentation," CEPREMAP Working Papers (Couverture Orange) 9104, CEPREMAP.
Mizrach, Bruce, 1991. "Nonconvexities in a stochastic control problem with learning," Journal of Economic Dynamics and Control, Elsevier, vol. 15(3), pages 515-538, July.
Jovanovic, Boyan & Nyarko, Yaw, 1996. "Learning by Doing and the Choice of Technology," Econometrica, Econometric Society, vol. 64(6), pages 1299-1310, November.
- Boyan Jovanovic & Yaw Nyarko, 1994. "Learning By Doing and the Choice of Technology," NBER Working Papers 4739, National Bureau of Economic Research, Inc.
- Jovanovic, B. & Nyarko, Y., 1996. "Learning by Doing and the Choice of Technology," Working Papers 96-25, C.V. Starr Center for Applied Economics, New York University.
Kenneth L. Judd, 1998. "Numerical Methods in Economics," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262100711, December.
Alfred L. Norman, 1976. "First Order Dual Control," NBER Chapters, in: Annals of Economic and Social Measurement, Volume 5, number 3, pages 311-321, National Bureau of Economic Research, Inc.
Foster, Andrew D & Rosenzweig, Mark R, 1995. "Learning by Doing and Learning from Others: Human Capital and Technical Change in Agriculture," Journal of Political Economy, University of Chicago Press, vol. 103(6), pages 1176-1209, December.
- Mark Rosenzweig & Andrew D. Foster, "undated". "Learning by Doing and Learning from Others: Human Capital and Technical Change in Agriculture," Home Pages _068, University of Pennsylvania.
Kiefer, Nicholas M., 1989. "A value function arising in the economics of information," Journal of Economic Dynamics and Control, Elsevier, vol. 13(2), pages 201-223, April.
Rothschild, Michael, 1974. "A two-armed bandit theory of market pricing," Journal of Economic Theory, Elsevier, vol. 9(2), pages 185-202, October.
Bertocchi, Graziella & Spagat, Michael, 1993. "Learning, experimentation, and monetary policy," Journal of Monetary Economics, Elsevier, vol. 32(1), pages 169-183, August.
- Bertocchi, Graziella & Spagat, Michael, 1991. "Learning, Experimentation and Monetary Policy," LIDAM Discussion Papers IRES 1991018, Université catholique de Louvain, Institut de Recherches Economiques et Sociales (IRES).
Kendrick, David, 1978. "Non-convexities from probing in adaptive control problems," Economics Letters, Elsevier, vol. 1(4), pages 347-351.
Volker Wieland, 2005. "A Numerical Dynamic Programming Algorithm for Optimal Learning Problems," Computing in Economics and Finance 2005 193, Society for Computational Economics.
Rustichini, Aldo & Wolinsky, Asher, 1995. "Learning about variable demand in the long run," Journal of Economic Dynamics and Control, Elsevier, vol. 19(5-7), pages 1283-1292.
- Aldo Rustichini & Asher Wolinsky, 1992. "Learning about Variable Demand in the Long Run," Discussion Papers 1015, Northwestern University, Center for Mathematical Studies in Economics and Management Science.
- RUSTICHINI, Aldo & WOLINSKY , Asher, 1993. "Learning about Variable Demand in the Long Run," LIDAM Discussion Papers CORE 1993017, Université catholique de Louvain, Center for Operations Research and Econometrics (CORE).
Tucci, Marco P., 1997. "Adaptive control in the presence of time-varying parameters," Journal of Economic Dynamics and Control, Elsevier, vol. 22(1), pages 39-47, November.
Balvers, Ronald J & Cosimano, Thomas F, 1990. "Actively Learning about Demand and the Dynamics of Price Adjustment," Economic Journal, Royal Economic Society, vol. 100(402), pages 882-898, September.
Amman, Hans M & Kendrick, David A, 1995. "Nonconvexities in Stochastic Control Models," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 36(2), pages 455-475, May.
Easley, David & Kiefer, Nicholas M, 1988. "Controlling a Stochastic Process with Unknown Parameters," Econometrica, Econometric Society, vol. 56(5), pages 1045-1064, September.
Alfred L. Norman & M. R. Norman & Carl Palash, 1979. "Multiple relative maxima in optimal macroeconomic policy: an illustration," Special Studies Papers 134, Board of Governors of the Federal Reserve System (U.S.).
El-Gamal, Mahmoud A. & Sundaram, Rangarajan K., 1993. "Bayesian economists ... Bayesian agents : An alternative approach to optimal learning," Journal of Economic Dynamics and Control, Elsevier, vol. 17(3), pages 355-383, May.
- El-Gamal, Mahmoud A. & Sundaram, Rangarajan K., 1989. "Bayesian Economist ... Bayesian Agents I: An Alternative Approach to Optimal Learning," Working Papers 705, California Institute of Technology, Division of the Humanities and Social Sciences.
Ronald J. Balvers & Thomas F. Cosimano, 1994. "Inflation Variability and Gradualist Monetary Policy," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 61(4), pages 721-738.
John B. Taylor, 1976. "Methods of Efficient Parameter Estimation in Control Problems," NBER Chapters, in: Annals of Economic and Social Measurement, Volume 5, number 3, pages 339-347, National Bureau of Economic Research, Inc.
Trefler, Daniel, 1993. "The Ignorant Monopolist: Optimal Learning with Endogenous Information," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 34(3), pages 565-581, August.
Prescott, Edward C, 1972. "The Multi-Period Control Problem Under Uncertainty," Econometrica, Econometric Society, vol. 40(6), pages 1043-1058, November.
Kendrick, David, 1982. "Caution and probing in a macroeconomic model," Journal of Economic Dynamics and Control, Elsevier, vol. 4(1), pages 149-170, November.
McLennan, Andrew, 1984. "Price dispersion and incomplete learning in the long run," Journal of Economic Dynamics and Control, Elsevier, vol. 7(3), pages 331-347, September.
Kiefer, Nicholas M & Nyarko, Yaw, 1989. "Optimal Control of an Unknown Linear Process with Learning," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 30(3), pages 571-586, August.
Balvers, Ronald J. & Cosimano, Thomas F., 1993. "Periodic learning about a hidden state variable," Journal of Economic Dynamics and Control, Elsevier, vol. 17(5-6), pages 805-827.
Anderson, T W & Taylor, John B, 1976. "Some Experimental Results on the Statistical Properties of Least Squares Estimates in Control Problems," Econometrica, Econometric Society, vol. 44(6), pages 1289-1302, November.
Amman, Hans M. & Kendrick, David A., 1994. "Active learning Monte Carlo results," Journal of Economic Dynamics and Control, Elsevier, vol. 18(1), pages 119-124, January.
Taylor, John B, 1974. "Asymptotic Properties of Multiperiod Control Rules in the Linear Regression Model," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 15(2), pages 472-484, June.
Jovanovic, Boyan & Nyarko, Yaw, 1994. "The Bayesian Foundations of Learning by Doing," Working Papers 94-15, C.V. Starr Center for Applied Economics, New York University.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Beck, Gunter W. & Wieland, Volker, 2002. "Learning and control in a changing economic environment," Journal of Economic Dynamics and Control, Elsevier, vol. 26(9-10), pages 1359-1377, August.
Kendrick, David A., 2005. "Stochastic control for economic models: past, present and the paths ahead," Journal of Economic Dynamics and Control, Elsevier, vol. 29(1-2), pages 3-30, January.
Wieland, Volker, 2000. "Monetary policy, parameter uncertainty and optimal learning," Journal of Monetary Economics, Elsevier, vol. 46(1), pages 199-228, August.
- Wieland, Volker, 1999. "Monetary policy, parameter uncertainty and optimal learning," ZEI Working Papers B 09-1999, University of Bonn, ZEI - Center for European Integration Studies.
- Volker W. Wieland, 1999. "Monetary policy, parameter uncertainty and optimal learning," Finance and Economics Discussion Series 1999-48, Board of Governors of the Federal Reserve System (U.S.).
D.A. Kendrick & H.M. Amman & M.P. Tucci, 2008. "Learning About Learning in Dynamic Economic Models," Working Papers 08-20, Utrecht School of Economics.
repec:use:tkiwps:2020 is not listed on IDEAS
Volker Wieland, "undated". "Monetary Policy and Uncertainty about the Natural Unemployment Rate," Computing in Economics and Finance 1997 11, Society for Computational Economics.
- Wieland, Volker, 2003. "Monetary Policy and Uncertainty about the Natural Unemployment Rate," CFS Working Paper Series 2003/05, Center for Financial Studies (CFS).
- Volker W. Wieland, 1998. "Monetary policy and uncertainty about the natural unemployment rate," Finance and Economics Discussion Series 1998-22, Board of Governors of the Federal Reserve System (U.S.).
- Wieland, Volker, 2003. "Monetary Policy and Uncertainty about the Natural Unemployment Rate," CEPR Discussion Papers 3811, C.E.P.R. Discussion Papers.
Cosimano, Thomas F., 2008. "Optimal experimentation and the perturbation method in the neighborhood of the augmented linear regulator problem," Journal of Economic Dynamics and Control, Elsevier, vol. 32(6), pages 1857-1894, June.
Tim Willems, 2017. "Actively Learning by Pricing: A Model of an Experimenting Seller," Economic Journal, Royal Economic Society, vol. 127(604), pages 2216-2239, September.
- Tim Willems, 2013. "Actively Learning by Pricing: A Model of an Experimenting Seller," Economics Series Working Papers 687, University of Oxford, Department of Economics.
H.M. Amman & D.A. Kendrick, 2012. "Conjectures on the policy function in the presence of optimal experimentation," Working Papers 12-09, Utrecht School of Economics.
Amman, Hans M. & Kendrick, David A. & Tucci, Marco P., 2020. "Approximating The Value Function For Optimal Experimentation," Macroeconomic Dynamics, Cambridge University Press, vol. 24(5), pages 1073-1086, July.
Koulovatianos, Christos & Mirman, Leonard J. & Santugini, Marc, 2009. "Optimal growth and uncertainty: Learning," Journal of Economic Theory, Elsevier, vol. 144(1), pages 280-295, January.
- Christos Koulovatianos & Leonard J. Mirman & Marc Santugini, 2007. "Optimal Growth and Uncertainty: Learning," Cahiers de recherche 07-05, HEC Montréal, Institut d'économie appliquée, revised Feb 2008.
- Christos Koulovatianos, & Leonard J. Mirman & Marc Santugini, 2008. "Optimal Growth and Uncertainty: Learning," Discussion Papers 08/08, University of Nottingham, Centre for Finance, Credit and Macroeconomics (CFCM).
Mason, Robin & Välimäki, Juuso, 2011. "Learning about the arrival of sales," Journal of Economic Theory, Elsevier, vol. 146(4), pages 1699-1711, July.
Christos Koulovatianos & Leonard J. Mirman & Marc Santugini, 2006. "Investment in a Monopoly with Bayesian Learning," Vienna Economics Papers 0603, University of Vienna, Department of Economics.
- Christos Koulovatianos & Leonard J. Mirman & Marc Santugini, 2011. "Investment in a Monopoly with Bayesian Learning," Cahiers de recherche 11-05, HEC Montréal, Institut d'économie appliquée.
Bergemann, Dirk & Valimaki, Juuso, 2002. "Entry and Vertical Differentiation," Journal of Economic Theory, Elsevier, vol. 106(1), pages 91-125, September.
- Dirk Bergemann & Juuso Valimaki, 2000. "Entry and Vertical Differentiation," Cowles Foundation Discussion Papers 1277, Cowles Foundation for Research in Economics, Yale University.
- Dirk Bergemann & Valimaki Juuso, 2001. "Entry and Vertical Differentiation," Cowles Foundation Discussion Papers 1302, Cowles Foundation for Research in Economics, Yale University.
David Kendrick & Hans Amman, 2006. "A Classification System for Economic Stochastic Control Models," Computational Economics, Springer;Society for Computational Economics, vol. 27(4), pages 453-481, June.
- Hans M. Amman & David A. Kendrick, 2003. "A Classification System for Economic Stochastic Control Models," Computing in Economics and Finance 2003 114, Society for Computational Economics.
Arnoud V. den Boer & Bert Zwart, 2014. "Simultaneously Learning and Optimizing Using Controlled Variance Pricing," Management Science, INFORMS, vol. 60(3), pages 770-783, March.
Leonard J. Mirman & Kevin Reffett & Marc Santugini, 2016. "On learning and growth," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 61(4), pages 641-684, April.
- Leonard J. Mirman & Kevin Reffett & Marc Santugini, 2013. "On Learning and Growth," Cahiers de recherche 1336, CIRPEE.
Hans M. Amman & Marco Paolo Tucci, 2018. "How active is active learning: value function method vs an approximation method," Department of Economics University of Siena 788, Department of Economics, University of Siena.
Christos Koulovatianos & Leonard J. Mirman & Marc Santugini, 2006. "Investment in a Monopoly with Bayesian Learning," Vienna Economics Papers vie0603, University of Vienna, Department of Economics.
- Christos Koulovatianos & Leonard J. Mirman & Marc Santugini, 2011. "Investment in a Monopoly with Bayesian Learning," Cahiers de recherche 11-05, HEC Montréal, Institut d'économie appliquée.
Bond, Craig A., 2008. "On the Potential Use of Adaptive Control Methods for Improving Adaptive Natural Resource Management," Working Papers 108721, Colorado State University, Department of Agricultural and Resource Economics.
Hans M. Amman & Marco P. Tucci, 2020. "How Active is Active Learning: Value Function Method Versus an Approximation Method," Computational Economics, Springer;Society for Computational Economics, vol. 56(3), pages 675-693, October.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:dyncon:v:24:y:2000:i:4:p:501-534. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jedc .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Learning by doing and the value of optimal experimentation

Author

Abstract

Suggested Citation

Download full text from publisher

Other versions of this item:

References listed on IDEAS

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data