IDEAS home Printed from https://ideas.repec.org/p/asu/wpaper/2133377.html
   My bibliography  Save this paper

Convergence Properties of Policy Iteration

Author

Abstract

This paper analyzes the asymptotic convergence properties of policy iteration in a class of stationary, infinite-horizon Markovian decision problems that arise in optimal growth theory. These problems have continuous state and control variables, and must therefore be discretized in order to compute an approximate solution. The discretization converts a potentially infinite dimensional fixed-point problem to a finite dimensional problem defined on a finite grid of points in the state space, and it may thus render inapplicable known convergence results for policy iteration such as those of Puterman and Brumelle (1979). Under certain regularity conditions, we prove that for piecewise linear interpolation, policy iteration converges quadratically, i.e. the sequence of errors en = |Vn - V*| (where Vn is an approximate value function produced from the nth policy iteration step) satisfies en+1 = Le2n for all n. We show how the constant L depends on the grid size of the discretization. Also, under more general conditions we establish that convergence is superlinear. We illustrate the theoretical results with numerical experiments that compare the performance of policy iteration and the method of successive approximations. The quantitative results are consistent with theoretical predictions.

Suggested Citation

  • Manuel Santos & John Rust, "undated". "Convergence Properties of Policy Iteration," Working Papers 2133377, Department of Economics, W. P. Carey School of Business, Arizona State University.
  • Handle: RePEc:asu:wpaper:2133377
    as

    Download full text from publisher

    File URL: http://wpcarey.asu.edu/tools/mytools/pubs_admin/FILES/manuel-rust.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Rust, John, 1987. "Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher," Econometrica, Econometric Society, vol. 55(5), pages 999-1033, September.
    2. Kenneth L. Judd, 1998. "Numerical Methods in Economics," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262100711, December.
    3. J. Rust & J. F. Traub & H. Wozniakowski, 2002. "Is There a Curse of Dimensionality for Contraction Fixed Points in the Worst Case?," Econometrica, Econometric Society, vol. 70(1), pages 285-329, January.
    4. Hugo Benitez-Silva & John Rust & Gunter Hitsch & Giorgio Pauletto & George Hall, 2000. "A Comparison Of Discrete And Parametric Methods For Continuous-State Dynamic Programming Problems," Computing in Economics and Finance 2000 24, Society for Computational Economics.
    5. John Rust, 1997. "Using Randomization to Break the Curse of Dimensionality," Econometrica, Econometric Society, vol. 65(3), pages 487-516, May.
    6. Santos, Manuel S., 1999. "Numerical solution of dynamic economic models," Handbook of Macroeconomics, in: J. B. Taylor & M. Woodford (ed.), Handbook of Macroeconomics, edition 1, volume 1, chapter 5, pages 311-386, Elsevier.
    7. Martin L. Puterman & Shelby L. Brumelle, 1979. "On the Convergence of Policy Iteration in Stationary Dynamic Programming," Mathematics of Operations Research, INFORMS, vol. 4(1), pages 60-69, February.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ozak, Omer, 2014. "Optimal consumption under uncertainty, liquidity constraints, and bounded rationality," Journal of Economic Dynamics and Control, Elsevier, vol. 39(C), pages 237-254.
    2. Chase Coleman & Spencer Lyon & Lilia Maliar & Serguei Maliar, 2021. "Matlab, Python, Julia: What to Choose in Economics?," Computational Economics, Springer;Society for Computational Economics, vol. 58(4), pages 1263-1288, December.
    3. Arellano, Cristina & Maliar, Lilia & Maliar, Serguei & Tsyrennikov, Viktor, 2016. "Envelope condition method with an application to default risk models," Journal of Economic Dynamics and Control, Elsevier, vol. 69(C), pages 436-459.
    4. Ayse Kabukcuoglu & Enrique Martínez García, 2016. "The market resources method for solving dynamic optimization problems," Globalization Institute Working Papers 274, Federal Reserve Bank of Dallas.
    5. Adrian Peralta-Alva & Manuel S. Santos, 2012. "Analysis of numerical errors," Working Papers 2012-062, Federal Reserve Bank of St. Louis.
    6. Fernández-Villaverde, J. & Rubio-Ramírez, J.F. & Schorfheide, F., 2016. "Solution and Estimation Methods for DSGE Models," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 527-724, Elsevier.
    7. Hassan Dadashi, 2018. "Optimal investment-consumption problem: post-retirement with minimum guarantee," Papers 1803.00611, arXiv.org, revised Aug 2020.
    8. Johannes Muhle-Karbe & Max Reppen & H. Mete Soner, 2016. "A Primer on Portfolio Choice with Small Transaction Costs," Papers 1612.01302, arXiv.org, revised May 2017.
    9. Festa, Adriano, 2018. "Domain decomposition based parallel Howard’s algorithm," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 147(C), pages 121-139.
    10. Andrew Ching & Susumu Imai & Masakazu Ishihara & Neelam Jain, 2012. "A practitioner’s guide to Bayesian estimation of discrete choice dynamic programming models," Quantitative Marketing and Economics (QME), Springer, vol. 10(2), pages 151-196, June.
    11. Mercedes Esteban-Bravo & Jose M. Vidal-Sanz & Gökhan Yildirim, 2014. "Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition," Marketing Science, INFORMS, vol. 33(5), pages 621-640, September.
    12. Ayşe Kabukçuoğlu & Enrique Martínez-García, 2021. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Computational Economics, Springer;Society for Computational Economics, vol. 58(2), pages 435-460, August.
    13. Elisabetta Carlini & Adriano Festa & Francisco J. Silva & Marie-Therese Wolfram, 2017. "A Semi-Lagrangian Scheme for a Modified Version of the Hughes’ Model for Pedestrian Flow," Dynamic Games and Applications, Springer, vol. 7(4), pages 683-705, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Aruoba, S. Boragan & Fernandez-Villaverde, Jesus & Rubio-Ramirez, Juan F., 2006. "Comparing solution methods for dynamic equilibrium economies," Journal of Economic Dynamics and Control, Elsevier, vol. 30(12), pages 2477-2508, December.
    2. Yongyang Cai & Kenneth Judd & Greg Thain & Stephen Wright, 2015. "Solving Dynamic Programming Problems on a Computational Grid," Computational Economics, Springer;Society for Computational Economics, vol. 45(2), pages 261-284, February.
    3. Kristensen, Dennis & Mogensen, Patrick K. & Moon, Jong Myun & Schjerning, Bertel, 2021. "Solving dynamic discrete choice models using smoothing and sieve methods," Journal of Econometrics, Elsevier, vol. 223(2), pages 328-360.
    4. Mercedes Esteban-Bravo & Jose M. Vidal-Sanz & Gökhan Yildirim, 2014. "Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition," Marketing Science, INFORMS, vol. 33(5), pages 621-640, September.
    5. Barillas, Francisco & Fernandez-Villaverde, Jesus, 2007. "A generalization of the endogenous grid method," Journal of Economic Dynamics and Control, Elsevier, vol. 31(8), pages 2698-2712, August.
    6. Victor Aguirregabiria & Arvind Magesan, "undated". "Soultion and Estimation of Dynamic Discrete Choice Structural Models Using Euler Equations," Working Papers 2016-32, Department of Economics, University of Calgary, revised 24 May 2016.
    7. Victor Aguirregabiria & Gustavo Vicentini, 2006. "Dynamic Spatial Competition Between Multi-Store Firms," Working Papers tecipa-253, University of Toronto, Department of Economics.
    8. Gamba, Andrea & Tesser, Matteo, 2009. "Structural estimation of real options models," Journal of Economic Dynamics and Control, Elsevier, vol. 33(4), pages 798-816, April.
    9. Ayşe Kabukçuoğlu & Enrique Martínez-García, 2021. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Computational Economics, Springer;Society for Computational Economics, vol. 58(2), pages 435-460, August.
    10. Yongyang Cai & Kenneth Judd, 2015. "Dynamic programming with Hermite approximation," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 81(3), pages 245-267, June.
    11. Arellano, Cristina & Maliar, Lilia & Maliar, Serguei & Tsyrennikov, Viktor, 2016. "Envelope condition method with an application to default risk models," Journal of Economic Dynamics and Control, Elsevier, vol. 69(C), pages 436-459.
    12. Harikesh Nair, 2007. "Intertemporal price discrimination with forward-looking consumers: Application to the US market for console video-games," Quantitative Marketing and Economics (QME), Springer, vol. 5(3), pages 239-292, September.
    13. Todd R. Stinebrickner, 2000. "Serially correlated variables in dynamic, discrete choice models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(6), pages 595-624.
    14. Patrick Bajari & Jeremy T. Fox & Kyoo il Kim & Stephen P. Ryan, 2009. "A Simple Nonparametric Estimator for the Distribution of Random Coefficients," NBER Working Papers 15210, National Bureau of Economic Research, Inc.
    15. Maria Casanova-Rivas, 2008. "Dynamic Complementarities: A Computational and Empirical Analysis of Couples' Retirement Decisions," 2008 Meeting Papers 1073, Society for Economic Dynamics.
    16. Andreas Lanz & Gregor Reich & Ole Wilms, 2022. "Adaptive grids for the estimation of dynamic models," Quantitative Marketing and Economics (QME), Springer, vol. 20(2), pages 179-238, June.
    17. Nikolaj Malchow-Møller & Michael Svarer, 2003. "Estimation of the multinomial logit model with random effects," Applied Economics Letters, Taylor & Francis Journals, vol. 10(7), pages 389-392.
    18. Alexei Onatski & Noah Williams, 2003. "Modeling Model Uncertainty," Journal of the European Economic Association, MIT Press, vol. 1(5), pages 1087-1122, September.
    19. Heer Burkhard & Maußner Alfred, 2011. "Value Function Iteration as a Solution Method for the Ramsey Model," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(4), pages 494-515, August.
    20. Joao Macieira, 2010. "Oblivious Equilibrium in Dynamic Discrete Games," 2010 Meeting Papers 680, Society for Economic Dynamics.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:asu:wpaper:2133377. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Steve Salik (email available below). General contact details of provider: https://edirc.repec.org/data/deasuus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.