Convergence Properties of Policy Iteration

Authors

Manuel Santos (W. P. Carey School of Business, Department of Economics)
John Rust (University of Maryland)

Abstract

This paper analyzes the asymptotic convergence properties of policy iteration in a class of stationary, infinite-horizon Markovian decision problems that arise in optimal growth theory. These problems have continuous state and control variables, and must therefore be discretized to compute an approximate solution. The discretization converts a potentially infinite-dimensional fixed-point problem into a finite-dimensional problem defined on a finite grid of points in the state space, and it may thus render inapplicable known convergence results for policy iteration such as those of Puterman and Brumelle (1979). Under certain regularity conditions, we prove that for piecewise linear interpolation, policy iteration converges quadratically, i.e. the sequence of errors $e_n = |V_n - V^*|$ (where $V_n$ is the approximate value function produced by the $n$th policy iteration step) satisfies $e_{n+1} \le L e_n^2$ for all $n$. We show how the constant $L$ depends on the grid size of the discretization. Also, under more general conditions we establish that convergence is superlinear. We illustrate the theoretical results with numerical experiments that compare the performance of policy iteration and the method of successive approximations. The quantitative results are consistent with theoretical predictions.
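The comparison described in the abstract can be illustrated with a minimal sketch. It is not the authors' code or experimental setup. It assumes a deterministic growth model with log utility, Cobb-Douglas production and full depreciation, and it restricts next-period capital to the grid instead of using the piecewise linear interpolation studied in the paper. The function names, grid bounds, and parameter values (`n`, `alpha`, `beta`) are all hypothetical choices made for illustration.

```python
import numpy as np

def build_model(n=60, alpha=0.3, beta=0.95):
    # Assumed grid around the steady state k* = (alpha*beta)^(1/(1-alpha)).
    k_ss = (alpha * beta) ** (1.0 / (1.0 - alpha))
    grid = np.linspace(0.2 * k_ss, 2.0 * k_ss, n)
    # cons[i, j] = consumption when moving from k_i to k_j (full depreciation).
    cons = grid[:, None] ** alpha - grid[None, :]
    util = np.where(cons > 0, np.log(np.maximum(cons, 1e-300)), -np.inf)
    return util, beta

def evaluate(policy, util, beta):
    # Policy evaluation: solve (I - beta * P_pi) v = u_pi exactly.
    n = util.shape[0]
    rows = np.arange(n)
    P = np.zeros((n, n))
    P[rows, policy] = 1.0
    return np.linalg.solve(np.eye(n) - beta * P, util[rows, policy])

def policy_iteration(util, beta, max_iter=100):
    policy = np.argmax(util, axis=1)  # myopic initial policy
    values = []
    for _ in range(max_iter):
        v = evaluate(policy, util, beta)
        values.append(v)
        # Policy improvement: greedy choice with respect to the current v.
        new_policy = np.argmax(util + beta * v[None, :], axis=1)
        if np.array_equal(new_policy, policy):
            break
        policy = new_policy
    return v, policy, values

def successive_approximations(util, beta, tol=1e-8, max_iter=10_000):
    # Iterate the Bellman operator until the sup-norm change falls below tol.
    v = np.zeros(util.shape[0])
    for it in range(1, max_iter + 1):
        v_new = np.max(util + beta * v[None, :], axis=1)
        if np.max(np.abs(v_new - v)) < tol:
            return v_new, it
        v = v_new
    return v, max_iter

util, beta = build_model()
v_pi, policy, values = policy_iteration(util, beta)
v_sa, sa_steps = successive_approximations(util, beta)

# Sup-norm errors e_n = |V_n - V*| along the policy iteration path.
errors = [float(np.max(np.abs(v - v_pi))) for v in values]
print("policy iteration steps:", len(values))
print("successive approximation steps:", sa_steps)
print("policy iteration errors:", [f"{e:.3e}" for e in errors])
```

Because the control set is finite in this sketch, policy iteration stops after finitely many steps, so the printed errors only hint at the fast convergence the paper proves for the interpolated problem. The ratio of `len(values)` to `sa_steps` gives a rough sense of the gap between the two methods when the discount factor is close to one.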

Suggested Citation

Manuel Santos & John Rust, "undated". "Convergence Properties of Policy Iteration," Working Papers 2133377, Department of Economics, W. P. Carey School of Business, Arizona State University.

Handle: RePEc:asu:wpaper:2133377

References listed on IDEAS

Rust, John, 1987. "Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher," Econometrica, Econometric Society, vol. 55(5), pages 999-1033, September.
Kenneth L. Judd, 1998. "Numerical Methods in Economics," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262100711, December.
J. Rust & J. F. Traub & H. Wozniakowski, 2002. "Is There a Curse of Dimensionality for Contraction Fixed Points in the Worst Case?," Econometrica, Econometric Society, vol. 70(1), pages 285-329, January.
Hugo Benitez-Silva & John Rust & Gunter Hitsch & Giorgio Pauletto & George Hall, 2000. "A Comparison Of Discrete And Parametric Methods For Continuous-State Dynamic Programming Problems," Computing in Economics and Finance 2000 24, Society for Computational Economics.
John Rust, 1997. "Using Randomization to Break the Curse of Dimensionality," Econometrica, Econometric Society, vol. 65(3), pages 487-516, May.
- Rust, J., 1994. "Using Randomization to Break the Curse of Dimensionality," Working papers 9429, Wisconsin Madison - Social Systems.
- John Rust (Department of Economics, University of Wisconsin), 1994. "Using Randomization to Break the Curse of Dimensionality," Computational Economics 9403001, University Library of Munich, Germany, revised 19 Nov 1996.
Santos, Manuel S., 1999. "Numerical solution of dynamic economic models," Handbook of Macroeconomics, in: J. B. Taylor & M. Woodford (ed.), Handbook of Macroeconomics, edition 1, volume 1, chapter 5, pages 311-386, Elsevier.
Martin L. Puterman & Shelby L. Brumelle, 1979. "On the Convergence of Policy Iteration in Stationary Dynamic Programming," Mathematics of Operations Research, INFORMS, vol. 4(1), pages 60-69, February.

Citations

Cited by:

Ozak, Omer, 2014. "Optimal consumption under uncertainty, liquidity constraints, and bounded rationality," Journal of Economic Dynamics and Control, Elsevier, vol. 39(C), pages 237-254.
- Ömer Özak, 2012. "Optimal consumption under uncertainty, liquidity constraints, and bounded rationality," Departmental Working Papers 1204, Southern Methodist University, Department of Economics.
- Ömer Özak, 2013. "Optimal consumption under uncertainty, liquidity constraints, and bounded rationality," Departmental Working Papers 1307, Southern Methodist University, Department of Economics.
Chase Coleman & Spencer Lyon & Lilia Maliar & Serguei Maliar, 2021. "Matlab, Python, Julia: What to Choose in Economics?," Computational Economics, Springer;Society for Computational Economics, vol. 58(4), pages 1263-1288, December.
- Coleman, Chase & Lyon, Spencer & Maliar, Serguei, 2018. "Matlab, Python, Julia: What to Choose in Economics?," CEPR Discussion Papers 13210, C.E.P.R. Discussion Papers.
Arellano, Cristina & Maliar, Lilia & Maliar, Serguei & Tsyrennikov, Viktor, 2016. "Envelope condition method with an application to default risk models," Journal of Economic Dynamics and Control, Elsevier, vol. 69(C), pages 436-459.
- Cristina Arellano & Lilia Maliar & Serguei Maliar & Viktor Tsyrennikov, 2014. "Envelope Condition Method with an Application to Default Risk Models," BYU Macroeconomics and Computational Laboratory Working Paper Series 2014-04, Brigham Young University, Department of Economics, BYU Macroeconomics and Computational Laboratory.
- Viktor Tsyrennikov & Serguei Maliar & Lilia Maliar & Cristina Arellano, 2015. "Envelope Condition Method with an Application to Default Risk Models," 2015 Meeting Papers 1239, Society for Economic Dynamics.
Ayse Kabukcuoglu & Enrique Martínez García, 2016. "The market resources method for solving dynamic optimization problems," Globalization Institute Working Papers 274, Federal Reserve Bank of Dallas.
- Ayse Kabukcuoglu & Enrique Martínez-García, 2016. "The Market Resources Method for Solving Dynamic Optimization Problems," Koç University-TUSIAD Economic Research Forum Working Papers 1607, Koc University-TUSIAD Economic Research Forum.
Adrian Peralta-Alva & Manuel S. Santos, 2012. "Analysis of numerical errors," Working Papers 2012-062, Federal Reserve Bank of St. Louis.
- Manuel S. Santos & Adrian Peralta-Alva, 2012. "Analysis of Numerical Errors," Working Papers 2012-6, University of Miami, Department of Economics.
Fernández-Villaverde, J. & Rubio-Ramírez, J.F. & Schorfheide, F., 2016. "Solution and Estimation Methods for DSGE Models," Handbook of Macroeconomics, in: J. B. Taylor & Harald Uhlig (ed.), Handbook of Macroeconomics, edition 1, volume 2, chapter 0, pages 527-724, Elsevier.
- Jesus Fernandez-Villaverde & Juan Rubio-Ramírez & Frank Schorfheide, 2015. "Solution and Estimation Methods for DSGE Models," PIER Working Paper Archive 15-042, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania, revised 09 Dec 2015.
- Jesús Fernández-Villaverde & Juan F. Rubio Ramírez & Frank Schorfheide, 2016. "Solution and Estimation Methods for DSGE Models," NBER Working Papers 21862, National Bureau of Economic Research, Inc.
- Rubio-Ramírez, Juan Francisco & Schorfheide, Frank & Fernández-Villaverde, Jesús, 2015. "Solution and Estimation Methods for DSGE Models," CEPR Discussion Papers 11032, C.E.P.R. Discussion Papers.
Hassan Dadashi, 2018. "Optimal investment-consumption problem: post-retirement with minimum guarantee," Papers 1803.00611, arXiv.org, revised Aug 2020.
Johannes Muhle-Karbe & Max Reppen & H. Mete Soner, 2016. "A Primer on Portfolio Choice with Small Transaction Costs," Papers 1612.01302, arXiv.org, revised May 2017.
Festa, Adriano, 2018. "Domain decomposition based parallel Howard’s algorithm," Mathematics and Computers in Simulation (MATCOM), Elsevier, vol. 147(C), pages 121-139.
Andrew Ching & Susumu Imai & Masakazu Ishihara & Neelam Jain, 2012. "A practitioner’s guide to Bayesian estimation of discrete choice dynamic programming models," Quantitative Marketing and Economics (QME), Springer, vol. 10(2), pages 151-196, June.
- Andrew Ching & Susumu Imai & Masakazu Ishihara & Neelam Jain, 2009. "A Practitioner's Guide To Bayesian Estimation Of Discrete Choice Dynamic Programming Models," Working Paper 1201, Economics Department, Queen's University.
Mercedes Esteban-Bravo & Jose M. Vidal-Sanz & Gökhan Yildirim, 2014. "Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition," Marketing Science, INFORMS, vol. 33(5), pages 621-640, September.
- Esteban-Bravo, Mercedes & Vidal-Sanz, Jose M. & Yildirim, Gökhan, 2012. "Valuing customer portfolios with endogenous mass-and-direct-marketing interventions using a stochastic dynamic programming decomposition," DEE - Working Papers. Business Economics. WB wb121304, Universidad Carlos III de Madrid. Departamento de Economía de la Empresa.
Ayşe Kabukçuoğlu & Enrique Martínez-García, 2021. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Computational Economics, Springer;Society for Computational Economics, vol. 58(2), pages 435-460, August.
- Ayse Kabukcuoglu & Enrique Martínez García, 2020. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Globalization Institute Working Papers 396, Federal Reserve Bank of Dallas.
Elisabetta Carlini & Adriano Festa & Francisco J. Silva & Marie-Therese Wolfram, 2017. "A Semi-Lagrangian Scheme for a Modified Version of the Hughes’ Model for Pedestrian Flow," Dynamic Games and Applications, Springer, vol. 7(4), pages 683-705, December.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Aruoba, S. Boragan & Fernandez-Villaverde, Jesus & Rubio-Ramirez, Juan F., 2006. "Comparing solution methods for dynamic equilibrium economies," Journal of Economic Dynamics and Control, Elsevier, vol. 30(12), pages 2477-2508, December.
- S. Boragan Aruoba & Jesus Fernandez-Villaverde & Juan F. Rubio-Ramirez, 2003. "Comparing Solution Methods for Dynamic Equilibrium Economies," PIER Working Paper Archive 04-003, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
- S. B. Aruoba & Jesús Fernández-Villaverde & Juan F. Rubio-Ramirez, 2005. "Comparing Solution Methods for Dynamic Equilibrium Economies," Levine's Bibliography 122247000000000855, UCLA Department of Economics.
- S. Boragan Aruoba & Jesús Fernández-Villaverde & Juan F. Rubio-Ramirez, 2003. "Comparing solution methods for dynamic equilibrium economies," FRB Atlanta Working Paper 2003-27, Federal Reserve Bank of Atlanta.
Yongyang Cai & Kenneth Judd & Greg Thain & Stephen Wright, 2015. "Solving Dynamic Programming Problems on a Computational Grid," Computational Economics, Springer;Society for Computational Economics, vol. 45(2), pages 261-284, February.
- Yongyang Cai & Kenneth L. Judd & Greg Thain & Stephen J. Wright, 2013. "Solving Dynamic Programming Problems on a Computational Grid," NBER Working Papers 18714, National Bureau of Economic Research, Inc.
Kristensen, Dennis & Mogensen, Patrick K. & Moon, Jong Myun & Schjerning, Bertel, 2021. "Solving dynamic discrete choice models using smoothing and sieve methods," Journal of Econometrics, Elsevier, vol. 223(2), pages 328-360.
- Dennis Kristensen & Patrick K. Mogensen & Jong Myun Moon & Bertel Schjerning, 2019. "Solving Dynamic Discrete Choice Models Using Smoothing and Sieve Methods," Papers 1904.05232, arXiv.org, revised Feb 2020.
- Dennis Kristensen & Patrick K. Mogensen & Jong-Myun Moon & Bertel Schjerning, 2019. "Solving dynamic discrete choice models using smoothing and sieve methods," CeMMAP working papers CWP15/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
Mercedes Esteban-Bravo & Jose M. Vidal-Sanz & Gökhan Yildirim, 2014. "Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition," Marketing Science, INFORMS, vol. 33(5), pages 621-640, September.
- Esteban-Bravo, Mercedes & Vidal-Sanz, Jose M. & Yildirim, Gökhan, 2012. "Valuing customer portfolios with endogenous mass-and-direct-marketing interventions using a stochastic dynamic programming decomposition," DEE - Working Papers. Business Economics. WB wb121304, Universidad Carlos III de Madrid. Departamento de Economía de la Empresa.
Barillas, Francisco & Fernandez-Villaverde, Jesus, 2007. "A generalization of the endogenous grid method," Journal of Economic Dynamics and Control, Elsevier, vol. 31(8), pages 2698-2712, August.
- Francisco Barillas & Jesús Fernández-Villaverde, 2006. "A Generalization of the Endogenous Grid Method," Levine's Bibliography 122247000000001200, UCLA Department of Economics.
Victor Aguirregabiria & Arvind Magesan, "undated". "Solution and Estimation of Dynamic Discrete Choice Structural Models Using Euler Equations," Working Papers 2016-32, Department of Economics, University of Calgary, revised 24 May 2016.
- Victor Aguirregabiria & Arvind Magesan, 2016. "Solution and Estimation of Dynamic Discrete Choice Structural Models Using Euler Equations," Working Papers tecipa-562, University of Toronto, Department of Economics.
- Aguirregabiria, Victor & Magesan, Arvind, 2016. "Solution and Estimation of Dynamic Discrete Choice Structural Models Using Euler Equations," CEPR Discussion Papers 11300, C.E.P.R. Discussion Papers.
Victor Aguirregabiria & Gustavo Vicentini, 2006. "Dynamic Spatial Competition Between Multi-Store Firms," Working Papers tecipa-253, University of Toronto, Department of Economics.
- Aguirregabiria, Victor & Vicentini, Gustavo, 2014. "Dynamic Spatial Competition Between Multi-Store Firms," CEPR Discussion Papers 10273, C.E.P.R. Discussion Papers.
- Victor Aguirregabiria & Gustavo Vicentini, 2012. "Dynamic Spatial Competition Between Multi-Store Firms," Working Papers tecipa-457, University of Toronto, Department of Economics.
Gamba, Andrea & Tesser, Matteo, 2009. "Structural estimation of real options models," Journal of Economic Dynamics and Control, Elsevier, vol. 33(4), pages 798-816, April.
Ayşe Kabukçuoğlu & Enrique Martínez-García, 2021. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Computational Economics, Springer;Society for Computational Economics, vol. 58(2), pages 435-460, August.
- Ayse Kabukcuoglu & Enrique Martínez García, 2020. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Globalization Institute Working Papers 396, Federal Reserve Bank of Dallas.
Yongyang Cai & Kenneth Judd, 2015. "Dynamic programming with Hermite approximation," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 81(3), pages 245-267, June.
- Yongyang Cai & Kenneth L. Judd, 2012. "Dynamic Programming with Hermite Approximation," NBER Working Papers 18540, National Bureau of Economic Research, Inc.
Arellano, Cristina & Maliar, Lilia & Maliar, Serguei & Tsyrennikov, Viktor, 2016. "Envelope condition method with an application to default risk models," Journal of Economic Dynamics and Control, Elsevier, vol. 69(C), pages 436-459.
- Cristina Arellano & Lilia Maliar & Serguei Maliar & Viktor Tsyrennikov, 2014. "Envelope Condition Method with an Application to Default Risk Models," BYU Macroeconomics and Computational Laboratory Working Paper Series 2014-04, Brigham Young University, Department of Economics, BYU Macroeconomics and Computational Laboratory.
- Viktor Tsyrennikov & Serguei Maliar & Lilia Maliar & Cristina Arellano, 2015. "Envelope Condition Method with an Application to Default Risk Models," 2015 Meeting Papers 1239, Society for Economic Dynamics.
- Cristina Arellano & Lilia Maliar & Serguei Maliar & Viktor Tsyrennikov, 2016. "Envelope Condition Method (ECM) in comparison with other solution methods for the neoclassical growth model with inelastic labor supply in "Envelope Condition Method with an Application to Defaul," QM&RBC Codes 203, Quantitative Macroeconomics & Real Business Cycles.
Harikesh Nair, 2007. "Intertemporal price discrimination with forward-looking consumers: Application to the US market for console video-games," Quantitative Marketing and Economics (QME), Springer, vol. 5(3), pages 239-292, September.
- Nair, Harikesh S., 2006. "Intertemporal Price Discrimination with Forward-Looking Consumers: Application to the US Market for Console Video-Games," Research Papers 1947, Stanford University, Graduate School of Business.
Todd R. Stinebrickner, 2000. "Serially correlated variables in dynamic, discrete choice models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(6), pages 595-624.
Patrick Bajari & Jeremy T. Fox & Kyoo il Kim & Stephen P. Ryan, 2009. "A Simple Nonparametric Estimator for the Distribution of Random Coefficients," NBER Working Papers 15210, National Bureau of Economic Research, Inc.
Maria Casanova-Rivas, 2008. "Dynamic Complementarities: A Computational and Empirical Analysis of Couples' Retirement Decisions," 2008 Meeting Papers 1073, Society for Economic Dynamics.
Andreas Lanz & Gregor Reich & Ole Wilms, 2022. "Adaptive grids for the estimation of dynamic models," Quantitative Marketing and Economics (QME), Springer, vol. 20(2), pages 179-238, June.
Nikolaj Malchow-Møller & Michael Svarer, 2003. "Estimation of the multinomial logit model with random effects," Applied Economics Letters, Taylor & Francis Journals, vol. 10(7), pages 389-392.
Alexei Onatski & Noah Williams, 2003. "Modeling Model Uncertainty," Journal of the European Economic Association, MIT Press, vol. 1(5), pages 1087-1122, September.
- Onatski, Alexei & Williams, Noah, 2002. "Modeling model uncertainty," Working Paper Series 169, European Central Bank.
- Alexei Onatski & Noah Williams, 2003. "Modeling Model Uncertainty," NBER Working Papers 9566, National Bureau of Economic Research, Inc.
Heer Burkhard & Maußner Alfred, 2011. "Value Function Iteration as a Solution Method for the Ramsey Model," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(4), pages 494-515, August.
- Burkhard Heer & Alfred Maussner, 2008. "Value Function Iteration as a Solution Method for the Ramsey Model," CESifo Working Paper Series 2278, CESifo.
Joao Macieira, 2010. "Oblivious Equilibrium in Dynamic Discrete Games," 2010 Meeting Papers 680, Society for Economic Dynamics.

More about this item

NEP fields

This paper has been announced in the following NEP Reports:

NEP-CMP-2005-05-14 (Computational Economics)
