A Comparison of Policy Iteration Methods for Solving Continuous-State, Infinite-Horizon Markovian Decision Problems Using Random, Quasi-random, and Deterministic Discretizations

Author

Listed:

John Rust
(Department of Economics, Yale University)

Registered:

John Philip Rust

Abstract

This paper compares the performance of the Howard (1960) policy iteration algorithm for infinite-horizon continuous-state Markovian decision processes (MDPs) using alternative random, quasi-random, and deterministic discretizations of the state space, or grids. Each grid corresponds to an embedded finite-state MDP whose solution is used to approximate the solution to the original continuous-state Markovian decision process. I extend a result of Rust (1997) to show that policy iteration using random grids succeeds in breaking the curse of dimensionality involved in approximating the solution to a class of continuous-state, discrete-action MDPs known as discrete decision processes (DDPs). I compare this "random policy iteration algorithm" (RPI) with policy iteration algorithms using deterministically chosen grids, including uniform grids and quadrature grids, both of which are subject to the curse of dimensionality. I also compare the RPI algorithm to deterministic policy iteration algorithms based on quasi-random or "low discrepancy" grids such as the Sobol' and Tezuka sequences.
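
The idea of solving an embedded finite-state MDP on a random grid can be illustrated with a minimal sketch. It is not the paper's implementation. It assumes a one-dimensional state space [0, 1], uniform random grid points, and an embedded MDP formed by evaluating a transition density at the grid points and renormalizing each row to sum to one. The toy replacement problem (`utility`, `density`, the two actions, and all parameter values) is hypothetical and chosen only to make the example runnable.

```python
import math
import random


def solve_linear(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        s = sum(M[i][c] * x[c] for c in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


def build_embedded_mdp(grid, actions, utility, density):
    """Finite MDP on the grid: row-normalized densities and utilities."""
    P, u = {}, {}
    for a in actions:
        rows = []
        for s in grid:
            w = [density(t, s, a) for t in grid]
            total = sum(w)
            rows.append([x / total for x in w])
        P[a] = rows
        u[a] = [utility(s, a) for s in grid]
    return P, u


def policy_iteration(P, u, actions, beta, max_iter=100):
    """Howard-style policy iteration on a finite MDP."""
    n = len(u[actions[0]])
    policy = [actions[0]] * n
    for _ in range(max_iter):
        # Policy evaluation: solve (I - beta * P_pi) V = u_pi exactly.
        A = [[(1.0 if i == j else 0.0) - beta * P[policy[i]][i][j]
              for j in range(n)] for i in range(n)]
        V = solve_linear(A, [u[policy[i]][i] for i in range(n)])
        # Policy improvement: greedy action at each grid point.
        new_policy = []
        for i in range(n):
            q = {a: u[a][i] + beta * sum(P[a][i][j] * V[j] for j in range(n))
                 for a in actions}
            best = max(actions, key=q.get)
            if q[policy[i]] >= q[best] - 1e-12:
                best = policy[i]  # keep the current action on ties
            new_policy.append(best)
        if new_policy == policy:
            break  # policy is stable, so V is its exact value on the grid
        policy = new_policy
    return V, policy


# Hypothetical toy problem: action 0 keeps a machine, action 1 replaces it.
def utility(s, a):
    return -2.0 * s if a == 0 else -1.0


def density(t, s, a):
    # Unnormalized Gaussian kernel; rows are renormalized on the grid.
    mean = min(1.0, s + 0.1) if a == 0 else 0.0
    return math.exp(-0.5 * ((t - mean) / 0.1) ** 2)


random.seed(0)
grid = [random.random() for _ in range(40)]  # random grid on [0, 1]
actions = [0, 1]
beta = 0.95
P, u = build_embedded_mdp(grid, actions, utility, density)
V, policy = policy_iteration(P, u, actions, beta)
```

Replacing the `grid` line with evenly spaced points or a low-discrepancy sequence would give the deterministic and quasi-random variants that the abstract compares, with the rest of the sketch unchanged.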

Suggested Citation

John Rust, 1997. "A Comparison of Policy Iteration Methods for Solving Continuous-State, Infinite-Horizon Markovian Decision Problems Using Random, Quasi-random, and Deterministic Discretizations," Computational Economics 9704001, University Library of Munich, Germany.

Handle: RePEc:wpa:wuwpco:9704001
Note: TeX file, Postscript version submitted, 50 pages

References listed on IDEAS

Rust, John, 1985. "Stationary Equilibrium in a Market for Durable Assets," Econometrica, Econometric Society, vol. 53(4), pages 783-805, July.
- Rust, John, 1984. "Stationary Equilibrium In A Market For Durable Assets," SSRI Workshop Series 292594, University of Wisconsin-Madison, Social Systems Research Institute.
Tauchen, George & Hussey, Robert, 1991. "Quadrature-Based Methods for Obtaining Approximate Solutions to Nonlinear Asset Pricing Models," Econometrica, Econometric Society, vol. 59(2), pages 371-396, March.
Anderson, Evan W. & McGrattan, Ellen R. & Hansen, Lars Peter & Sargent, Thomas J., 1996. "Mechanics of forming and estimating dynamic linear economies," Handbook of Computational Economics, in: H. M. Amman & D. A. Kendrick & J. Rust (ed.), Handbook of Computational Economics, edition 1, volume 1, chapter 4, pages 171-252, Elsevier.
- Lars Peter Hansen & Ellen R. McGrattan & Thomas J. Sargent, 1994. "Mechanics of forming and estimating dynamic linear economies," Staff Report 182, Federal Reserve Bank of Minneapolis.
- Evan W. Anderson & Lars Peter Hansen & Ellen R. McGrattan & Thomas J. Sargent, 1995. "On the mechanics of forming and estimating dynamic linear economies," Staff Report 198, Federal Reserve Bank of Minneapolis.
Tauchen, George, 1990. "Solving the Stochastic Growth Model by Using Quadrature Methods and Value-Function Iterations," Journal of Business & Economic Statistics, American Statistical Association, vol. 8(1), pages 49-51, January.
Keane, Michael P & Wolpin, Kenneth I, 1994. "The Solution and Estimation of Discrete Choice Dynamic Programming Models by Simulation and Interpolation: Monte Carlo Evidence," The Review of Economics and Statistics, MIT Press, vol. 76(4), pages 648-672, November.
- Michael P. Keane & Kenneth I. Wolpin, 1994. "The solution and estimation of discrete choice dynamic programming models by simulation and interpolation: Monte Carlo evidence," Staff Report 181, Federal Reserve Bank of Minneapolis.
John Rust, 1997. "Using Randomization to Break the Curse of Dimensionality," Econometrica, Econometric Society, vol. 65(3), pages 487-516, May.
- Rust, J., 1994. "Using Randomization to Break the Curse of Dimensionality," Working papers 9429, Wisconsin Madison - Social Systems.
- John Rust & Department of Economics & University of Wisconsin, 1994. "Using Randomization to Break the Curse of Dimensionality," Computational Economics 9403001, University Library of Munich, Germany, revised 19 Nov 1996.
Ariel Pakes & Paul McGuire, 1997. "Stochastic Algorithms for Dynamic Models: Markov Perfect Equilibrium, and the 'Curse' of Dimensionality," Cowles Foundation Discussion Papers 1144, Cowles Foundation for Research in Economics, Yale University.
Judd, Kenneth L., 1996. "Approximation, perturbation, and projection methods in economic analysis," Handbook of Computational Economics, in: H. M. Amman & D. A. Kendrick & J. Rust (ed.), Handbook of Computational Economics, edition 1, volume 1, chapter 12, pages 509-585, Elsevier.
Martin L. Puterman & Shelby L. Brumelle, 1979. "On the Convergence of Policy Iteration in Stationary Dynamic Programming," Mathematics of Operations Research, INFORMS, vol. 4(1), pages 60-69, February.
Hans M. Amman & David A. Kendrick, . "Computational Economics," Online economics textbooks, SUNY-Oswego, Department of Economics, number comp1, December.
Rust, John, 1986. "When Is It Optimal to Kill Off the Market for Used Durable Goods?," Econometrica, Econometric Society, vol. 54(1), pages 65-86, January.

Citations

Cited by:

Santos, Manuel S., 1998. "Accuracy of numerical solutions using the eulers equation residuals," UC3M Working papers. Economics 4157, Universidad Carlos III de Madrid. Departamento de Economía.
Reiter, Michael, 1999. "Solving higher-dimensional continuous-time stochastic control problems by value function regression," Journal of Economic Dynamics and Control, Elsevier, vol. 23(9-10), pages 1329-1353, September.
- Michael Reiter, 1997. "Solving higher-dimensional continuous time stochastic control problems by value function regression," Economics Working Papers 299, Department of Economics and Business, Universitat Pompeu Fabra, revised Jun 1998.
Manuel S. Santos, 2000. "Accuracy of Numerical Solutions using the Euler Equation Residuals," Econometrica, Econometric Society, vol. 68(6), pages 1377-1402, November.
Kristensen, Dennis & Mogensen, Patrick K. & Moon, Jong Myun & Schjerning, Bertel, 2021. "Solving dynamic discrete choice models using smoothing and sieve methods," Journal of Econometrics, Elsevier, vol. 223(2), pages 328-360.
- Dennis Kristensen & Patrick K. Mogensen & Jong-Myun Moon & Bertel Schjerning, 2019. "Solving dynamic discrete choice models using smoothing and sieve methods," CeMMAP working papers CWP15/19, Centre for Microdata Methods and Practice, Institute for Fiscal Studies.
- Dennis Kristensen & Patrick K. Mogensen & Jong Myun Moon & Bertel Schjerning, 2019. "Solving Dynamic Discrete Choice Models Using Smoothing and Sieve Methods," Papers 1904.05232, arXiv.org, revised Feb 2020.
Ronald Goettler & Ron Shachar, 2000. "Estimating Product Characteristics and Spatial Competition in the Network Television Industry," Econometric Society World Congress 2000 Contributed Papers 1691, Econometric Society.
Hugo Benitez-Silva, 2000. "A Dynamic Model of Labor Supply, Consumption/Saving, and Annuity Decisions under Uncertainty," Department of Economics Working Papers 00-06, Stony Brook University, Department of Economics.
- Hugo Benitez-Silva, 2000. "A Dynamic Model Of Labor Supply, Consumption/Saving, And Annuity Decisions Under Uncertainty," Computing in Economics and Finance 2000 128, Society for Computational Economics.
John Rust & Joseph Traub & Henryk Wozniakowski, 1999. "No Curse of Dimensionality for Contraction Fixed Points Even in the Worst Case," Computational Economics 9902001, University Library of Munich, Germany.
Hall, George & Rust, John, 2000. "An empirical model of inventory investment by durable commodity intermediaries," Carnegie-Rochester Conference Series on Public Policy, Elsevier, vol. 52(1), pages 171-214, June.
- George Hall & John Rust, 1999. "An Empirical Model of Inventory Investment by Durable Commodity Intermediaries," Macroeconomics 9904005, University Library of Munich, Germany.
- George J. Hall & John Rust, 1999. "An Empirical Model of Inventory Investment by Durable Commodity Intermediaries," Cowles Foundation Discussion Papers 1228, Cowles Foundation for Research in Economics, Yale University.
Victor Aguirregabiria & Pedro Mira, 2002. "Swapping the Nested Fixed Point Algorithm: A Class of Estimators for Discrete Markov Decision Models," Econometrica, Econometric Society, vol. 70(4), pages 1519-1543, July.
- Victor Aguirregabiria & Pedro Mira, 1999. "Swapping the Nested Fixed-Point Algorithm: a Class of Estimators for Discrete Markov Decision Models," Computing in Economics and Finance 1999 332, Society for Computational Economics.
- Víctor Aguirregabiria & Pedro Mira, 1999. "Swapping the Nested Fixed Point Algorithm: A Class of Estimators for Discrete Markov Decision Models," Working Papers wp1999_9904, CEMFI.
Hugo Benitez-Silva, 2000. "A Joint Model of Labor Supply and Consumption Decisions Under Uncertainty," Econometric Society World Congress 2000 Contributed Papers 0196, Econometric Society.
Jacek B. Krawczyk, 2000. "A Markovian Approximated Solution To A Portfolio Management Problem," Computing in Economics and Finance 2000 233, Society for Computational Economics.
Hugo Benítez-Silva, 2003. "The Annuity Puzzle Revisited," Working Papers wp055, University of Michigan, Michigan Retirement Research Center.

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

John Rust, 1997. "Using Randomization to Break the Curse of Dimensionality," Econometrica, Econometric Society, vol. 65(3), pages 487-516, May.
- Rust, J., 1994. "Using Randomization to Break the Curse of Dimensionality," Working papers 9429, Wisconsin Madison - Social Systems.
- John Rust & Department of Economics & University of Wisconsin, 1994. "Using Randomization to Break the Curse of Dimensionality," Computational Economics 9403001, University Library of Munich, Germany, revised 19 Nov 1996.
Willi Semmler & Lars Grüne, 2004. "Asset Pricing with Delayed Consumption Decisions," Computing in Economics and Finance 2004 59, Society for Computational Economics.
Grune, Lars & Semmler, Willi, 2004. "Using dynamic programming with adaptive grid scheme for optimal control problems in economics," Journal of Economic Dynamics and Control, Elsevier, vol. 28(12), pages 2427-2456, December.
John Rust & Joseph Traub & Henryk Wozniakowski, 1999. "No Curse of Dimensionality for Contraction Fixed Points Even in the Worst Case," Computational Economics 9902001, University Library of Munich, Germany.
Lars Grüne & Willi Semmler, 2007. "Asset pricing with dynamic programming," Computational Economics, Springer;Society for Computational Economics, vol. 29(3), pages 233-265, May.
Gamba, Andrea & Tesser, Matteo, 2009. "Structural estimation of real options models," Journal of Economic Dynamics and Control, Elsevier, vol. 33(4), pages 798-816, April.
Ayşe Kabukçuoğlu & Enrique Martínez-García, 2021. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Computational Economics, Springer;Society for Computational Economics, vol. 58(2), pages 435-460, August.
- Ayse Kabukcuoglu & Enrique Martínez García, 2020. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Globalization Institute Working Papers 396, Federal Reserve Bank of Dallas.
Todd R. Stinebrickner, 2000. "Serially correlated variables in dynamic, discrete choice models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(6), pages 595-624.
Geweke, John F. & Horowitz, Joel L. & Pesaran, M. Hashem, 2006. "Econometrics: A Bird's Eye View," IZA Discussion Papers 2458, Institute of Labor Economics (IZA).
- John Geweke & Joel Horowitz & M. Hashem Pesaran, 2006. "Econometrics: A Bird’s Eye View," CESifo Working Paper Series 1870, CESifo.
- Geweke, J. & Joel Horowitz & Pesaran, M.H., 2006. "Econometrics: A Bird’s Eye View," Cambridge Working Papers in Economics 0655, Faculty of Economics, University of Cambridge.
Bound, John & Stinebrickner, Todd & Waidmann, Timothy, 2010. "Health, economic resources and the work decisions of older men," Journal of Econometrics, Elsevier, vol. 156(1), pages 106-129, May.
- John Bound & Todd Stinebrickner & Timothy Waidmann, 2007. "Health, Economic Resources and the Work Decisions of Older Men," University of Western Ontario, Economic Policy Research Institute Working Papers 20076, University of Western Ontario, Economic Policy Research Institute.
- John Bound & Todd Stinebrickner & Timothy Waidmann, 2007. "Health, Economic Resources and the Work Decisions of Older Men," NBER Working Papers 13657, National Bureau of Economic Research, Inc.
Alemdar, Nedim M. & Sirakaya, Sibel & Husseinov, Farhad, 2006. "Optimal time aggregation of infinite horizon control problems," Journal of Economic Dynamics and Control, Elsevier, vol. 30(4), pages 569-593, April.
Masakazu Ishihara & Andrew T. Ching, 2019. "Dynamic Demand for New and Used Durable Goods Without Physical Depreciation: The Case of Japanese Video Games," Marketing Science, INFORMS, vol. 38(3), pages 392-416, May.
- Andrew Ching & Masakazu Ishihara, 2014. "Dynamic Demand for New and Used Durable Goods without Physical Depreciation: The Case of Japanese Video Games," 2014 Meeting Papers 782, Society for Economic Dynamics.
Reiter, Michael, 1999. "Solving higher-dimensional continuous-time stochastic control problems by value function regression," Journal of Economic Dynamics and Control, Elsevier, vol. 23(9-10), pages 1329-1353, September.
- Michael Reiter, 1997. "Solving higher-dimensional continuous time stochastic control problems by value function regression," Economics Working Papers 299, Department of Economics and Business, Universitat Pompeu Fabra, revised Jun 1998.
Anderson, Evan W. & Hansen, Lars Peter & Sargent, Thomas J., 2012. "Small noise methods for risk-sensitive/robust economies," Journal of Economic Dynamics and Control, Elsevier, vol. 36(4), pages 468-500.
John Bound & Todd Stinebrickner & Timothy Waidman, 2004. "Using a Structural Retirement Model to Simulate the Effect of Changes to the OASDI and Medicare Programs," Working Papers wp091, University of Michigan, Michigan Retirement Research Center.
Jesús Fernández-Villaverde & Juan F. Rubio-Ramirez, 2001. "Comparing dynamic equilibrium economies to data," FRB Atlanta Working Paper 2001-23, Federal Reserve Bank of Atlanta.
- Jesús Fernández-Villaverde & Juan F. Rubio, 2003. "Comparing Dynamic Equilibrium Economies to Data," Levine's Working Paper Archive 506439000000000309, David K. Levine.
Nikolaj Malchow-Møller & Michael Svarer, 2003. "Estimation of the multinomial logit model with random effects," Applied Economics Letters, Taylor & Francis Journals, vol. 10(7), pages 389-392.
John Stachurski, 2009. "Economic Dynamics: Theory and Computation," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012774, December.
Richard Dennis & Tatiana Kirsanova, 2010. "Expectations traps and coordination failures: selecting among multiple discretionary equilibria," Working Paper Series 2010-02, Federal Reserve Bank of San Francisco.
- Dennis, Richard & Kirsanova, Tatiana, 2010. "Expectations Traps and Coordination Failures: Selecting among Multiple Discretionary Equilibria," MPRA Paper 24616, University Library of Munich, Germany.
- Richard Dennis & Tatiana Kirsanova, 2010. "Expectations Traps and Coordination Failures:Selecting Among Multiple Discretionary Equilibria," CAMA Working Papers 2010-02, Centre for Applied Macroeconomic Analysis, Crawford School of Public Policy, The Australian National University.
Ji, Yongjie & Rabotyagov, Sergey & Kling, Catherine L., 2014. "Crop Choice and Rotational Effects: A Dynamic Model of Land Use in Iowa in Recent Years," 2014 Annual Meeting, July 27-29, 2014, Minneapolis, Minnesota 170366, Agricultural and Applied Economics Association.

More about this item

JEL classification:

C8 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs

