
A Comparison of Policy Iteration Methods for Solving Continuous-State, Infinite-Horizon Markovian Decision Problems Using Random, Quasi-random, and Deterministic Discretizations

Author

  • John Rust (Department of Economics, Yale University)

Abstract

This paper compares the performance of the Howard (1960) policy iteration algorithm for infinite-horizon continuous-state Markovian decision processes (MDPs) using alternative random, quasi-random, and deterministic discretizations of the state space, or grids. Each grid corresponds to an embedded finite-state MDP whose solution is used to approximate the solution to the original continuous-state MDP. I extend a result of Rust (1997) to show that policy iteration using random grids succeeds in breaking the curse of dimensionality involved in approximating the solution to a class of continuous-state, discrete-action MDPs known as discrete decision processes (DDPs). I compare this "random policy iteration algorithm" (RPI) with policy iteration algorithms using deterministically chosen grids, including uniform grids and quadrature grids, both of which are subject to the curse of dimensionality. I also compare the RPI algorithm to deterministic policy iteration algorithms based on quasi-random or "low discrepancy" grids such as the Sobol' and Tezuka sequences.
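
The embedded finite-state construction described in the abstract can be illustrated with a minimal Python sketch. Everything specific here is an assumption made for illustration and is not taken from the paper: the one-dimensional state space [0, 1], the two-action payoff function `utility`, the Gaussian-shaped transition `density`, the discount factor `BETA`, and the grid size of 50. Transition weights are evaluated at the grid points and renormalized to sum to one, in the spirit of the random discretization in Rust (1997), and each policy is evaluated by solving its linear system exactly, as in Howard-style policy iteration.

```python
import math
import random

BETA = 0.9  # discount factor (assumed for illustration)

def utility(s, a):
    # Hypothetical payoffs: action 0 ("keep") costs more as s grows,
    # action 1 ("replace") pays a fixed cost.
    return -2.0 * s if a == 0 else -1.0

def density(s_next, s, a):
    # Hypothetical transition kernel on [0, 1]; it may be unnormalized
    # because the weights are renormalized over the grid below.
    center = min(s + 0.1, 1.0) if a == 0 else 0.0
    return math.exp(-((s_next - center) ** 2) / (2 * 0.1 ** 2))

def embedded_mdp(grid):
    # P[a][i][j]: kernel evaluated at grid points, each row scaled to sum to 1.
    P = []
    for a in (0, 1):
        rows = []
        for s in grid:
            w = [density(t, s, a) for t in grid]
            total = sum(w)
            rows.append([x / total for x in w])
        P.append(rows)
    return P

def solve(A, b):
    # Gaussian elimination with partial pivoting, then back substitution.
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        tail = sum(M[r][k] * x[k] for k in range(r + 1, n))
        x[r] = (M[r][n] - tail) / M[r][r]
    return x

def policy_iteration(grid, max_iter=100):
    n = len(grid)
    P = embedded_mdp(grid)
    policy = [0] * n
    V = [0.0] * n
    for _ in range(max_iter):
        # Policy evaluation: solve (I - BETA * P_policy) V = u_policy.
        A = [[(1.0 if i == j else 0.0) - BETA * P[policy[i]][i][j]
              for j in range(n)] for i in range(n)]
        b = [utility(grid[i], policy[i]) for i in range(n)]
        V = solve(A, b)
        # Policy improvement: switch only on a strict gain, to avoid cycling.
        new_policy = []
        for i in range(n):
            q = [utility(grid[i], a)
                 + BETA * sum(P[a][i][j] * V[j] for j in range(n))
                 for a in (0, 1)]
            best = max((0, 1), key=lambda a: q[a])
            keep_current = q[best] <= q[policy[i]] + 1e-12
            new_policy.append(policy[i] if keep_current else best)
        if new_policy == policy:
            break
        policy = new_policy
    return V, policy

random.seed(0)
grid = sorted(random.random() for _ in range(50))  # random grid on [0, 1]
V, policy = policy_iteration(grid)
```

Passing a deterministic grid to the same function, for example `[i / 49 for i in range(50)]`, gives a uniform-grid variant for comparison; a quasi-random grid would only change how `grid` is generated.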

Suggested Citation

  • John Rust, 1997. "A Comparison of Policy Iteration Methods for Solving Continuous-State, Infinite-Horizon Markovian Decision Problems Using Random, Quasi-random, and Deterministic Discretizations," Computational Economics 9704001, University Library of Munich, Germany.
  • Handle: RePEc:wpa:wuwpco:9704001
    Note: TeX file, Postscript version submitted, 50 pages

    Download full text from publisher

    File URL: https://econwpa.ub.uni-muenchen.de/econ-wp/comp/papers/9704/9704001.pdf
    Download Restriction: no

    File URL: https://econwpa.ub.uni-muenchen.de/econ-wp/comp/papers/9704/9704001.ps.gz
    Download Restriction: no

    References listed on IDEAS

    1. John Rust, 1985. "Stationary Equilibrium in a Market for Durable Assets," Econometrica, Econometric Society, vol. 53(4), pages 783-805, July.
    2. George Tauchen & Robert Hussey, 1991. "Quadrature-Based Methods for Obtaining Approximate Solutions to Nonlinear Asset Pricing Models," Econometrica, Econometric Society, vol. 59(2), pages 371-396, March.
    3. Evan W. Anderson & Ellen R. McGrattan & Lars Peter Hansen & Thomas J. Sargent, 1996. "Mechanics of forming and estimating dynamic linear economies," in: H. M. Amman & D. A. Kendrick & J. Rust (ed.), Handbook of Computational Economics, edition 1, volume 1, chapter 4, pages 171-252, Elsevier.
    4. George Tauchen, 1990. "Solving the Stochastic Growth Model by Using Quadrature Methods and Value-Function Iterations," Journal of Business & Economic Statistics, American Statistical Association, vol. 8(1), pages 49-51, January.
    5. Michael P. Keane & Kenneth I. Wolpin, 1994. "The Solution and Estimation of Discrete Choice Dynamic Programming Models by Simulation and Interpolation: Monte Carlo Evidence," The Review of Economics and Statistics, MIT Press, vol. 76(4), pages 648-672, November.
    6. Ariel Pakes & Paul McGuire, 1997. "Stochastic Algorithms for Dynamic Models: Markov Perfect Equilibrium, and the 'Curse' of Dimensionality," Cowles Foundation Discussion Papers 1144, Cowles Foundation for Research in Economics, Yale University.
    7. Kenneth L. Judd, 1996. "Approximation, perturbation, and projection methods in economic analysis," in: H. M. Amman & D. A. Kendrick & J. Rust (ed.), Handbook of Computational Economics, edition 1, volume 1, chapter 12, pages 509-585, Elsevier.
    8. John Rust, 1997. "Using Randomization to Break the Curse of Dimensionality," Econometrica, Econometric Society, vol. 65(3), pages 487-516, May.
    9. Martin L. Puterman & Shelby L. Brumelle, 1979. "On the Convergence of Policy Iteration in Stationary Dynamic Programming," Mathematics of Operations Research, INFORMS, vol. 4(1), pages 60-69, February.
    10. Hans M. Amman & David A. Kendrick, n.d. "Computational Economics," Online economics textbooks, SUNY-Oswego, Department of Economics, number comp1.
    11. John Rust, 1986. "When Is It Optimal to Kill Off the Market for Used Durable Goods?," Econometrica, Econometric Society, vol. 54(1), pages 65-86, January.

    Citations

    Cited by:

    1. Manuel S. Santos, 2000. "Accuracy of Numerical Solutions using the Euler Equation Residuals," Econometrica, Econometric Society, vol. 68(6), pages 1377-1402, November.
    2. Kristensen, Dennis & Mogensen, Patrick K. & Moon, Jong Myun & Schjerning, Bertel, 2021. "Solving dynamic discrete choice models using smoothing and sieve methods," Journal of Econometrics, Elsevier, vol. 223(2), pages 328-360.
    3. Hall, George & Rust, John, 2000. "An empirical model of inventory investment by durable commodity intermediaries," Carnegie-Rochester Conference Series on Public Policy, Elsevier, vol. 52(1), pages 171-214, June.
    4. Hugo Benitez-Silva, 2000. "A Dynamic Model of Labor Supply, Consumption/Saving, and Annuity Decisions under Uncertainty," Department of Economics Working Papers 00-06, Stony Brook University, Department of Economics.
    5. John Rust & Joseph Traub & Henryk Wozniakowski, 1999. "No Curse of Dimensionality for Contraction Fixed Points Even in the Worst Case," Computational Economics 9902001, University Library of Munich, Germany.
    6. Jacek B. Krawczyk, 2000. "A Markovian Approximated Solution To A Portfolio Management Problem," Computing in Economics and Finance 2000 233, Society for Computational Economics.
    7. Santos, Manuel S., 1998. "Accuracy of numerical solutions using the eulers equation residuals," UC3M Working papers. Economics 4157, Universidad Carlos III de Madrid. Departamento de Economía.
    8. Victor Aguirregabiria & Pedro Mira, 2002. "Swapping the Nested Fixed Point Algorithm: A Class of Estimators for Discrete Markov Decision Models," Econometrica, Econometric Society, vol. 70(4), pages 1519-1543, July.
    9. Reiter, Michael, 1999. "Solving higher-dimensional continuous-time stochastic control problems by value function regression," Journal of Economic Dynamics and Control, Elsevier, vol. 23(9-10), pages 1329-1353, September.
    10. Ronald Goettler & Ron Shachar, 2000. "Estimating Product Characteristics and Spatial Competition in the Network Television Industry," Econometric Society World Congress 2000 Contributed Papers 1691, Econometric Society.
    11. Hugo Benitez-Silva, 2000. "A Joint Model of Labor Supply and Consumption Decisions Under Uncertainty," Econometric Society World Congress 2000 Contributed Papers 0196, Econometric Society.
    12. Hugo Benítez-Silva, 2003. "The Annuity Puzzle Revisited," Working Papers wp055, University of Michigan, Michigan Retirement Research Center.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. John Rust, 1997. "Using Randomization to Break the Curse of Dimensionality," Econometrica, Econometric Society, vol. 65(3), pages 487-516, May.
    2. Willi Semmler & Lars Grüne, 2004. "Asset Pricing with Delayed Consumption Decisions," Computing in Economics and Finance 2004 59, Society for Computational Economics.
    3. Grune, Lars & Semmler, Willi, 2004. "Using dynamic programming with adaptive grid scheme for optimal control problems in economics," Journal of Economic Dynamics and Control, Elsevier, vol. 28(12), pages 2427-2456, December.
    4. John Rust & Joseph Traub & Henryk Wozniakowski, 1999. "No Curse of Dimensionality for Contraction Fixed Points Even in the Worst Case," Computational Economics 9902001, University Library of Munich, Germany.
    5. Geweke, J. & Joel Horowitz & Pesaran, M.H., 2006. "Econometrics: A Bird’s Eye View," Cambridge Working Papers in Economics 0655, Faculty of Economics, University of Cambridge.
    6. Bound, John & Stinebrickner, Todd & Waidmann, Timothy, 2010. "Health, economic resources and the work decisions of older men," Journal of Econometrics, Elsevier, vol. 156(1), pages 106-129, May.
    7. Lars Grüne & Willi Semmler, 2007. "Asset pricing with dynamic programming," Computational Economics, Springer;Society for Computational Economics, vol. 29(3), pages 233-265, May.
    8. Masakazu Ishihara & Andrew T. Ching, 2019. "Dynamic Demand for New and Used Durable Goods Without Physical Depreciation: The Case of Japanese Video Games," Marketing Science, INFORMS, vol. 38(3), pages 392-416, May.
    9. Gamba, Andrea & Tesser, Matteo, 2009. "Structural estimation of real options models," Journal of Economic Dynamics and Control, Elsevier, vol. 33(4), pages 798-816, April.
    10. Ayşe Kabukçuoğlu & Enrique Martínez-García, 2021. "A Generalized Time Iteration Method for Solving Dynamic Optimization Problems with Occasionally Binding Constraints," Computational Economics, Springer;Society for Computational Economics, vol. 58(2), pages 435-460, August.
    11. Kristensen, Dennis & Salanié, Bernard, 2017. "Higher-order properties of approximate estimators," Journal of Econometrics, Elsevier, vol. 198(2), pages 189-208.
    12. Todd R. Stinebrickner, 2000. "Serially correlated variables in dynamic, discrete choice models," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 15(6), pages 595-624.
    13. Reiter, Michael, 1999. "Solving higher-dimensional continuous-time stochastic control problems by value function regression," Journal of Economic Dynamics and Control, Elsevier, vol. 23(9-10), pages 1329-1353, September.
    14. Alemdar, Nedim M. & Sirakaya, Sibel & Husseinov, Farhad, 2006. "Optimal time aggregation of infinite horizon control problems," Journal of Economic Dynamics and Control, Elsevier, vol. 30(4), pages 569-593, April.
    15. Anderson, Evan W. & Hansen, Lars Peter & Sargent, Thomas J., 2012. "Small noise methods for risk-sensitive/robust economies," Journal of Economic Dynamics and Control, Elsevier, vol. 36(4), pages 468-500.
    16. John Bound & Todd Stinebrickner & Timothy Waidman, 2004. "Using a Structural Retirement Model to Simulate the Effect of Changes to the OASDI and Medicare Programs," Working Papers wp091, University of Michigan, Michigan Retirement Research Center.
    17. Jesús Fernández-Villaverde & Juan F. Rubio-Ramirez, 2001. "Comparing dynamic equilibrium economies to data," FRB Atlanta Working Paper 2001-23, Federal Reserve Bank of Atlanta.
    18. Nikolaj Malchow-Møller & Michael Svarer, 2003. "Estimation of the multinomial logit model with random effects," Applied Economics Letters, Taylor & Francis Journals, vol. 10(7), pages 389-392.
    19. Linnea Polgreen & Pedro Silos, 2008. "Capital-Skill Complementarity and Inequality: A Sensitivity Analysis," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 11(2), pages 302-313, April.
    20. Kristensen, Dennis & Mogensen, Patrick K. & Moon, Jong Myun & Schjerning, Bertel, 2021. "Solving dynamic discrete choice models using smoothing and sieve methods," Journal of Econometrics, Elsevier, vol. 223(2), pages 328-360.

    More about this item

    JEL classification:

    • C8 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs
