IDEAS home Printed from https://ideas.repec.org/a/spr/joptap/v156y2013i2d10.1007_s10957-012-0118-2.html
   My bibliography  Save this article

Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results

Author

Listed:
  • Mauro Gaggero

    (National Research Council of Italy)

  • Giorgio Gnecco

    (University of Genova)

  • Marcello Sanguineti

    (University of Genova)

Abstract

Value-function approximation is investigated for the solution via Dynamic Programming (DP) of continuous-state sequential N-stage decision problems, in which the reward to be maximized has an additive structure over a finite number of stages. Conditions that guarantee smoothness properties of the value function at each stage are derived. These properties are exploited to approximate such functions by means of certain nonlinear approximation schemes, which include splines of suitable order and Gaussian radial-basis networks with variable centers and widths. The accuracies of suboptimal solutions obtained by combining DP with these approximation tools are estimated. The results provide insights into the successful performances appeared in the literature about the use of value-function approximators in DP. The theoretical analysis is applied to a problem of optimal consumption, with simulation results illustrating the use of the proposed solution methodology. Numerical comparisons with classical linear approximators are presented.

Suggested Citation

  • Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2013. "Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results," Journal of Optimization Theory and Applications, Springer, vol. 156(2), pages 380-416, February.
  • Handle: RePEc:spr:joptap:v:156:y:2013:i:2:d:10.1007_s10957-012-0118-2
    DOI: 10.1007/s10957-012-0118-2
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10957-012-0118-2
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10957-012-0118-2?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jerome Adda & Russell W. Cooper, 2003. "Dynamic Economics: Quantitative Methods and Applications," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012014, December.
    2. Karp, Larry & Lee, In Ho, 2001. "Learning-by-Doing and the Choice of Technology: The Role of Patience," Journal of Economic Theory, Elsevier, vol. 100(1), pages 73-92, September.
    3. Victoria C. P. Chen & David Ruppert & Christine A. Shoemaker, 1999. "Applying Experimental Design and Regression Splines to High-Dimensional Continuous-State Stochastic Dynamic Programming," Operations Research, INFORMS, vol. 47(1), pages 38-53, February.
    4. Martin L. Puterman & Moon Chirl Shin, 1978. "Modified Policy Iteration Algorithms for Discounted Markov Decision Problems," Management Science, INFORMS, vol. 24(11), pages 1127-1137, July.
    5. R. Zoppoli & M. Sanguineti & T. Parisini, 2002. "Approximating Networks and Extended Ritz Method for the Solution of Functional Optimization Problems," Journal of Optimization Theory and Applications, Springer, vol. 112(2), pages 403-440, February.
    6. Kenneth L. Judd, 1998. "Numerical Methods in Economics," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262100711, December.
    7. Wim M. Nawijn, 1990. "Look-Ahead Policies for Admission to a Single Server Loss System," Operations Research, INFORMS, vol. 38(5), pages 854-862, October.
    8. Angelo Alessandri & Giorgio Gnecco & Marcello Sanguineti, 2010. "Minimizing Sequences for a Family of Functional Optimal Estimation Problems," Journal of Optimization Theory and Applications, Springer, vol. 147(2), pages 243-262, November.
    9. Sharon A. Johnson & Jery R. Stedinger & Christine A. Shoemaker & Ying Li & José Alberto Tejada-Guibert, 1993. "Numerical Solution of Continuous-State Dynamic Programs Using Linear and Spline Interpolation," Operations Research, INFORMS, vol. 41(3), pages 484-500, June.
    10. Boldrin, Michele & Montrucchio, Luigi, 1986. "On the indeterminacy of capital accumulation paths," Journal of Economic Theory, Elsevier, vol. 40(1), pages 26-39, October.
    11. Semmler, Willi & Sieveking, Malte, 2000. "Critical debt and debt dynamics," Journal of Economic Dynamics and Control, Elsevier, vol. 24(5-7), pages 1121-1144, June.
    12. G. Gnecco & M. Sanguineti, 2010. "Suboptimal Solutions to Dynamic Optimization Problems via Approximations of the Policy Functions," Journal of Optimization Theory and Applications, Springer, vol. 146(3), pages 764-794, September.
    13. Michael Kopel & Gustav Feichtinger & Herbert Dawid, 1997. "Complex solutions of nonconcave dynamic optimization models (*)," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 9(3), pages 427-439.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty," Computational Optimization and Applications, Springer, vol. 58(1), pages 31-85, May.
    2. Andrea Bacigalupo & Giorgio Gnecco & Marco Lepidi & Luigi Gambarotta, 2020. "Machine-Learning Techniques for the Optimal Design of Acoustic Metamaterials," Journal of Optimization Theory and Applications, Springer, vol. 187(3), pages 630-653, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty," Computational Optimization and Applications, Springer, vol. 58(1), pages 31-85, May.
    2. John Stachurski, 2009. "Economic Dynamics: Theory and Computation," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012774, December.
    3. M. Baglietto & C. Cervellera & M. Sanguineti & R. Zoppoli, 2010. "Management of water resource systems in the presence of uncertainties by nonlinear approximation techniques and deterministic sampling," Computational Optimization and Applications, Springer, vol. 47(2), pages 349-376, October.
    4. Cervellera, C. & Macciò, D., 2011. "A comparison of global and semi-local approximation in T-stage stochastic optimization," European Journal of Operational Research, Elsevier, vol. 208(2), pages 109-118, January.
    5. Maliar, Lilia & Maliar, Serguei & Tsener, Inna, 2022. "Capital-skill complementarity and inequality: Twenty years after," Economics Letters, Elsevier, vol. 220(C).
    6. Mercedes Esteban-Bravo & Jose M. Vidal-Sanz & Gökhan Yildirim, 2014. "Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition," Marketing Science, INFORMS, vol. 33(5), pages 621-640, September.
    7. Serguei Maliar & John Taylor & Lilia Maliar, 2016. "The Impact of Alternative Transitions to Normalized Monetary Policy," 2016 Meeting Papers 794, Society for Economic Dynamics.
    8. King, Robert P. & Lohano, Heman D., 2006. "Accuracy of Numerical Solution to Dynamic Programming Models," Staff Papers 14230, University of Minnesota, Department of Applied Economics.
    9. Jinhui H. Bai & Roger Lagunoff, 2013. "Revealed Political Power," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 54(4), pages 1085-1115, November.
    10. Ufuk Akcigit, 2009. "Firm Size, Innovation Dynamics and Growth," 2009 Meeting Papers 1267, Society for Economic Dynamics.
    11. Gomez-Ruano, Gerardo, 2011. "Technological Change and Immigration Policy," MPRA Paper 63705, University Library of Munich, Germany.
    12. Judd, Kenneth L., 1997. "Computational economics and economic theory: Substitutes or complements?," Journal of Economic Dynamics and Control, Elsevier, vol. 21(6), pages 907-942, June.
    13. Moody Chu & Chun-Hung Kuo & Matthew Lin, 2013. "Tensor Spline Approximation in Economic Dynamics with Uncertainties," Computational Economics, Springer;Society for Computational Economics, vol. 42(2), pages 175-198, August.
    14. Heer Burkhard & Maußner Alfred, 2011. "Value Function Iteration as a Solution Method for the Ramsey Model," Journal of Economics and Statistics (Jahrbuecher fuer Nationaloekonomie und Statistik), De Gruyter, vol. 231(4), pages 494-515, August.
    15. Nikolay Gospodinov & Damba Lkhagvasuren, 2014. "A Moment‐Matching Method For Approximating Vector Autoregressive Processes By Finite‐State Markov Chains," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 29(5), pages 843-859, August.
    16. Giorgio Gnecco, 2016. "On the Curse of Dimensionality in the Ritz Method," Journal of Optimization Theory and Applications, Springer, vol. 168(2), pages 488-509, February.
    17. Cervellera, Cristiano & Chen, Victoria C.P. & Wen, Aihong, 2006. "Optimization of a large-scale water reservoir network by stochastic dynamic programming with efficient state space discretization," European Journal of Operational Research, Elsevier, vol. 171(3), pages 1139-1151, June.
    18. Padula, Mario, 2010. "An approximate consumption function," Journal of Economic Dynamics and Control, Elsevier, vol. 34(3), pages 404-416, March.
    19. Powell, Warren B., 2019. "A unified framework for stochastic optimization," European Journal of Operational Research, Elsevier, vol. 275(3), pages 795-821.
    20. T.W. Archibald & K.I.M. McKinnon & L.C. Thomas, 2006. "Modeling the operation of multireservoir systems using decomposition and stochastic dynamic programming," Naval Research Logistics (NRL), John Wiley & Sons, vol. 53(3), pages 217-225, April.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joptap:v:156:y:2013:i:2:d:10.1007_s10957-012-0118-2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.