
Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty

Author

Listed:
  • Mauro Gaggero
  • Giorgio Gnecco
  • Marcello Sanguineti

Abstract

Stochastic optimization problems with an objective function that is additive over a finite number of stages are addressed. Although Dynamic Programming allows one to formally solve such problems, closed-form solutions can be derived only in particular cases. The search for suboptimal solutions via two approaches is addressed: approximation of the value functions and approximation of the optimal decision policies. The approximations take on the form of linear combinations of basis functions containing adjustable parameters to be optimized together with the coefficients of the combinations. Two kinds of basis functions are considered: Gaussians with varying centers and widths and sigmoids with varying weights and biases. The accuracies of such suboptimal solutions are investigated via estimates of the error propagation through the stages. Upper bounds are derived on the differences between the optimal value of the objective functional and its suboptimal values corresponding to the use at each stage of approximate value functions and approximate policies. Conditions under which the number of basis functions required for a desired approximation accuracy does not grow “too fast” with respect to the dimensions of the state and random vectors are provided. As an example of application, a multidimensional problem of optimal consumption under uncertainty is investigated, where consumers aim at maximizing a social utility function. Numerical simulations are provided, emphasizing computational pros and cons of the two approaches (i.e., value-function approximation and optimal-policy approximation) using the above-mentioned two kinds of basis functions. To investigate the dependencies of the performances on dimensionality, the numerical analysis is performed for various numbers of consumers. In the simulations, discretization techniques exploiting low-discrepancy sequences are used. 
Both theoretical and numerical results give insights into the possibility of coping with the curse of dimensionality in stochastic optimization problems whose decision strategies depend on large numbers of variables.

Copyright Springer Science+Business Media New York 2014
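
The two families of approximators named in the abstract, and the idea of sampling the state space with a low-discrepancy sequence, can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the article: the exact parametrization of the Gaussian and sigmoidal units, the function names, and the choice of the Halton sequence as the low-discrepancy sequence (the abstract does not say which sequence is used).

```python
import math


def gaussian_unit(x, center, width):
    """Gaussian basis function with adjustable center and width (assumed form)."""
    sq_dist = sum((xi - ci) ** 2 for xi, ci in zip(x, center))
    return math.exp(-sq_dist / width ** 2)


def sigmoid_unit(x, weights, bias):
    """Sigmoidal basis function with adjustable weights and bias (logistic form assumed)."""
    z = sum(wi * xi for wi, xi in zip(weights, x)) + bias
    return 1.0 / (1.0 + math.exp(-z))


def gaussian_approximator(x, coeffs, centers, widths):
    """Linear combination of Gaussian units; coeffs, centers and widths are all tunable."""
    return sum(c * gaussian_unit(x, mu, w)
               for c, mu, w in zip(coeffs, centers, widths))


def sigmoid_approximator(x, coeffs, weight_vectors, biases):
    """Linear combination of sigmoidal units; coeffs, weights and biases are all tunable."""
    return sum(c * sigmoid_unit(x, a, b)
               for c, a, b in zip(coeffs, weight_vectors, biases))


def radical_inverse(index, base):
    """Van der Corput radical inverse of a positive integer in the given base."""
    result, fraction = 0.0, 1.0 / base
    while index > 0:
        result += fraction * (index % base)
        index //= base
        fraction /= base
    return result


def halton_points(n, bases=(2, 3)):
    """First n points of a Halton sequence in the unit cube, one prime base per dimension."""
    return [tuple(radical_inverse(i, b) for b in bases) for i in range(1, n + 1)]


if __name__ == "__main__":
    # Evaluate a hypothetical two-unit Gaussian approximator on a few sample states.
    for state in halton_points(4):
        value = gaussian_approximator(
            state, [1.0, -0.5], [(0.2, 0.2), (0.8, 0.8)], [0.5, 0.3]
        )
        print(state, round(value, 4))
```

In a value-function or policy approximation scheme of the kind the abstract describes, the coefficients and the inner parameters (centers and widths, or weights and biases) would be fitted at each stage on such sample points; the fitting step itself is omitted here.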

Suggested Citation

  • Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Approximate dynamic programming for stochastic N-stage optimization with application to optimal consumption under uncertainty," Computational Optimization and Applications, Springer, vol. 58(1), pages 31-85, May.
  • Handle: RePEc:spr:coopap:v:58:y:2014:i:1:p:31-85
    DOI: 10.1007/s10589-013-9614-z

    Download full text from publisher

    File URL: http://hdl.handle.net/10.1007/s10589-013-9614-z
    Download Restriction: Access to full text is restricted to subscribers.

    References listed on IDEAS

    1. Benveniste, L M & Scheinkman, J A, 1979. "On the Differentiability of the Value Function in Dynamic Models of Economics," Econometrica, Econometric Society, vol. 47(3), pages 727-732, May.
    2. Santos, Manuel S, 1991. "Smoothness of the Policy Function in Discrete Time Economic Models," Econometrica, Econometric Society, vol. 59(5), pages 1365-1382, September.
    3. Kenneth L. Judd, 1998. "Numerical Methods in Economics," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262100711, December.
    4. Victoria C. P. Chen & David Ruppert & Christine A. Shoemaker, 1999. "Applying Experimental Design and Regression Splines to High-Dimensional Continuous-State Stochastic Dynamic Programming," Operations Research, INFORMS, vol. 47(1), pages 38-53, February.
    5. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2014. "Suboptimal Policies for Stochastic N-Stage Optimization: Accuracy Analysis and a Case Study from Optimal Consumption," International Series in Operations Research & Management Science, in: Fouad El Ouardighi & Konstantin Kogan (ed.), Models and Methods in Economics and Management Science, edition 127, pages 27-50, Springer.
    6. Semmler, Willi & Sieveking, Malte, 2000. "Critical debt and debt dynamics," Journal of Economic Dynamics and Control, Elsevier, vol. 24(5-7), pages 1121-1144, June.
    7. Jerome Adda & Russell W. Cooper, 2003. "Dynamic Economics: Quantitative Methods and Applications," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012014, December.
    8. Montrucchio, Luigi, 1987. "Lipschitz continuous policy functions for strongly concave optimization problems," Journal of Mathematical Economics, Elsevier, vol. 16(3), pages 259-273, June.
    9. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2013. "Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results," Journal of Optimization Theory and Applications, Springer, vol. 156(2), pages 380-416, February.
    10. Martin L. Puterman & Moon Chirl Shin, 1978. "Modified Policy Iteration Algorithms for Discounted Markov Decision Problems," Management Science, INFORMS, vol. 24(11), pages 1127-1137, July.
    11. Santos, M.S. & Vila, J-L., 1988. "Smoothness Of Policy Function In Continuous Time Economic Models: The One Dimensional Case," UFAE and IAE Working Papers 112-89, Unitat de Fonaments de l'Anàlisi Econòmica (UAB) and Institut d'Anàlisi Econòmica (CSIC).
    12. R. Zoppoli & M. Sanguineti & T. Parisini, 2002. "Approximating Networks and Extended Ritz Method for the Solution of Functional Optimization Problems," Journal of Optimization Theory and Applications, Springer, vol. 112(2), pages 403-440, February.
    13. Hugo Cruz-Suárez & Raúl Montes-de-Oca, 2008. "An envelope theorem and some applications to discounted Markov decision processes," Mathematical Methods of Operations Research, Springer;Gesellschaft für Operations Research (GOR);Nederlands Genootschap voor Besliskunde (NGB), vol. 67(2), pages 299-321, April.
    14. G. Gnecco & M. Sanguineti, 2010. "Suboptimal Solutions to Dynamic Optimization Problems via Approximations of the Policy Functions," Journal of Optimization Theory and Applications, Springer, vol. 146(3), pages 764-794, September.
    15. Karp, Larry & Lee, In Ho, 2001. "Learning-by-Doing and the Choice of Technology: The Role of Patience," Journal of Economic Theory, Elsevier, vol. 100(1), pages 73-92, September.
    16. Bhattacharya,Rabi & Majumdar,Mukul, 2007. "Random Dynamical Systems," Cambridge Books, Cambridge University Press, number 9780521825658.
    17. Sharon A. Johnson & Jery R. Stedinger & Christine A. Shoemaker & Ying Li & José Alberto Tejada-Guibert, 1993. "Numerical Solution of Continuous-State Dynamic Programs Using Linear and Spline Interpolation," Operations Research, INFORMS, vol. 41(3), pages 484-500, June.
    18. Nicola Secomandi, 2010. "Optimal Commodity Trading with a Capacitated Storage Asset," Management Science, INFORMS, vol. 56(3), pages 449-467, March.
    19. Montrucchio, Luigi, 1998. "Thompson metric, contraction property and differentiability of policy functions," Journal of Economic Behavior & Organization, Elsevier, vol. 33(3-4), pages 449-466, January.
    20. Santos, Manuel S, 1993. "On High-Order Differentiability of the Policy Function," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 3(3), pages 565-570, July.
    21. S. Giulini & M. Sanguineti, 2009. "Approximation Schemes for Functional Optimization Problems," Journal of Optimization Theory and Applications, Springer, vol. 140(1), pages 33-54, January.
    22. Richard Bellman, 1957. "On a Dynamic Programming Approach to the Caterer Problem--I," Management Science, INFORMS, vol. 3(3), pages 270-278, April.
    23. Blume, Lawrence & Easley, David & O'Hara, Maureen, 1982. "Characterization of optimal plans for stochastic dynamic programs," Journal of Economic Theory, Elsevier, vol. 28(2), pages 221-234, December.
    24. Bhattacharya,Rabi & Majumdar,Mukul, 2007. "Random Dynamical Systems," Cambridge Books, Cambridge University Press, number 9780521532723.

    Citations

    Cited by:

    1. Giorgio Gnecco & Fabio Pammolli & Berna Tuncay, 2022. "Welfare and research and development incentive effects of uniform and differential pricing schemes," Computational Management Science, Springer, vol. 19(2), pages 229-268, June.
    2. Giorgio Gnecco & Berna Tuncay & Fabio Pammolli, 2018. "A Comparison of Game-Theoretic Models for Parallel Trade," International Game Theory Review (IGTR), World Scientific Publishing Co. Pte. Ltd., vol. 20(03), pages 1-57, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Mauro Gaggero & Giorgio Gnecco & Marcello Sanguineti, 2013. "Dynamic Programming and Value-Function Approximation in Sequential Decision Problems: Error Analysis and Numerical Results," Journal of Optimization Theory and Applications, Springer, vol. 156(2), pages 380-416, February.
    2. John Stachurski, 2009. "Economic Dynamics: Theory and Computation," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012774, December.
    3. Chen, Yu & Cosimano, Thomas F. & Himonas, Alex A., 2008. "Analytic solving of asset pricing models: The by force of habit case," Journal of Economic Dynamics and Control, Elsevier, vol. 32(11), pages 3631-3660, November.
    4. Williams, Noah, 2004. "Small noise asymptotics for a stochastic growth model," Journal of Economic Theory, Elsevier, vol. 119(2), pages 271-298, December.
    5. G. Gnecco & M. Sanguineti, 2010. "Suboptimal Solutions to Dynamic Optimization Problems via Approximations of the Policy Functions," Journal of Optimization Theory and Applications, Springer, vol. 146(3), pages 764-794, September.
    6. Cuong Le Van & Lisa Morhaim, 2006. "On optimal growth models when the discount factor is near 1 or equal to 1," Post-Print halshs-00096034, HAL.
    7. Mercedes Esteban-Bravo & Jose M. Vidal-Sanz & Gökhan Yildirim, 2014. "Valuing Customer Portfolios with Endogenous Mass and Direct Marketing Interventions Using a Stochastic Dynamic Programming Decomposition," Marketing Science, INFORMS, vol. 33(5), pages 621-640, September.
    8. Mitra, Tapan & Nishimura, Kazuo, 2001. "Discounting and Long-Run Behavior: Global Bifurcation Analysis of a Family of Dynamical Systems," Journal of Economic Theory, Elsevier, vol. 96(1-2), pages 256-293, January.
    9. M. Baglietto & C. Cervellera & M. Sanguineti & R. Zoppoli, 2010. "Management of water resource systems in the presence of uncertainties by nonlinear approximation techniques and deterministic sampling," Computational Optimization and Applications, Springer, vol. 47(2), pages 349-376, October.
    10. Alain Venditti, 2012. "Weak concavity properties of indirect utility functions in multisector optimal growth models," International Journal of Economic Theory, The International Society for Economic Theory, vol. 8(1), pages 13-26, March.
    11. Aoki, Takaaki, 2013. "Some Mathematical Properties of the Dynamically Inconsistent Bellman Equation: A Note on the Two-sided Altruism Dynamics," MPRA Paper 44994, University Library of Munich, Germany.
    12. King, Robert P. & Lohano, Heman D., 2006. "Accuracy of Numerical Solution to Dynamic Programming Models," Staff Papers 14230, University of Minnesota, Department of Applied Economics.
    13. Lars J. Olson & Santanu Roy, 2006. "Theory of Stochastic Optimal Economic Growth," Springer Books, in: Rose-Anne Dana & Cuong Le Van & Tapan Mitra & Kazuo Nishimura (ed.), Handbook on Optimal Growth 1, chapter 11, pages 297-335, Springer.
    14. Mitra, Tapan & Privileggi, Fabio, 2003. "Cantor Type Invariant Distributions in the Theory of Optimal Growth under Uncertainty," Working Papers 03-09, Cornell University, Center for Analytic Economics.
    15. Bona, Jerry L. & Santos, Manuel S., 1997. "On the Role of Computation in Economic Theory," Journal of Economic Theory, Elsevier, vol. 72(2), pages 241-281, February.
    16. Kehoe, Timothy J. & Levine, David K. & Romer, Paul M., 1990. "Determinacy of equilibria in dynamic models with finitely many consumers," Journal of Economic Theory, Elsevier, vol. 50(1), pages 1-21, February.
    17. Andrea Bacigalupo & Giorgio Gnecco & Marco Lepidi & Luigi Gambarotta, 2020. "Machine-Learning Techniques for the Optimal Design of Acoustic Metamaterials," Journal of Optimization Theory and Applications, Springer, vol. 187(3), pages 630-653, December.
    18. Venditti, Alain, 1997. "Strong Concavity Properties of Indirect Utility Functions in Multisector Optimal Growth Models," Journal of Economic Theory, Elsevier, vol. 74(2), pages 349-367, June.
    19. Cervellera, Cristiano, 2023. "Optimized ensemble value function approximation for dynamic programming," European Journal of Operational Research, Elsevier, vol. 309(2), pages 719-730.
    20. Thomas, Jonathan P. & Worrall, Tim, 2018. "Dynamic relational contracts under complete information," Journal of Economic Theory, Elsevier, vol. 175(C), pages 624-651.
