IDEAS home Printed from https://ideas.repec.org/a/eee/mateco/v100y2022ics0304406822000143.html
   My bibliography  Save this article

Unbounded dynamic programming via the Q-transform

Author

Listed:
  • Ma, Qingyin
  • Stachurski, John
  • Toda, Alexis Akira

Abstract

We propose a new approach to solving dynamic decision problems with unbounded rewards based on the transformations used in Q-learning. In our case, however, the objective of the transform is not learning. Rather, it is to convert an unbounded dynamic program into a bounded one. The approach is general enough to handle problems for which existing methods struggle, and yet simple relative to other techniques and accessible for applied work. We show by example that a variety of common decision problems satisfy our conditions.

Suggested Citation

  • Ma, Qingyin & Stachurski, John & Toda, Alexis Akira, 2022. "Unbounded dynamic programming via the Q-transform," Journal of Mathematical Economics, Elsevier, vol. 100(C).
  • Handle: RePEc:eee:mateco:v:100:y:2022:i:c:s0304406822000143
    DOI: 10.1016/j.jmateco.2022.102652
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0304406822000143
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jmateco.2022.102652?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to look for a different version below or search for a different version of it.

    Other versions of this item:

    References listed on IDEAS

    as
    1. Mark Aguiar & Manuel Amador & Hugo Hopenhayn & Iván Werning, 2019. "Take the Short Route: Equilibrium Default and Debt Maturity," Econometrica, Econometric Society, vol. 87(2), pages 423-462, March.
    2. Rust, John, 1987. "Optimal Replacement of GMC Bus Engines: An Empirical Model of Harold Zurcher," Econometrica, Econometric Society, vol. 55(5), pages 999-1033, September.
    3. Le Van, Cuong & Vailakis, Yiannis, 2005. "Recursive utility and optimal growth with bounded or unbounded returns," Journal of Economic Theory, Elsevier, vol. 123(2), pages 187-209, August.
    4. Shenghao Zhu, 2020. "Existence Of Stationary Equilibrium In An Incomplete‐Market Model With Endogenous Labor Supply," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 61(3), pages 1115-1138, August.
    5. J. J. McCall, 1970. "Economics of Information and Job Search," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 84(1), pages 113-126.
    6. Stachurski, John & Toda, Alexis Akira, 2019. "An impossibility theorem for wealth in heterogeneous-agent models with limited heterogeneity," Journal of Economic Theory, Elsevier, vol. 182(C), pages 1-24.
    7. Ma, Qingyin & Stachurski, John & Toda, Alexis Akira, 2020. "The income fluctuation problem and the evolution of wealth," Journal of Economic Theory, Elsevier, vol. 187(C).
    8. Juan Carlos Hatchondo & Leonardo Martinez & César Sosa-Padilla, 2016. "Debt Dilution and Sovereign Default Risk," Journal of Political Economy, University of Chicago Press, vol. 124(5), pages 1383-1422.
    9. He, Hua & Pearson, Neil D., 1991. "Consumption and portfolio policies with incomplete markets and short-sale constraints: The infinite dimensional case," Journal of Economic Theory, Elsevier, vol. 54(2), pages 259-304, August.
    10. Janusz Matkowski & Andrzej Nowak, 2011. "On discounted dynamic programming with unbounded returns," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 46(3), pages 455-474, April.
    11. S. Rao Aiyagari, 1994. "Uninsured Idiosyncratic Risk and Aggregate Saving," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 109(3), pages 659-684.
    12. Andreas Fagereng & Luigi Guiso & Davide Malacrino & Luigi Pistaferri, 2020. "Heterogeneity and Persistence in Returns to Wealth," Econometrica, Econometric Society, vol. 88(1), pages 115-170, January.
    13. Benhabib, Jess & Bisin, Alberto & Zhu, Shenghao, 2015. "The wealth distribution in Bewley economies with capital income risk," Journal of Economic Theory, Elsevier, vol. 159(PA), pages 489-515.
    14. Jonathan Heathcote & Kjetil Storesletten & Giovanni L. Violante, 2010. "The Macroeconomic Implications of Rising Wage Inequality in the United States," Journal of Political Economy, University of Chicago Press, vol. 118(4), pages 681-722, August.
    15. Paul A. Samuelson, 2011. "Lifetime Portfolio Selection by Dynamic Stochastic Programming," World Scientific Book Chapters, in: Leonard C MacLean & Edward O Thorp & William T Ziemba (ed.), THE KELLY CAPITAL GROWTH INVESTMENT CRITERION THEORY and PRACTICE, chapter 31, pages 465-472, World Scientific Publishing Co. Pte. Ltd..
    16. Hua He & Neil D. Pearson, 1991. "Consumption and Portfolio Policies With Incomplete Markets and Short‐Sale Constraints: the Finite‐Dimensional Case1," Mathematical Finance, Wiley Blackwell, vol. 1(3), pages 1-10, July.
    17. Jaap H. Abbring & Jeffrey R. Campbell & Jan Tilly & Nan Yang, 2018. "Very Simple Markov‐Perfect Industry Dynamics: Theory," Econometrica, Econometric Society, vol. 86(2), pages 721-735, March.
    18. Aguiar, Mark & Amador, Manuel, 2019. "A contraction for sovereign debt models," Journal of Economic Theory, Elsevier, vol. 183(C), pages 842-875.
    19. Jaap H. Abbring & Jeffrey R. Campbell & Jan Tilly & Nan Yang, 2018. "Very Simple Markov-Perfect Industry Dynamics: Empirics," Working Paper Series WP-2018-17, Federal Reserve Bank of Chicago.
    20. Juan Carlos Hatchondo & Leonardo Martinez & Horacio Sapriza, 2009. "Heterogeneous Borrowers In Quantitative Models Of Sovereign Default," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 50(4), pages 1129-1151, November.
    21. V. Filipe Martins-da-Rocha & Yiannis Vailakis, 2010. "Existence and Uniqueness of a Fixed Point for Local Contractions," Econometrica, Econometric Society, vol. 78(3), pages 1127-1141, May.
    22. Cristina Arellano, 2008. "Default Risk and Income Fluctuations in Emerging Economies," American Economic Review, American Economic Association, vol. 98(3), pages 690-712, June.
    23. Cao, Dan, 2020. "Recursive equilibrium in Krusell and Smith (1998)," Journal of Economic Theory, Elsevier, vol. 186(C).
    24. Abbring, Jaap & Campbell, J.R. & Tilly, J. & Yang, N., 2018. "Very Simple Markov-Perfect Industry Dynamics (revision of 2017-021) : Empirics," Discussion Paper 2018-040, Tilburg University, Center for Economic Research.
    25. Takashi Kamihigashi, 2014. "Elementary results on solutions to the bellman equation of dynamic programming: existence, uniqueness, and convergence," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 56(2), pages 251-273, June.
    26. Moritz Kuhn, 2013. "Recursive Equilibria In An Aiyagari‐Style Economy With Permanent Income Shocks," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 54(3), pages 807-835, August.
    27. Dan Cao & Wenlan Luo, 2017. "Persistent Heterogeneous Returns and Top End Wealth Inequality," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 26, pages 301-326, October.
    28. Eugene A. Feinberg & Pavlo O. Kasyanov & Nina V. Zadoianchuk, 2012. "Average Cost Markov Decision Processes with Weakly Continuous Transition Probabilities," Mathematics of Operations Research, INFORMS, vol. 37(4), pages 591-607, November.
    29. Mette Ejrnæs & Martin Browning, 2014. "The persistent–transitory representation for earnings processes," Quantitative Economics, Econometric Society, vol. 5(3), pages 555-581, November.
    30. Toda, Alexis Akira, 2014. "Incomplete market dynamics and cross-sectional distributions," Journal of Economic Theory, Elsevier, vol. 154(C), pages 310-348.
    31. Jovanovic, Boyan, 1982. "Selection and the Evolution of Industry," Econometrica, Econometric Society, vol. 50(3), pages 649-670, May.
    32. Li, Huiyu & Stachurski, John, 2014. "Solving the income fluctuation problem with unbounded rewards," Journal of Economic Dynamics and Control, Elsevier, vol. 45(C), pages 353-365.
    33. Joachim Hubmer & Per Krusell & Anthony A. Smith Jr., 2020. "Sources of US Wealth Inequality: Past, Present, and Future," NBER Chapters, in: NBER Macroeconomics Annual 2020, volume 35, pages 391-455, National Bureau of Economic Research, Inc.
    34. Ma, Qingyin & Toda, Alexis Akira, 2021. "A theory of the saving rate of the rich," Journal of Economic Theory, Elsevier, vol. 192(C).
    35. Alvarez, Fernando & Stokey, Nancy L., 1998. "Dynamic Programming with Homogeneous Functions," Journal of Economic Theory, Elsevier, vol. 82(1), pages 167-189, September.
    36. Boud, John III, 1990. "Recursive utility and the Ramsey problem," Journal of Economic Theory, Elsevier, vol. 50(2), pages 326-345, April.
    37. Bäuerle, Nicole & Jaśkiewicz, Anna, 2018. "Stochastic optimal growth model with risk sensitive preferences," Journal of Economic Theory, Elsevier, vol. 173(C), pages 181-200.
    38. Charalambos D. Aliprantis & Kim C. Border, 2006. "Infinite Dimensional Analysis," Springer Books, Springer, edition 0, number 978-3-540-29587-7, November.
    39. Ma, Qingyin & Stachurski, John, 2019. "Optimal timing of decisions: A general theory based on continuation values," Journal of Economic Dynamics and Control, Elsevier, vol. 101(C), pages 62-81.
    40. Moritz Kuhn, 2013. "Recursive Equilibria In An Aiyagari‐Style Economy With Permanent Income Shocks," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 54, pages 807-835, August.
    41. Cuong Le Van & Yiannis Vailakis, 2005. "Recursive utility and optimal growth with bounded or unbounded returns," Post-Print halshs-00101201, HAL.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Thomas J. Sargent & John Stachurski, 2024. "Dynamic Programming: Finite States," Papers 2401.10473, arXiv.org.
    2. Qingyin Ma & John Stachurski, 2019. "Dynamic Optimal Choice When Rewards are Unbounded Below," Papers 1911.13025, arXiv.org.
    3. Ma, Qingyin & Stachurski, John & Toda, Alexis Akira, 2020. "The income fluctuation problem and the evolution of wealth," Journal of Economic Theory, Elsevier, vol. 187(C).
    4. Ma, Qingyin & Stachurski, John, 2019. "Optimal timing of decisions: A general theory based on continuation values," Journal of Economic Dynamics and Control, Elsevier, vol. 101(C), pages 62-81.
    5. Philippe Bich & Jean-Pierre Drugeon & Lisa Morhaim, 2018. "On Temporal Aggregators and Dynamic Programming," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01437496, HAL.
    6. Philippe Bich & Jean-Pierre Drugeon & Lisa Morhaim, 2018. "On temporal aggregators and dynamic programming," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 66(3), pages 787-817, October.
    7. Philippe Bich & Jean-Pierre Drugeon & Lisa Morhaim, 2015. "On Aggregators and Dynamic Programming," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-01169552, HAL.
    8. Philippe Bich & Jean-Pierre Drugeon & Lisa Morhaim, 2015. "On Aggregators and Dynamic Programming," Post-Print halshs-01169552, HAL.
    9. Ma, Qingyin & Toda, Alexis Akira, 2022. "Asymptotic linearity of consumption functions and computational efficiency," Journal of Mathematical Economics, Elsevier, vol. 98(C).
    10. Philippe Bich & Jean-Pierre Drugeon & Lisa Morhaim, 2015. "On Aggregators and Dynamic Programming," Documents de travail du Centre d'Economie de la Sorbonne 15053, Université Panthéon-Sorbonne (Paris 1), Centre d'Economie de la Sorbonne.
    11. John Stachurski, 2009. "Economic Dynamics: Theory and Computation," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012774, December.
    12. Toda, Alexis Akira, 2019. "Wealth distribution with random discount factors," Journal of Monetary Economics, Elsevier, vol. 104(C), pages 101-113.
    13. Joachim Hubmer & Per Krusell & Anthony A. Smith Jr., 2020. "Sources of US Wealth Inequality: Past, Present, and Future," NBER Chapters, in: NBER Macroeconomics Annual 2020, volume 35, pages 391-455, National Bureau of Economic Research, Inc.
    14. Stachurski, John & Toda, Alexis Akira, 2019. "An impossibility theorem for wealth in heterogeneous-agent models with limited heterogeneity," Journal of Economic Theory, Elsevier, vol. 182(C), pages 1-24.
    15. Gouin-Bonenfant, Emilien & Toda, Alexis Akira, 2018. "Pareto Extrapolation: Bridging Theoretical and Quantitative Models of Wealth Inequality," University of California at San Diego, Economics Working Paper Series qt90n2h2bb, Department of Economics, UC San Diego.
    16. Bloise, Gaetano & Vailakis, Yiannis, 2018. "Convex dynamic programming with (bounded) recursive utility," Journal of Economic Theory, Elsevier, vol. 173(C), pages 118-141.
    17. Marcello D'Amato & Christian Di Pietro & Marco M. Sorge, 2023. "Left and Right: A Tale of Two Tails of the Wealth Distribution," CSEF Working Papers 691, Centre for Studies in Economics and Finance (CSEF), University of Naples, Italy.
    18. Robert A. Becker & Juan Pablo Rincón-Zapatero, 2017. "Arbitration and Renegotiation in Trade Agreements," CAEPR Working Papers 2017-007, Center for Applied Economics and Policy Research, Department of Economics, Indiana University Bloomington.
    19. Rincón-Zapatero, Juan Pablo, 2022. "Existence and uniqueness of solutions to the Bellman equation in stochastic dynamic programming," UC3M Working papers. Economics 35342, Universidad Carlos III de Madrid. Departamento de Economía.
    20. Guanlong Ren & John Stachurski, 2018. "Dynamic Programming with Recursive Preferences: Optimality and Applications," Papers 1812.05748, arXiv.org, revised Jun 2020.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:mateco:v:100:y:2022:i:c:s0304406822000143. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jmateco .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.