IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2408.03034.html
   My bibliography  Save this paper

A Course in Dynamic Optimization

Author

Listed:
  • Bar Light

Abstract

These lecture notes are derived from a graduate-level course in dynamic optimization, offering an introduction to techniques and models extensively used in management science, economics, operations research, engineering, and computer science. The course emphasizes the theoretical underpinnings of discrete-time dynamic programming models and advanced algorithmic strategies for solving these models. Unlike typical treatments, it provides a proof for the principle of optimality for upper semi-continuous dynamic programming, a middle ground between the simpler countable state space case \cite{bertsekas2012dynamic}, and the involved universally measurable case \cite{bertsekas1996stochastic}. This approach is sufficiently rigorous to include important examples such as dynamic pricing, consumption-savings, and inventory management models. The course also delves into the properties of value and policy functions, leveraging classical results \cite{topkis1998supermodularity} and recent developments. Additionally, it offers an introduction to reinforcement learning, including a formal proof of the convergence of Q-learning algorithms. Furthermore, the notes delve into policy gradient methods for the average reward case, presenting a convergence result for the tabular case in this context. This result is simple and similar to the discounted case but appears to be new.

Suggested Citation

  • Bar Light, 2024. "A Course in Dynamic Optimization," Papers 2408.03034, arXiv.org, revised Oct 2024.
  • Handle: RePEc:arx:papers:2408.03034
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2408.03034
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Hopenhayn, Hugo A & Prescott, Edward C, 1992. "Stochastic Monotonicity and Stationary Distributions for Dynamic Economies," Econometrica, Econometric Society, vol. 60(6), pages 1387-1406, November.
    2. Bar Light, 2021. "Stochastic Comparative Statics in Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 46(2), pages 797-810, May.
    3. William S. Lovejoy, 1987. "Ordered Solutions for Dynamic Programs," Mathematics of Operations Research, INFORMS, vol. 12(2), pages 269-276, May.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Light, Bar & Weintraub, Gabriel, 2018. "Mean Field Equilibrium: Uniqueness, Existence, and Comparative Statics," Research Papers 3731, Stanford University, Graduate School of Business.
    2. Mirman, Leonard J. & Morand, Olivier F. & Reffett, Kevin L., 2008. "A qualitative approach to Markovian equilibrium in infinite horizon economies with capital," Journal of Economic Theory, Elsevier, vol. 139(1), pages 75-98, March.
    3. John Stachurski, 2009. "Economic Dynamics: Theory and Computation," MIT Press Books, The MIT Press, edition 1, volume 1, number 0262012774, December.
    4. Bar Light, 2021. "Stochastic Comparative Statics in Markov Decision Processes," Mathematics of Operations Research, INFORMS, vol. 46(2), pages 797-810, May.
    5. Manjira Datta & Kevin L. Reffett, 2005. "Isotone Recursive Methods: the Case of Homogeneous Agents," Tinbergen Institute Discussion Papers 05-012/2, Tinbergen Institute.
    6. Bar Light, 2019. "Stochastic Comparative Statics in Markov Decision Processes," Papers 1904.05481, arXiv.org, revised Jan 2020.
    7. Demange, Gabrielle & Laroque, Guy, 1998. "Long-sighted principal and myopic agents," Journal of Mathematical Economics, Elsevier, vol. 30(2), pages 119-146, September.
    8. Sokbae Lee & Oliver Linton & Yoon-Jae Whang, 2009. "Testing for Stochastic Monotonicity," Econometrica, Econometric Society, vol. 77(2), pages 585-602, March.
    9. Takashi Kamihigashi & John Stachurski, 2011. "Existence, Stability and Computation of Stationary Distributions: An Extension of the Hopenhayn-Prescott Theorem," Discussion Paper Series DP2011-32, Research Institute for Economics & Business Administration, Kobe University.
    10. Chambers, Marcus J & Bailey, Roy E, 1996. "A Theory of Commodity Price Fluctuations," Journal of Political Economy, University of Chicago Press, vol. 104(5), pages 924-957, October.
    11. Nicola Pavoni & G. L. Violante, 2007. "Optimal Welfare-to-Work Programs," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 74(1), pages 283-318.
    12. Marcet, Albert & Obiols-Homs, Francesc & Weil, Philippe, 2007. "Incomplete markets, labor supply and capital accumulation," Journal of Monetary Economics, Elsevier, vol. 54(8), pages 2621-2635, November.
    13. John Stachurski & Cuong Le Van, 2004. "Parametric continuity of stationary distributions," Université Paris1 Panthéon-Sorbonne (Post-Print and Working Papers) halshs-03331313, HAL.
    14. Drewianka, Scott, 2006. "A generalized model of commitment," Mathematical Social Sciences, Elsevier, vol. 52(3), pages 233-251, December.
    15. Marta González-Torrabadella & Josep Pijoan-Mas, 2006. "Flat tax reforms: a general equilibrium evaluation for Spain," Investigaciones Economicas, Fundación SEPI, vol. 30(2), pages 317-351, May.
    16. Zhang, Yuzhe, 2007. "Stochastic optimal growth with a non-compact state space," Journal of Mathematical Economics, Elsevier, vol. 43(2), pages 115-129, February.
    17. Takashi Kamihigashiw & John Stachurski, 2014. "Seeking Ergodicity in Dynamic Economies," Working Papers 2014-402, Department of Research, Ipag Business School.
    18. Masako Ikefuji & Ryo Horii, 2007. "Wealth Heterogeneity and Escape from the Poverty–Environment Trap," Journal of Public Economic Theory, Association for Public Economic Theory, vol. 9(6), pages 1041-1068, December.
    19. François Le Grand & Xavier Ragot, 2022. "Managing Inequality Over Business Cycles: Optimal Policies With Heterogeneous Agents And Aggregate Shocks," International Economic Review, Department of Economics, University of Pennsylvania and Osaka University Institute of Social and Economic Research Association, vol. 63(1), pages 511-540, February.
    20. Lilia Maliar & Serguei Maliar & John B. Taylor & Inna Tsener, 2020. "A tractable framework for analyzing a class of nonstationary Markov models," Quantitative Economics, Econometric Society, vol. 11(4), pages 1289-1323, November.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2408.03034. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.