Printed from https://ideas.repec.org/a/inm/oropre/v67y2019i6p1719-1737.html

Easy Affine Markov Decision Processes

Authors

Listed:
  • Jie Ning

    (Department of Operations, Weatherhead School of Management, Case Western Reserve University, Cleveland, Ohio 44106)

  • Matthew J. Sobel

    (Department of Operations, Weatherhead School of Management, Case Western Reserve University, Cleveland, Ohio 44106)

Abstract

This paper characterizes the class of decomposable affine Markov decision processes (MDPs), which have continuous multidimensional endogenous states and actions, and Markov-modulated exogenous states. This class of MDPs has affine dynamics and single-period rewards, sets of feasible actions that decompose into bounded polytopes, and endogenous state variables that are nonnegative or nonpositive. It is shown that decomposable affine MDPs with discounted criteria have an affine value function and an affine optimal policy. The affine coefficients of the value function and optimal policy are determined by the solution of auxiliary equations, which themselves resemble the dynamic program of a finite MDP. This result exorcizes the curse of dimensionality for decomposable affine MDPs, which otherwise could be solved only approximately with discrete approximations. Additionally, the paper characterizes partially decomposable affine MDPs that meet only some of the assumptions for decomposable affine MDPs. It shows that they are composites of two smaller MDPs, one of which is a decomposable affine MDP. The applicability of the classes of MDPs in the paper is exemplified with models of fishery management, dynamic capacity portfolio management, and commodity procurement.
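
The structure described in the abstract can be illustrated with a deliberately tiny toy model. The sketch below is not the paper's formulation; every name and number in it is an assumption made for illustration. It assumes a scalar endogenous state s >= 0 (for example, a stock), a finite exogenous Markov state i with transition matrix P, an action a = y * s with fraction y in [0, 1], a linear reward r[i] * a, and linear dynamics s' = g[i] * (s - a). Under these assumptions, and with beta * max(g) < 1, the value function takes the form V(s, i) = w[i] * s, and the coefficients w solve auxiliary equations that look like the dynamic program of a finite MDP with states i and two actions (y = 0 or y = 1).

```python
def solve_coefficients(P, r, g, beta, tol=1e-12, max_iter=100_000):
    """Value iteration on the (hypothetical) auxiliary equations

        w[i] = max over y in [0, 1] of
               r[i] * y + beta * g[i] * (1 - y) * sum_j P[i][j] * w[j]

    for the toy model V(s, i) = w[i] * s. Assumes beta * max(g) < 1.
    """
    n = len(P)
    w = [0.0] * n
    for _ in range(max_iter):
        # Expected next-period coefficient from each exogenous state.
        cont = [sum(P[i][j] * w[j] for j in range(n)) for i in range(n)]
        # The objective is linear in y, so only the endpoints y = 1
        # (take everything now) and y = 0 (wait) need to be compared.
        new_w = [max(r[i], beta * g[i] * cont[i]) for i in range(n)]
        if max(abs(a - b) for a, b in zip(new_w, w)) < tol:
            return new_w
        w = new_w
    raise RuntimeError("value iteration did not converge")


def optimal_fractions(P, r, g, beta, w):
    """Return y[i] in {0.0, 1.0}; the policy a = y[i] * s is linear in s."""
    n = len(P)
    fractions = []
    for i in range(n):
        cont = sum(P[i][j] * w[j] for j in range(n))
        fractions.append(1.0 if r[i] >= beta * g[i] * cont else 0.0)
    return fractions


if __name__ == "__main__":
    # Illustrative numbers only: two exogenous states (low and high price).
    P = [[0.7, 0.3], [0.4, 0.6]]
    r = [1.0, 2.0]   # reward per unit taken in each exogenous state
    g = [1.1, 0.9]   # growth factor applied to what is left
    beta = 0.9       # discount factor; beta * max(g) = 0.99 < 1
    w = solve_coefficients(P, r, g, beta)
    print("coefficients:", w)
    print("fractions:", optimal_fractions(P, r, g, beta, w))
```

The point of the sketch is that the continuous state s never has to be discretized: the computation runs only over the finite set of exogenous states, which is the sense in which such models avoid the curse of dimensionality.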

Suggested Citation

  • Jie Ning & Matthew J. Sobel, 2019. "Easy Affine Markov Decision Processes," Operations Research, INFORMS, vol. 67(6), pages 1719-1737, November.
  • Handle: RePEc:inm:oropre:v:67:y:2019:i:6:p:1719-1737
    DOI: 10.1287/opre.2018.1836

    Download full text from publisher

    File URL: https://doi.org/10.1287/opre.2018.1836
    Download Restriction: no


    References listed on IDEAS

    1. Anders Skonhoft & Niels Vestergaard & Martin Quaas, 2012. "Optimal Harvest in an Age Structured Model with Different Fishing Selectivity," Environmental & Resource Economics, Springer;European Association of Environmental and Resource Economists, vol. 51(4), pages 525-544, April.
    2. Andrew J. Clark & Herbert Scarf, 2004. "Optimal Policies for a Multi-Echelon Inventory Problem," Management Science, INFORMS, vol. 50(12_supple), pages 1782-1790, December.
    3. Zéphyr, Luckny & Lang, Pascal & Lamond, Bernard F. & Côté, Pascal, 2017. "Approximate stochastic dynamic programming for hydroelectric production planning," European Journal of Operational Research, Elsevier, vol. 262(2), pages 586-601.
    4. Eric V. Denardo & Uriel G. Rothblum, 1983. "Affine Structure and Invariant Policies for Dynamic Programs," Mathematics of Operations Research, INFORMS, vol. 8(3), pages 342-365, August.
    5. Stephen R. Palumbi, 2004. "Why mothers matter," Nature, Nature, vol. 430(7000), pages 621-622, August.
    6. Charu Sinha & Matthew Sobel & Volodymyr Babich, 2011. "Computationally simple and unified approach to finite- and infinite-horizon Clark–Scarf inventory model," IISE Transactions, Taylor & Francis Journals, vol. 43(3), pages 207-219.
    7. William S. Lovejoy, 1986. "Policy Bounds for Markov Decision Processes," Operations Research, INFORMS, vol. 34(4), pages 630-637, August.
    8. Christian N. K. Anderson & Chih-hao Hsieh & Stuart A. Sandin & Roger Hewitt & Anne Hollowed & John Beddington & Robert M. May & George Sugihara, 2008. "Why fishing magnifies fluctuations in fish abundance," Nature, Nature, vol. 452(7189), pages 835-839, April.
    9. Sripad K. Devalkar & Ravi Anupindi & Amitabh Sinha, 2011. "Integrated Optimization of Procurement, Processing, and Trade of Commodities," Operations Research, INFORMS, vol. 59(6), pages 1369-1381, December.
    10. Matthew J. Sobel, 1990. "Myopic Solutions of Affine Dynamic Models," Operations Research, INFORMS, vol. 38(5), pages 847-853, October.
    11. Sharon A. Johnson & Jery R. Stedinger & Christine A. Shoemaker & Ying Li & José Alberto Tejada-Guibert, 1993. "Numerical Solution of Continuous-State Dynamic Programs Using Linear and Spline Interpolation," Operations Research, INFORMS, vol. 41(3), pages 484-500, June.
    12. Ward Whitt, 1979. "Approximations of Dynamic Programs, II," Mathematics of Operations Research, INFORMS, vol. 4(2), pages 179-185, May.
    13. Matthew J. Sobel, 1990. "Higher-Order and Average Reward Myopic-Affine Dynamic Models," Mathematics of Operations Research, INFORMS, vol. 15(2), pages 299-310, May.
    14. Eberly, Janice C. & Van Mieghem, Jan A., 1997. "Multi-factor Dynamic Investment under Uncertainty," Journal of Economic Theory, Elsevier, vol. 75(2), pages 345-387, August.
    15. Roy Mendelssohn, 1982. "An Iterative Aggregation Procedure for Markov Decision Processes," Operations Research, INFORMS, vol. 30(1), pages 62-73, February.
    16. Jan A. Van Mieghem, 2003. "Commissioned Paper: Capacity Management, Investment, and Hedging: Review and Recent Developments," Manufacturing & Service Operations Management, INFORMS, vol. 5(4), pages 269-302, July.
    17. Tahvonen, Olli, 2009. "Economics of harvesting age-structured fish populations," Journal of Environmental Economics and Management, Elsevier, vol. 58(3), pages 281-299, November.
    18. Matthew J. Sobel, 1981. "Myopic Solutions of Markov Decision Processes and Stochastic Games," Operations Research, INFORMS, vol. 29(5), pages 995-1009, October.
    19. Ward Whitt, 1978. "Approximations of Dynamic Programs, I," Mathematics of Operations Research, INFORMS, vol. 3(3), pages 231-243, August.
    20. Arthur F. Veinott, Jr., 1965. "Optimal Policy for a Multi-Product, Dynamic, Nonstationary Inventory Problem," Management Science, INFORMS, vol. 12(3), pages 206-222, November.

    Citations



    Cited by:

    1. Slotnick, Susan A. & Sobel, Matthew J., 2022. "Collaboration with a supplier to induce fair labor practices," European Journal of Operational Research, Elsevier, vol. 302(1), pages 244-258.
    2. Jie Ning, 2021. "Reducible Markov Decision Processes and Stochastic Games," Production and Operations Management, Production and Operations Management Society, vol. 30(8), pages 2726-2751, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Matthew J. Sobel & Volodymyr Babich, 2012. "Optimality of Myopic Policies for Dynamic Lot-Sizing Problems in Serial Production Lines with Random Yields and Autoregressive Demand," Operations Research, INFORMS, vol. 60(6), pages 1520-1536, December.
    2. Matthew J. Sobel & Wei Wei, 2010. "Myopic Solutions of Homogeneous Sequential Decision Processes," Operations Research, INFORMS, vol. 58(4-part-2), pages 1235-1246, August.
    3. Jie Ning, 2021. "Reducible Markov Decision Processes and Stochastic Games," Production and Operations Management, Production and Operations Management Society, vol. 30(8), pages 2726-2751, August.
    4. Jan A. Van Mieghem & Nils Rudi, 2002. "Newsvendor Networks: Inventory Management and Capacity Investment with Discretionary Activities," Manufacturing & Service Operations Management, INFORMS, vol. 4(4), pages 313-335, August.
    5. Alexandar Angelus & Evan L. Porteus, 2008. "An Asset Assembly Problem," Operations Research, INFORMS, vol. 56(3), pages 665-680, June.
    6. Andre P. Calmon & Florin D. Ciocan & Gonzalo Romero, 2021. "Revenue Management with Repeated Customer Interactions," Management Science, INFORMS, vol. 67(5), pages 2944-2963, May.
    7. Michael Z. Spivey & Warren B. Powell, 2004. "The Dynamic Assignment Problem," Transportation Science, INFORMS, vol. 38(4), pages 399-419, November.
    8. Holland, Daniel S. & Herrera, Guillermo E., 2012. "The impact of age structure, uncertainty, and asymmetric spatial dynamics on regulatory performance in a fishery metapopulation," Ecological Economics, Elsevier, vol. 77(C), pages 207-218.
    9. Jie Ning & Matthew J. Sobel, 2018. "Production and Capacity Management with Internal Financing," Manufacturing & Service Operations Management, INFORMS, vol. 20(1), pages 147-160, February.
    10. Wenbin Wang & Mark E. Ferguson & Shanshan Hu & Gilvan C. Souza, 2013. "Dynamic Capacity Investment with Two Competing Technologies," Manufacturing & Service Operations Management, INFORMS, vol. 15(4), pages 616-629, October.
    11. Ni, Yuanming & Steinshamn, Stein I. & Kvamsdal, Sturla F., 2022. "Negative shocks in an age-structured bioeconomic model and how to deal with them," Economic Analysis and Policy, Elsevier, vol. 76(C), pages 15-30.
    12. Qin, Ruwen & Nembhard, David A., 2012. "Demand modeling of stochastic product diffusion over the life cycle," International Journal of Production Economics, Elsevier, vol. 137(2), pages 201-210.
    13. Torpong Cheevaprawatdomrong & Robert L. Smith, 2004. "Infinite Horizon Production Scheduling in Time-Varying Systems Under Stochastic Demand," Operations Research, INFORMS, vol. 52(1), pages 105-115, February.
    14. Saif Benjaafar & Daniel Jiang & Xiang Li & Xiaobo Li, 2022. "Dynamic Inventory Repositioning in On-Demand Rental Networks," Management Science, INFORMS, vol. 68(11), pages 7861-7878, November.
    15. Martin F. Quaas & Till Requate, 2013. "Sushi or Fish Fingers? Seafood Diversity, Collapsing Fish Stocks, and Multispecies Fishery Management," Scandinavian Journal of Economics, Wiley Blackwell, vol. 115(2), pages 381-422, April.
    16. Da Rocha, José María & García-Cutrín, Javier & Gutiérrez Huerta, María José & Touza, Julia, 2015. "Reconciling yield stability with international fisheries agencies precautionary preferences: the role of non constant discount factors in age structured models," DFAEII Working Papers 1988-088X, University of the Basque Country - Department of Foundations of Economic Analysis II.
    17. John R. Birge, 2015. "OM Forum—Operations and Finance Interactions," Manufacturing & Service Operations Management, INFORMS, vol. 17(1), pages 4-15, February.
    18. Drouin, Nicol & Gautier, Antoine & Lamond, Bernard F. & Lang, Pascal, 1996. "Piecewise affine approximations for the control of a one-reservoir hydroelectric system," European Journal of Operational Research, Elsevier, vol. 89(1), pages 53-69, February.
    19. Anyan Qi & Hyun-Soo Ahn & Amitabh Sinha, 2017. "Capacity Investment with Demand Learning," Operations Research, INFORMS, vol. 65(1), pages 145-164, February.
    20. Yossi Aviv, 2003. "A Time-Series Framework for Supply-Chain Inventory Management," Operations Research, INFORMS, vol. 51(2), pages 210-227, April.
