IDEAS home Printed from https://ideas.repec.org/a/sae/medema/v30y2010i4p474-483.html
   My bibliography  Save this article

Markov Decision Processes: A Tool for Sequential Decision Making under Uncertainty

Author

Listed:
  • Oguzhan Alagoz

    (Department of Industrial and Systems Engineering, University of Wisconsin-Madison, Madison, WI, alagoz@engr.wisc.edu)

  • Heather Hsu

    (Department of Industrial Engineering, University of Pittsburgh, Pittsburgh, PA, Section of Decision Sciences and Clinical Systems Modeling, Division of General Medicine, and Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA)

  • Andrew J. Schaefer

    (Department of Industrial Engineering, University of Pittsburgh, Pittsburgh, PA)

  • Mark S. Roberts

    (Department of Health Policy and Management, University of Pittsburgh Graduate School of Public Health, Pittsburgh, PA, Section of Decision Sciences and Clinical Systems Modeling, Division of General Medicine, and Department of Medicine, University of Pittsburgh School of Medicine, Pittsburgh, PA)

Abstract

We provide a tutorial on the construction and evaluation of Markov decision processes (MDPs), which are powerful analytical tools used for sequential decision making under uncertainty that have been widely used in many industrial and manufacturing applications but are underutilized in medical decision making (MDM). We demonstrate the use of an MDP to solve a sequential clinical treatment problem under uncertainty. Markov decision processes generalize standard Markov models in that a decision process is embedded in the model and multiple decisions are made over time. Furthermore, they have significant advantages over standard decision analysis. We compare MDPs to standard Markov-based simulation models by solving the problem of the optimal timing of living-donor liver transplantation using both methods. Both models result in the same optimal transplantation policy and the same total life expectancies for the same patient and living donor. The computation time for solving the MDP model is significantly smaller than that for solving the Markov model. We briefly describe the growing literature of MDPs applied to medical decisions.

Suggested Citation

  • Oguzhan Alagoz & Heather Hsu & Andrew J. Schaefer & Mark S. Roberts, 2010. "Markov Decision Processes: A Tool for Sequential Decision Making under Uncertainty," Medical Decision Making, , vol. 30(4), pages 474-483, July.
  • Handle: RePEc:sae:medema:v:30:y:2010:i:4:p:474-483
    DOI: 10.1177/0272989X09353194
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/0272989X09353194
    Download Restriction: no

    File URL: https://libkey.io/10.1177/0272989X09353194?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Oguzhan Alagoz & Cindy L. Bryce & Steven Shechter & Andrew Schaefer & Chung-Chou H. Chang & Derek C. Angus & Mark S. Roberts, 2005. "Incorporating Biological Natural History in Simulation Models: Empirical Estimates of the Progression of End-Stage Liver Disease," Medical Decision Making, , vol. 25(6), pages 620-632, November.
    2. Oguzhan Alagoz & Lisa M. Maillart & Andrew J. Schaefer & Mark S. Roberts, 2007. "Choosing Among Living-Donor and Cadaveric Livers," Management Science, INFORMS, vol. 53(11), pages 1702-1715, November.
    3. Chuanpu Hu & William S. Lovejoy & Steven L. Shafer, 1996. "Comparison of Some Suboptimal Control Policies in Medical Drug Therapy," Operations Research, INFORMS, vol. 44(5), pages 696-709, October.
    4. Oguzhan Alagoz & Lisa M. Maillart & Andrew J. Schaefer & Mark S. Roberts, 2004. "The Optimal Timing of Living-Donor Liver Transplantation," Management Science, INFORMS, vol. 50(10), pages 1420-1430, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. F. R. Rolli & M. Ruggeri & F. Kheiraoui & C. Drago & M. Basile & C. Favaretti & A. Cicchetti, 2018. "Economic evaluation of Zepatier for the management of HCV in the Italian scenario," The European Journal of Health Economics, Springer;Deutsche Gesellschaft für Gesundheitsökonomie (DGGÖ), vol. 19(9), pages 1365-1374, December.
    2. Eike Nohdurft & Elisa Long & Stefan Spinler, 2017. "Was Angelina Jolie Right? Optimizing Cancer Prevention Strategies Among BRCA Mutation Carriers," Decision Analysis, INFORMS, vol. 14(3), pages 139-169, September.
    3. Paul Kairys & Thomas Frese & Paul Voigt & Johannes Horn & Matthias Girndt & Rafael Mikolajczyk, 2022. "Development of the simulation-based German albuminuria screening model (S-GASM) for estimating the cost-effectiveness of albuminuria screening in Germany," PLOS ONE, Public Library of Science, vol. 17(1), pages 1-13, January.
    4. Nazila Bazrafshan & M. M. Lotfi, 2020. "A finite-horizon Markov decision process model for cancer chemotherapy treatment planning: an application to sequential treatment decision making in clinical trials," Annals of Operations Research, Springer, vol. 295(1), pages 483-502, December.
    5. Kotas, Jakob & Ghate, Archis, 2018. "Bayesian learning of dose–response parameters from a cohort under response-guided dosing," European Journal of Operational Research, Elsevier, vol. 265(1), pages 328-343.
    6. Louis Anthony (Tony) Cox, 2012. "Confronting Deep Uncertainties in Risk Analysis," Risk Analysis, John Wiley & Sons, vol. 32(10), pages 1607-1629, October.
    7. Hershberger, Patricia E. & Gallo, Agatha M. & Kavanaugh, Karen & Olshansky, Ellen & Schwartz, Alan & Tur-Kaspa, Ilan, 2012. "The decision-making process of genetically at-risk couples considering preimplantation genetic diagnosis: Initial findings from a grounded theory study," Social Science & Medicine, Elsevier, vol. 74(10), pages 1536-1543.
    8. Ting-Yu Ho & Shan Liu & Zelda B. Zabinsky, 2019. "A Multi-Fidelity Rollout Algorithm for Dynamic Resource Allocation in Population Disease Management," Health Care Management Science, Springer, vol. 22(4), pages 727-755, December.
    9. Malek Ebadi & Raha Akhavan-Tabatabaei, 2021. "Personalized Cotesting Policies for Cervical Cancer Screening: A POMDP Approach," Mathematics, MDPI, vol. 9(6), pages 1-20, March.
    10. Jesse G. Wales & Alexander J. Zolan & William T. Hamilton & Alexandra M. Newman & Michael J. Wagner, 2023. "Combining simulation and optimization to derive operating policies for a concentrating solar power plant," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 45(1), pages 119-150, March.
    11. Sun, Qirun & Wu, Zhi & Gu, Wei & Zhu, Tao & Zhong, Lei & Gao, Ting, 2021. "Flexible expansion planning of distribution system integrating multiple renewable energy sources: An approximate dynamic programming approach," Energy, Elsevier, vol. 226(C).
    12. Lauren E. Cipriano & Thomas A. Weber, 2018. "Population-level intervention and information collection in dynamic healthcare policy," Health Care Management Science, Springer, vol. 21(4), pages 604-631, December.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Zeynep Erkin & Matthew D. Bailey & Lisa M. Maillart & Andrew J. Schaefer & Mark S. Roberts, 2010. "Eliciting Patients' Revealed Preferences: An Inverse Markov Decision Process Approach," Decision Analysis, INFORMS, vol. 7(4), pages 358-365, December.
    2. Burhaneddin Sandıkçı & Lisa M. Maillart & Andrew J. Schaefer & Oguzhan Alagoz & Mark S. Roberts, 2008. "Estimating the Patient's Price of Privacy in Liver Transplantation," Operations Research, INFORMS, vol. 56(6), pages 1393-1410, December.
    3. Boloori, Alireza & Saghafian, Soroush & Chakkera, Harini A. A. & Cook, Curtiss B., 2017. "Data-Driven Management of Post-transplant Medications: An APOMDP Approach," Working Paper Series rwp17-036, Harvard University, John F. Kennedy School of Government.
    4. Burhaneddin Sandıkçı & Lisa M. Maillart & Andrew J. Schaefer & Mark S. Roberts, 2013. "Alleviating the Patient's Price of Privacy Through a Partially Observable Waiting List," Management Science, INFORMS, vol. 59(8), pages 1836-1854, August.
    5. Oguzhan Alagoz & Lisa M. Maillart & Andrew J. Schaefer & Mark S. Roberts, 2007. "Determining the Acceptance of Cadaveric Livers Using an Implicit Model of the Waiting List," Operations Research, INFORMS, vol. 55(1), pages 24-36, February.
    6. Mason, J.E. & Denton, B.T. & Shah, N.D. & Smith, S.A., 2014. "Optimizing the simultaneous management of blood pressure and cholesterol for type 2 diabetes patients," European Journal of Operational Research, Elsevier, vol. 233(3), pages 727-738.
    7. Mustafa Akan & Oguzhan Alagoz & Baris Ata & Fatih Safa Erenay & Adnan Said, 2012. "A Broader View of Designing the Liver Allocation System," Operations Research, INFORMS, vol. 60(4), pages 757-770, August.
    8. Miao He & Lei Zhao & Warren Powell, 2010. "Optimal control of dosage decisions in controlled ovarian hyperstimulation," Annals of Operations Research, Springer, vol. 178(1), pages 223-245, July.
    9. Zlatana Nenova & Jennifer Shang, 2022. "Personalized Chronic Disease Follow‐Up Appointments: Risk‐Stratified Care Through Big Data," Production and Operations Management, Production and Operations Management Society, vol. 31(2), pages 583-606, February.
    10. Jinghua Shi & Oguzhan Alagoz & Fatih Erenay & Qiang Su, 2014. "A survey of optimization models on cancer chemotherapy treatment planning," Annals of Operations Research, Springer, vol. 221(1), pages 331-356, October.
    11. Sepehr Nemati & Zeynep G. Icten & Lisa M. Maillart & Andrew J. Schaefer, 2020. "Mitigating Information Asymmetry in Liver Allocation," INFORMS Journal on Computing, INFORMS, vol. 32(2), pages 234-248, April.
    12. Jonathan E. Helm & Mariel S. Lavieri & Mark P. Van Oyen & Joshua D. Stein & David C. Musch, 2015. "Dynamic Forecasting and Control Algorithms of Glaucoma Progression for Clinician Decision Support," Operations Research, INFORMS, vol. 63(5), pages 979-999, October.
    13. Klein, Michael G. & Verter, Vedat & Moses, Brian G., 2020. "Designing a rural network of dialysis facilities," European Journal of Operational Research, Elsevier, vol. 282(3), pages 1088-1100.
    14. Kotas, Jakob & Ghate, Archis, 2018. "Bayesian learning of dose–response parameters from a cohort under response-guided dosing," European Journal of Operational Research, Elsevier, vol. 265(1), pages 328-343.
    15. Eike Nohdurft & Elisa Long & Stefan Spinler, 2017. "Was Angelina Jolie Right? Optimizing Cancer Prevention Strategies Among BRCA Mutation Carriers," Decision Analysis, INFORMS, vol. 14(3), pages 139-169, September.
    16. Satır, Benhür & Erenay, Fatih Safa & Bookbinder, James H., 2018. "Shipment consolidation with two demand classes: Rationing the dispatch capacity," European Journal of Operational Research, Elsevier, vol. 270(1), pages 171-184.
    17. Zhao, Xian & He, Zongda & Wu, Yaguang & Qiu, Qingan, 2022. "Joint optimization of condition-based performance control and maintenance policies for mission-critical systems," Reliability Engineering and System Safety, Elsevier, vol. 226(C).
    18. Sen Lin & Bo Li & Antonio Arreola-Risa & Yiwei Huang, 2023. "Optimizing a single-product production-inventory system under constant absolute risk aversion," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 31(3), pages 510-537, October.
    19. Wenjuan Fan & Yang Zong & Subodha Kumar, 2022. "Optimal treatment of chronic kidney disease with uncertainty in obtaining a transplantable kidney: an MDP based approach," Annals of Operations Research, Springer, vol. 316(1), pages 269-302, September.
    20. Baruch Keren & Joseph Pliskin, 2011. "Optimal timing of joint replacement using mathematical programming and stochastic programming models," Health Care Management Science, Springer, vol. 14(4), pages 361-369, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:medema:v:30:y:2010:i:4:p:474-483. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.