IDEAS home Printed from https://ideas.repec.org/a/kap/hcarem/v26y2023i3d10.1007_s10729-023-09636-5.html
   My bibliography  Save this article

A reinforcement learning-based optimal control approach for managing an elective surgery backlog after pandemic disruption

Author

Listed:
  • Huyang Xu

    (Chengdu University of Technology)

  • Yuanchen Fang

    (Sichuan University)

  • Chun-An Chou

    (Northeastern University)

  • Nasser Fard

    (Northeastern University)

  • Li Luo

    (Sichuan University)

Abstract

Contagious disease pandemics, such as COVID-19, can cause hospitals around the world to delay nonemergent elective surgeries, which results in a large surgery backlog. To develop an operational solution for providing patients timely surgical care with limited health care resources, this study proposes a stochastic control process-based method that helps hospitals make operational recovery plans to clear their surgery backlog and restore surgical activity safely. The elective surgery backlog recovery process is modeled by a general discrete-time queueing network system, which is formulated by a Markov decision process. A scheduling optimization algorithm based on the piecewise decaying $$\epsilon$$ ϵ -greedy reinforcement learning algorithm is proposed to make dynamic daily surgery scheduling plans considering newly arrived patients, waiting time and clinical urgency. The proposed method is tested through a set of simulated dataset, and implemented on an elective surgery backlog that built up in one large general hospital in China after the outbreak of COVID-19. The results show that, compared with the current policy, the proposed method can effectively and rapidly clear the surgery backlog caused by a pandemic while ensuring that all patients receive timely surgical care. These results encourage the wider adoption of the proposed method to manage surgery scheduling during all phases of a public health crisis.

Suggested Citation

  • Huyang Xu & Yuanchen Fang & Chun-An Chou & Nasser Fard & Li Luo, 2023. "A reinforcement learning-based optimal control approach for managing an elective surgery backlog after pandemic disruption," Health Care Management Science, Springer, vol. 26(3), pages 430-446, September.
  • Handle: RePEc:kap:hcarem:v:26:y:2023:i:3:d:10.1007_s10729-023-09636-5
    DOI: 10.1007/s10729-023-09636-5
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10729-023-09636-5
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10729-023-09636-5?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Yufan Zhao & Donglin Zeng & Mark A. Socinski & Michael R. Kosorok, 2011. "Reinforcement Learning Strategies for Clinical Trials in Nonsmall Cell Lung Cancer," Biometrics, The International Biometric Society, vol. 67(4), pages 1422-1433, December.
    2. Schütz, Hans-Jörg & Kolisch, Rainer, 2012. "Approximate dynamic programming for capacity allocation in the service industry," European Journal of Operational Research, Elsevier, vol. 218(1), pages 239-250.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Peter J. H. Hulshof & Martijn R. K. Mes & Richard J. Boucherie & Erwin W. Hans, 2016. "Patient admission planning using Approximate Dynamic Programming," Flexible Services and Manufacturing Journal, Springer, vol. 28(1), pages 30-61, June.
    2. Alireza F. Hesaraki & Nico P. Dellaert & Ton Kok, 2020. "Integrating nurse assignment in outpatient chemotherapy appointment scheduling," OR Spectrum: Quantitative Approaches in Management, Springer;Gesellschaft für Operations Research e.V., vol. 42(4), pages 935-963, December.
    3. Eric B. Laber & Anastasios A. Tsiatis & Marie Davidian & Shannon T. Holloway, 2014. "Discussion of “Combining biomarkers to optimize patient treatment recommendation”," Biometrics, The International Biometric Society, vol. 70(3), pages 707-710, September.
    4. Sauré, Antoine & Patrick, Jonathan & Tyldesley, Scott & Puterman, Martin L., 2012. "Dynamic multi-appointment patient scheduling for radiation therapy," European Journal of Operational Research, Elsevier, vol. 223(2), pages 573-584.
    5. Gartner, Daniel & Kolisch, Rainer, 2014. "Scheduling the hospital-wide flow of elective patients," European Journal of Operational Research, Elsevier, vol. 233(3), pages 689-699.
    6. Ulusan, Aybike & Ergun, Özlem, 2021. "Approximate dynamic programming for network recovery problems with stochastic demand," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 151(C).
    7. Abdolreza Rasouli Kenari & Mahboubeh Shamsi, 2021. "A hyper-heuristic selector algorithm for cloud computing scheduling based on workflow features," OPSEARCH, Springer;Operational Research Society of India, vol. 58(4), pages 852-868, December.
    8. Ying Ding & Bin Nan, 2015. "Estimating Mean Survival Time: When is it Possible?," Scandinavian Journal of Statistics, Danish Society for Theoretical Statistics;Finnish Statistical Society;Norwegian Statistical Association;Swedish Statistical Association, vol. 42(2), pages 397-413, June.
    9. Creemers, Stefan & Lambrecht, Marc R. & Beliën, Jeroen & Van den Broeke, Maud, 2021. "Evaluation of appointment scheduling rules: A multi-performance measurement approach," Omega, Elsevier, vol. 100(C).
    10. De Vuyst, Stijn & Bruneel, Herwig & Fiems, Dieter, 2014. "Computationally efficient evaluation of appointment schedules in health care," European Journal of Operational Research, Elsevier, vol. 237(3), pages 1142-1154.
    11. Ridvan Gedik & Shengfan Zhang & Chase Rainwater, 2017. "Strategic level proton therapy patient admission planning: a Markov decision process modeling approach," Health Care Management Science, Springer, vol. 20(2), pages 286-302, June.
    12. Xin Chen & Rui Song & Jiajia Zhang & Swann Arp Adams & Liuquan Sun & Wenbin Lu, 2022. "On estimating optimal regime for treatment initiation time based on restricted mean residual lifetime," Biometrics, The International Biometric Society, vol. 78(4), pages 1377-1389, December.
    13. Giorgos Bakoyannis, 2023. "Estimating optimal individualized treatment rules with multistate processes," Biometrics, The International Biometric Society, vol. 79(4), pages 2830-2842, December.
    14. Hongming Pu & Bo Zhang, 2021. "Estimating optimal treatment rules with an instrumental variable: A partial identification learning approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 318-345, April.
    15. Qu, Xiuli & Peng, Yidong & Shi, Jing & LaGanga, Linda, 2015. "An MDP model for walk-in patient admission management in primary care clinics," International Journal of Production Economics, Elsevier, vol. 168(C), pages 303-320.
    16. Hans-Jörg Schütz & Rainer Kolisch, 2013. "Capacity allocation for demand of different customer-product-combinations with cancellations, no-shows, and overbooking when there is a sequential delivery of service," Annals of Operations Research, Springer, vol. 206(1), pages 401-423, July.
    17. Astaraky, Davood & Patrick, Jonathan, 2015. "A simulation based approximate dynamic programming approach to multi-class, multi-resource surgical scheduling," European Journal of Operational Research, Elsevier, vol. 245(1), pages 309-319.
    18. Qing, Qiankai & Deng, Tianhu & Wang, Hongwei, 2017. "Capacity allocation under downstream competition and bargaining," European Journal of Operational Research, Elsevier, vol. 261(1), pages 97-107.
    19. I Díaz & O Savenkov & K Ballman, 2018. "Targeted learning ensembles for optimal individualized treatment rules with time-to-event outcomes," Biometrika, Biometrika Trust, vol. 105(3), pages 723-738.
    20. Caiyun Fan & Wenbin Lu & Rui Song & Yong Zhou, 2017. "Concordance-assisted learning for estimating optimal individualized treatment regimes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(5), pages 1565-1582, November.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:kap:hcarem:v:26:y:2023:i:3:d:10.1007_s10729-023-09636-5. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.