IDEAS home Printed from https://ideas.repec.org/a/bla/biomet/v79y2023i4p3676-3689.html
   My bibliography  Save this article

Imputation‐based Q‐learning for optimizing dynamic treatment regimes with right‐censored survival outcome

Author

Listed:
  • Lingyun Lyu
  • Yu Cheng
  • Abdus S. Wahed

Abstract

Q‐learning has been one of the most commonly used methods for optimizing dynamic treatment regimes (DTRs) in multistage decision‐making. Right‐censored survival outcome poses a significant challenge to Q‐Learning due to its reliance on parametric models for counterfactual estimation which are subject to misspecification and sensitive to missing covariates. In this paper, we propose an imputation‐based Q‐learning (IQ‐learning) where flexible nonparametric or semiparametric models are employed to estimate optimal treatment rules for each stage and then weighted hot‐deck multiple imputation (MI) and direct‐draw MI are used to predict optimal potential survival times. Missing data are handled using inverse probability weighting and MI, and the nonrandom treatment assignment among the observed is accounted for using a propensity‐score approach. We investigate the performance of IQ‐learning via extensive simulations and show that it is more robust to model misspecification than existing Q‐Learning methods, imputes only plausible potential survival times contrary to parametric models and provides more flexibility in terms of baseline hazard shape. Using IQ‐learning, we developed an optimal DTR for leukemia treatment based on a randomized trial with observational follow‐up that motivated this study.

Suggested Citation

  • Lingyun Lyu & Yu Cheng & Abdus S. Wahed, 2023. "Imputation‐based Q‐learning for optimizing dynamic treatment regimes with right‐censored survival outcome," Biometrics, The International Biometric Society, vol. 79(4), pages 3676-3689, December.
  • Handle: RePEc:bla:biomet:v:79:y:2023:i:4:p:3676-3689
    DOI: 10.1111/biom.13872
    as

    Download full text from publisher

    File URL: https://doi.org/10.1111/biom.13872
    Download Restriction: no

    File URL: https://libkey.io/10.1111/biom.13872?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Y. Q. Zhao & D. Zeng & E. B. Laber & R. Song & M. Yuan & M. R. Kosorok, 2015. "Doubly robust learning for estimating individualized treatment with censored data," Biometrika, Biometrika Trust, vol. 102(1), pages 151-168.
    2. Gabrielle Simoneau & Erica E. M. Moodie & Jagtar S. Nijjar & Robert W. Platt & the Scottish Early Rheumatoid Arthritis Inception Cohort Investigators, 2020. "Estimating Optimal Dynamic Treatment Regimes With Survival Outcomes," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 115(531), pages 1531-1539, July.
    3. Erica E. M. Moodie & Thomas S. Richardson & David A. Stephens, 2007. "Demystifying Optimal Dynamic Treatment Regimes," Biometrics, The International Biometric Society, vol. 63(2), pages 447-455, June.
    4. James R. Carpenter & Michael G. Kenward & Stijn Vansteelandt, 2006. "A comparison of multiple imputation and doubly robust estimation for analyses with missing data," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 169(3), pages 571-584, July.
    5. Michael P. Wallace & Erica E. M. Moodie, 2015. "Doubly‐robust dynamic treatment regimen estimation via weighted least squares," Biometrics, The International Biometric Society, vol. 71(3), pages 636-644, September.
    6. S. A. Murphy, 2003. "Optimal dynamic treatment regimes," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 65(2), pages 331-355, May.
    7. Yanxun Xu & Peter Müller & Abdus S. Wahed & Peter F. Thall, 2016. "Bayesian Nonparametric Estimation for Dynamic Treatment Regimes With Sequential Transition Times," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 111(515), pages 921-950, July.
    8. Abdus S. Wahed & Peter F. Thall, 2013. "Evaluating joint effects of induction–salvage treatment regimes on overall survival in acute leukaemia," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 62(1), pages 67-83, January.
    9. Runchao Jiang & Wenbin Lu & Rui Song & Marie Davidian, 2017. "On estimation of optimal treatment regimes for maximizing t-year survival probability," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 79(4), pages 1165-1185, September.
    10. Rebecca R. Andridge & Roderick J. A. Little, 2010. "A Review of Hot Deck Imputation for Survey Non‐response," International Statistical Review, International Statistical Institute, vol. 78(1), pages 40-64, April.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Q. Clairon & R. Henderson & N. J. Young & E. D. Wilson & C. J. Taylor, 2021. "Adaptive treatment and robust control," Biometrics, The International Biometric Society, vol. 77(1), pages 223-236, March.
    2. Jin Wang & Donglin Zeng & D. Y. Lin, 2022. "Semiparametric single-index models for optimal treatment regimens with censored outcomes," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 28(4), pages 744-763, October.
    3. Giorgos Bakoyannis, 2023. "Estimating optimal individualized treatment rules with multistate processes," Biometrics, The International Biometric Society, vol. 79(4), pages 2830-2842, December.
    4. Dana Johnson & Wenbin Lu & Marie Davidian, 2023. "A general framework for subgroup detection via one‐step value difference estimation," Biometrics, The International Biometric Society, vol. 79(3), pages 2116-2126, September.
    5. Yingchao Zhong & Chang Wang & Lu Wang, 2021. "Survival Augmented Patient Preference Incorporated Reinforcement Learning to Evaluate Tailoring Variables for Personalized Healthcare," Stats, MDPI, vol. 4(4), pages 1-17, September.
    6. Luo, Yu & Graham, Daniel J. & McCoy, Emma J., 2023. "Semiparametric Bayesian doubly robust causal estimation," LSE Research Online Documents on Economics 117944, London School of Economics and Political Science, LSE Library.
    7. Ruoqing Zhu & Ying-Qi Zhao & Guanhua Chen & Shuangge Ma & Hongyu Zhao, 2017. "Greedy outcome weighted tree learning of optimal personalized treatment rules," Biometrics, The International Biometric Society, vol. 73(2), pages 391-400, June.
    8. Rich Benjamin & Moodie Erica E. M. & A. Stephens David, 2016. "Influence Re-weighted G-Estimation," The International Journal of Biostatistics, De Gruyter, vol. 12(1), pages 157-177, May.
    9. Peng Wu & Donglin Zeng & Haoda Fu & Yuanjia Wang, 2020. "On using electronic health records to improve optimal treatment rules in randomized trials," Biometrics, The International Biometric Society, vol. 76(4), pages 1075-1086, December.
    10. Weibin Mo & Yufeng Liu, 2022. "Efficient learning of optimal individualized treatment rules for heteroscedastic or misspecified treatment‐free effect models," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 84(2), pages 440-472, April.
    11. Xiaofei Bai & Anastasios A. Tsiatis & Wenbin Lu & Rui Song, 2017. "Optimal treatment regimes for survival endpoints using a locally-efficient doubly-robust estimator from a classification perspective," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(4), pages 585-604, October.
    12. Qizhao Chen & Morgane Austern & Vasilis Syrgkanis, 2023. "Inference on Optimal Dynamic Policies via Softmax Approximation," Papers 2303.04416, arXiv.org, revised Dec 2023.
    13. Wei Liu & Zhiwei Zhang & Lei Nie & Guoxing Soon, 2017. "A Case Study in Personalized Medicine: Rilpivirine Versus Efavirenz for Treatment-Naive HIV Patients," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(520), pages 1381-1392, October.
    14. Baqun Zhang & Min Zhang, 2018. "C‐learning: A new classification framework to estimate optimal dynamic treatment regimes," Biometrics, The International Biometric Society, vol. 74(3), pages 891-899, September.
    15. Xin Chen & Rui Song & Jiajia Zhang & Swann Arp Adams & Liuquan Sun & Wenbin Lu, 2022. "On estimating optimal regime for treatment initiation time based on restricted mean residual lifetime," Biometrics, The International Biometric Society, vol. 78(4), pages 1377-1389, December.
    16. Sies Aniek & Van Mechelen Iven, 2017. "Comparing Four Methods for Estimating Tree-Based Treatment Regimes," The International Journal of Biostatistics, De Gruyter, vol. 13(1), pages 1-20, May.
    17. Ruohan Zhan & Zhimei Ren & Susan Athey & Zhengyuan Zhou, 2021. "Policy Learning with Adaptively Collected Data," Papers 2105.02344, arXiv.org, revised Nov 2022.
    18. Yanqing Wang & Yingqi Zhao & Yingye Zheng, 2022. "Targeted Search for Individualized Clinical Decision Rules to Optimize Clinical Outcomes," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 14(3), pages 564-581, December.
    19. Hongming Pu & Bo Zhang, 2021. "Estimating optimal treatment rules with an instrumental variable: A partial identification learning approach," Journal of the Royal Statistical Society Series B, Royal Statistical Society, vol. 83(2), pages 318-345, April.
    20. Jiacheng Wu & Nina Galanter & Susan M. Shortreed & Erica E.M. Moodie, 2022. "Ranking tailoring variables for constructing individualized treatment rules: An application to schizophrenia," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(2), pages 309-330, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:bla:biomet:v:79:y:2023:i:4:p:3676-3689. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Wiley Content Delivery (email available below). General contact details of provider: http://www.blackwellpublishing.com/journal.asp?ref=0006-341X .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.