IDEAS home Printed from https://ideas.repec.org/a/spr/lifeda/v28y2022i3d10.1007_s10985-022-09552-w.html
   My bibliography  Save this article

Mixture survival trees for cancer risk classification

Author

Listed:
  • Beilin Jia

    (University of North Carolina at Chapel Hill)

  • Donglin Zeng

    (University of North Carolina at Chapel Hill)

  • Jason J. Z. Liao

    (Incyte Corporation)

  • Guanghan F. Liu

    (Merck & Co., Inc)

  • Xianming Tan

    (University of North Carolina at Chapel Hill)

  • Guoqing Diao

    (The George Washington University)

  • Joseph G. Ibrahim

    (University of North Carolina at Chapel Hill)

Abstract

In oncology studies, it is important to understand and characterize disease heterogeneity among patients so that patients can be classified into different risk groups and one can identify high-risk patients at the right time. This information can then be used to identify a more homogeneous patient population for developing precision medicine. In this paper, we propose a mixture survival tree approach for direct risk classification. We assume that the patients can be classified into a pre-specified number of risk groups, where each group has distinct survival profile. Our proposed tree-based methods are devised to estimate latent group membership using an EM algorithm. The observed data log-likelihood function is used as the splitting criterion in recursive partitioning. The finite sample performance is evaluated by extensive simulation studies and the proposed method is illustrated by a case study in breast cancer.

Suggested Citation

  • Beilin Jia & Donglin Zeng & Jason J. Z. Liao & Guanghan F. Liu & Xianming Tan & Guoqing Diao & Joseph G. Ibrahim, 2022. "Mixture survival trees for cancer risk classification," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 28(3), pages 356-379, July.
  • Handle: RePEc:spr:lifeda:v:28:y:2022:i:3:d:10.1007_s10985-022-09552-w
    DOI: 10.1007/s10985-022-09552-w
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10985-022-09552-w
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10985-022-09552-w?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Hoora Moradian & Denis Larocque & François Bellavance, 2017. "$$L_1$$ L 1 splitting rules in survival forests," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(4), pages 671-691, October.
    2. Donglin Zeng & Lu Mao & D. Y. Lin, 2016. "Maximum likelihood estimation for semiparametric transformation models with interval-censored data," Biometrika, Biometrika Trust, vol. 103(2), pages 253-271.
    3. Yan Zhou & John McArdle, 2015. "Rationale and Applications of Survival Tree and Survival Ensemble Methods," Psychometrika, Springer;The Psychometric Society, vol. 80(3), pages 811-833, September.
    4. Yueh-Yun Chi & Joseph G. Ibrahim, 2006. "Joint Models for Multivariate Longitudinal and Multivariate Survival Data," Biometrics, The International Biometric Society, vol. 62(2), pages 432-445, June.
    5. Juan Shen & Xuming He, 2015. "Inference for Subgroup Analysis With a Structured Logistic-Normal Mixture Model," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(509), pages 303-312, March.
    6. Martin G. Larson & Gregg E. Dinse, 1985. "A Mixture Model for the Regression Analysis of Competing Risks Data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 34(3), pages 201-211, November.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Alina Schenk & Moritz Berger & Matthias Schmid, 2024. "Pseudo-value regression trees," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 30(2), pages 439-471, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Wentian Guo & Yuan Ji & Daniel V. T. Catenacci, 2017. "A subgroup cluster-based Bayesian adaptive design for precision medicine," Biometrics, The International Biometric Society, vol. 73(2), pages 367-377, June.
    2. Oi, Katsuya, 2020. "Disuse as time away from a cognitively demanding job; how does it temporally or developmentally impact late-life cognition?," Intelligence, Elsevier, vol. 82(C).
    3. Choi, Taehwa & Kim, Arlene K.H. & Choi, Sangbum, 2021. "Semiparametric least-squares regression with doubly-censored data," Computational Statistics & Data Analysis, Elsevier, vol. 164(C).
    4. Sankaran, P.G. & Anisha, P., 2012. "Additive hazards models for gap time data with multiple causes," Statistics & Probability Letters, Elsevier, vol. 82(7), pages 1454-1462.
    5. Yangxin Huang & Xiaosun Lu & Jiaqing Chen & Juan Liang & Miriam Zangmeister, 2018. "Joint model-based clustering of nonlinear longitudinal trajectories and associated time-to-event data analysis, linked by latent class membership: with application to AIDS clinical studies," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 24(4), pages 699-718, October.
    6. Mo Leo S. F. & Yau Kelvin K. W., 2010. "Survival Mixture Model for Credit Risk Analysis," Asia-Pacific Journal of Risk and Insurance, De Gruyter, vol. 4(2), pages 1-20, July.
    7. Wang, Wuyi & Su, Liangjun, 2021. "Identifying latent group structures in nonlinear panels," Journal of Econometrics, Elsevier, vol. 220(2), pages 272-295.
    8. Henrique Aparecido Laureano & Ricardo Rasmussen Petterle & Guilherme Parreira da Silva & Paulo Justiniano Ribeiro Junior & Wagner Hugo Bonat, 2024. "A multinomial generalized linear mixed model for clustered competing risks data," Computational Statistics, Springer, vol. 39(3), pages 1417-1434, May.
    9. Lu Mao & Dan-Yu Lin & Donglin Zeng, 2017. "Semiparametric regression analysis of interval-censored competing risks data," Biometrics, The International Biometric Society, vol. 73(3), pages 857-865, September.
    10. Shao, Lihui & Wu, Jiaqi & Zhang, Weiping & Chen, Yu, 2024. "Integrated subgroup identification from multi-source data," Computational Statistics & Data Analysis, Elsevier, vol. 193(C).
    11. Li, Shuwei & Hu, Tao & Zhao, Xingqiu & Sun, Jianguo, 2019. "A class of semiparametric transformation cure models for interval-censored failure time data," Computational Statistics & Data Analysis, Elsevier, vol. 133(C), pages 153-165.
    12. Pierpaolo De Blasi & Nils L. Hjort, 2007. "The Bernstein-Von Mises Theorem in Semiparametric Competing Risks Models," ICER Working Papers - Applied Mathematics Series 17-2007, ICER - International Centre for Economic Research.
    13. Xi Ning & Yinghao Pan & Yanqing Sun & Peter B. Gilbert, 2023. "A semiparametric Cox–Aalen transformation model with censored data," Biometrics, The International Biometric Society, vol. 79(4), pages 3111-3125, December.
    14. Fan Feng & Guanghui Cheng & Jianguo Sun, 2023. "Variable Selection for Length-Biased and Interval-Censored Failure Time Data," Mathematics, MDPI, vol. 11(22), pages 1-20, November.
    15. Ying Huang & Juhee Cho & Youyi Fong, 2021. "Threshold‐based subgroup testing in logistic regression models in two‐phase sampling designs," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(2), pages 291-311, March.
    16. Meitz, Mika & Saikkonen, Pentti, 2021. "Testing for observation-dependent regime switching in mixture autoregressive models," Journal of Econometrics, Elsevier, vol. 222(1), pages 601-624.
    17. Tian Tian & Jianguo Sun, 2024. "Variable Selection for Nonlinear Covariate Effects with Interval-Censored Failure Time Data," Statistics in Biosciences, Springer;International Chinese Statistical Association, vol. 16(1), pages 185-202, April.
    18. Fei Gao & Kwun Chuen Gary Chan, 2019. "Semiparametric regression analysis of length‐biased interval‐censored data," Biometrics, The International Biometric Society, vol. 75(1), pages 121-132, March.
    19. Peihua Qiu & Lu You, 2022. "Dynamic disease screening by joint modelling of survival and longitudinal data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 71(5), pages 1158-1180, November.
    20. Shuyang Song & Jiaqi Wu & Weiping Zhang, 2025. "Integrative subgroup analysis for high-dimensional mixed-type multi-response data," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 34(1), pages 151-197, March.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:lifeda:v:28:y:2022:i:3:d:10.1007_s10985-022-09552-w. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.