IDEAS home Printed from https://ideas.repec.org/a/spr/lifeda/v23y2017i4d10.1007_s10985-016-9372-1.html
   My bibliography  Save this article

$$L_1$$ L 1 splitting rules in survival forests

Author

Listed:
  • Hoora Moradian

    (HEC Montréal)

  • Denis Larocque

    (HEC Montréal)

  • François Bellavance

    (HEC Montréal)

Abstract

The log-rank test is used as the split function in many commonly used survival trees and forests algorithms. However, the log-rank test may have a significant loss of power in some circumstances, especially when the hazard functions or when the survival functions cross each other in the two compared groups. We investigate the use of the integrated absolute difference between the two children nodes survival functions as the splitting rule. Simulations studies and applications to real data sets show that forests built with this rule produce very good results in general, and that they are often better compared to forests built with the log-rank splitting rule.

Suggested Citation

  • Hoora Moradian & Denis Larocque & François Bellavance, 2017. "$$L_1$$ L 1 splitting rules in survival forests," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 23(4), pages 671-691, October.
  • Handle: RePEc:spr:lifeda:v:23:y:2017:i:4:d:10.1007_s10985-016-9372-1
    DOI: 10.1007/s10985-016-9372-1
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10985-016-9372-1
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10985-016-9372-1?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Ishwaran, Hemant & Kogalur, Udaya B. & Gorodeski, Eiran Z. & Minn, Andy J. & Lauer, Michael S., 2010. "High-Dimensional Variable Selection for Survival Data," Journal of the American Statistical Association, American Statistical Association, vol. 105(489), pages 205-217.
    2. W. Sauerbrei & P. Royston, 1999. "Building multivariable prognostic and diagnostic models: transformation of the predictors by using fractional polynomials," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 162(1), pages 71-94.
    3. Ishwaran, Hemant & Kogalur, Udaya B., 2010. "Consistency of random survival forests," Statistics & Probability Letters, Elsevier, vol. 80(13-14), pages 1056-1064, July.
    4. Hothorn, Torsten & Lausen, Berthold, 2003. "On the exact distribution of maximally selected rank statistics," Computational Statistics & Data Analysis, Elsevier, vol. 43(2), pages 121-137, June.
    5. Ruoqing Zhu & Michael R. Kosorok, 2012. "Recursively Imputed Survival Trees," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 107(497), pages 331-340, March.
    6. Mogensen, Ulla B. & Ishwaran, Hemant & Gerds, Thomas A., 2012. "Evaluating Random Forests for Survival Analysis Using Prediction Error Curves," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 50(i11).
    7. N. E. Breslow & N. Chatterjee, 1999. "Design and analysis of two‐phase studies with binary outcome applied to Wilms tumour prognosis," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 48(4), pages 457-468.
    8. Rokach, Lior, 2009. "Taxonomy for characterizing ensemble methods in classification tasks: A review and annotated bibliography," Computational Statistics & Data Analysis, Elsevier, vol. 53(12), pages 4046-4072, October.
    9. Patrick Royston & Gareth Ambler, 1999. "Multivariable fractional polynomials," Stata Technical Bulletin, StataCorp LP, vol. 8(43).
    10. Lin, Yi & Jeon, Yongho, 2006. "Random Forests and Adaptive Nearest Neighbors," Journal of the American Statistical Association, American Statistical Association, vol. 101, pages 578-590, June.
    11. Ciampi, Antonio & Thiffault, Johanne & Nakache, Jean-Pierre & Asselain, Bernard, 1986. "Stratification by stepwise regression, correspondence analysis and recursive partition: a comparison of three methods of analysis for survival data with covariates," Computational Statistics & Data Analysis, Elsevier, vol. 4(3), pages 185-204, October.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Yifei Sun & Sy Han Chiou & Mei‐Cheng Wang, 2020. "ROC‐guided survival trees and ensembles," Biometrics, The International Biometric Society, vol. 76(4), pages 1177-1189, December.
    2. Beilin Jia & Donglin Zeng & Jason J. Z. Liao & Guanghan F. Liu & Xianming Tan & Guoqing Diao & Joseph G. Ibrahim, 2022. "Mixture survival trees for cancer risk classification," Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, Springer, vol. 28(3), pages 356-379, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Yifei Sun & Sy Han Chiou & Mei‐Cheng Wang, 2020. "ROC‐guided survival trees and ensembles," Biometrics, The International Biometric Society, vol. 76(4), pages 1177-1189, December.
    2. Susan Athey & Julie Tibshirani & Stefan Wager, 2016. "Generalized Random Forests," Papers 1610.01271, arXiv.org, revised Apr 2018.
    3. Marco Caliendo & Stefan Tübbicke, 2020. "New evidence on long-term effects of start-up subsidies: matching estimates and their robustness," Empirical Economics, Springer, vol. 59(4), pages 1605-1631, October.
    4. Annalisa Orenti & Patrizia Boracchi & Giuseppe Marano & Elia Biganzoli & Federico Ambrogi, 2022. "A pseudo-values regression model for non-fatal event free survival in the presence of semi-competing risks," Statistical Methods & Applications, Springer;Società Italiana di Statistica, vol. 31(3), pages 709-727, September.
    5. Strasak, Alexander M. & Umlauf, Nikolaus & Pfeiffer, Ruth M. & Lang, Stefan, 2011. "Comparing penalized splines and fractional polynomials for flexible modelling of the effects of continuous predictor variables," Computational Statistics & Data Analysis, Elsevier, vol. 55(4), pages 1540-1551, April.
    6. Stefanie Hieke & Harald Binder & Alexandra Nieters & Martin Schumacher, 2014. "minPtest: a resampling based gene region-level testing procedure for genetic case-control studies," Computational Statistics, Springer, vol. 29(1), pages 51-63, February.
    7. Marisa Rifada & Vita Ratnasari & Purhadi Purhadi, 2023. "Parameter Estimation and Hypothesis Testing of The Bivariate Polynomial Ordinal Logistic Regression Model," Mathematics, MDPI, vol. 11(3), pages 1-12, January.
    8. Patrick Royston, 2012. "Tools to simulate realistic censored survival-time distributions," Stata Journal, StataCorp LP, vol. 12(4), pages 639-654, December.
    9. Schäfer, Dorothea & Werwatz, Axel & Zimmermann, Volker, 2004. "The Determinants of Debt and (Private) Equity Financing : The Case of Young, Innovative SMEs from Germany," EconStor Open Access Articles and Book Chapters, ZBW - Leibniz Information Centre for Economics, vol. 11(3), pages 225-248.
    10. Sauerbrei, W. & Meier-Hirmer, C. & Benner, A. & Royston, P., 2006. "Multivariable regression model building by using fractional polynomials: Description of SAS, STATA and R programs," Computational Statistics & Data Analysis, Elsevier, vol. 50(12), pages 3464-3485, August.
    11. Sauerbrei, Willi & Royston, Patrick & Zapien, Karina, 2007. "Detecting an interaction between treatment and a continuous covariate: A comparison of two approaches," Computational Statistics & Data Analysis, Elsevier, vol. 51(8), pages 4054-4063, May.
    12. William D. Dupont, 2010. "Review of Multivariable Model-building: A Pragmatic Approach to Regression Analysis Based on Fractional Polynomials for Modeling Continuous Variables, by Royston and Sauerbrei," Stata Journal, StataCorp LP, vol. 10(2), pages 297-302, June.
    13. Suvra Pal & Hongbo Yu & Zachary D. Loucks & Ian M. Harris, 2020. "Illustration of the Flexibility of Generalized Gamma Distribution in Modeling Right Censored Survival Data: Analysis of Two Cancer Datasets," Annals of Data Science, Springer, vol. 7(1), pages 77-90, March.
    14. Patrick Royston, 2007. "Profile likelihood for estimation and confidence intervals," Stata Journal, StataCorp LP, vol. 7(3), pages 376-387, September.
    15. Patrick Royston, 2004. "Multiple imputation of missing values," Stata Journal, StataCorp LP, vol. 4(3), pages 227-241, September.
    16. Stefanie Hieke & Axel Benner & Richard F Schlenk & Martin Schumacher & Lars Bullinger & Harald Binder, 2016. "Identifying Prognostic SNPs in Clinical Cohorts: Complementing Univariate Analyses by Resampling and Multivariable Modeling," PLOS ONE, Public Library of Science, vol. 11(5), pages 1-18, May.
    17. Ruoqing Zhu & Donglin Zeng & Michael R. Kosorok, 2015. "Reinforcement Learning Trees," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 110(512), pages 1770-1784, December.
    18. Mike G. Tsionas, 2017. "“When, Where, and How” of Efficiency Estimation: Improved Procedures for Stochastic Frontier Modeling," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 112(519), pages 948-965, July.
    19. Gérard Biau & Erwan Scornet, 2016. "A random forest guided tour," TEST: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 25(2), pages 197-227, June.
    20. Royston, P. & Sauerbrei, W., 2007. "Improving the robustness of fractional polynomial models by preliminary covariate transformation: A pragmatic approach," Computational Statistics & Data Analysis, Elsevier, vol. 51(9), pages 4240-4253, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:lifeda:v:23:y:2017:i:4:d:10.1007_s10985-016-9372-1. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.