IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0315928.html
   My bibliography  Save this article

Classification-augmented survival estimation (CASE): A novel method for individualized long-term survival prediction with application to liver transplantation

Author

Listed:
  • Hamed Shourabizadeh
  • Dionne M Aleman
  • Louis-Martin Rousseau
  • Katina Zheng
  • Mamatha Bhat

Abstract

Survival analysis is critical in many fields, particularly in healthcare where it can guide medical decisions. Conventional survival analysis methods like Kaplan-Meier and Cox proportional hazards models to generate survival curves indicating probability of survival v. time have limitations, especially for long-term prediction, due to assumptions that all instances follow a general population-level survival curve. Machine learning classification models, even those designed for survival predictions like random survival forest (RSF), also struggle to provide accurate long-term predictions due to class imbalance. We improve upon traditional survival machine learning approaches through a novel framework called classification-augmented survival estimation (CASE), which treats survival as a classification task that ultimately yields survival curves, beginning with dataset augmentation to improve class imbalance for use with any classification model. Unlike other approaches, CASE additionally provides an exact survival time prediction. We demonstrate CASE on a liver transplant case study to predict >20 years survival post-transplant, finding that CASE dataset augmentation improved AUCs from 0.69 to 0.88 and F1 scores from 0.32 to 0.73. Compared to Kaplan-Meier, Cox, and RSF survival models, the CASE framework demonstrated better performance across various existing survival metrics, as well as our novel metric, mean of individual areas under the survival curve (mAUSC). Further, we develop novel temporal feature importance methods to understand how different features may vary in survival importance over time, potentially providing actionable insights in real-world survival problems.

Suggested Citation

  • Hamed Shourabizadeh & Dionne M Aleman & Louis-Martin Rousseau & Katina Zheng & Mamatha Bhat, 2025. "Classification-augmented survival estimation (CASE): A novel method for individualized long-term survival prediction with application to liver transplantation," PLOS ONE, Public Library of Science, vol. 20(1), pages 1-27, January.
  • Handle: RePEc:plo:pone00:0315928
    DOI: 10.1371/journal.pone.0315928
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0315928
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0315928&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0315928?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dimitris Papathanasiou & Konstantinos Demertzis & Nikos Tziritas, 2023. "Machine Failure Prediction Using Survival Analysis," Future Internet, MDPI, vol. 15(5), pages 1-26, April.
    2. Adrian Bowman & Stuart Young, 1996. "Graphical Comparison of Nonparametric Curves," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 45(1), pages 83-98, March.
    3. B. Larivière & D. Van Den Poel, 2004. "Investigating the role of product features in preventing customer churn, by using survival analysis and choice modeling: The case of financial services," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 04/223, Ghent University, Faculty of Economics and Business Administration.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Igor Kabashkin & Vitaly Susanin, 2024. "Decision-Making Model for Life Cycle Management of Aircraft Components," Mathematics, MDPI, vol. 12(22), pages 1-43, November.
    2. J. Burez & D. Van Den Poel, 2005. "CRM at a Pay-TV Company: Using Analytical Models to Reduce Customer Attrition by Targeted Marketing for Subscription Services," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 05/348, Ghent University, Faculty of Economics and Business Administration.
    3. Slãvescu Ecaterina Oana & Panait Iulian, 2012. "Improving Customer Churn Models as one of Customer Relationship Management Business Solutions for the Telecommunication Industry," Ovidius University Annals, Economic Sciences Series, Ovidius University of Constantza, Faculty of Economic Sciences, vol. 0(1), pages 1156-1160, May.
    4. E Lima & C Mues & B Baesens, 2009. "Domain knowledge integration in data mining using decision tables: case studies in churn prediction," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(8), pages 1096-1106, August.
    5. Ali Dehghan & Theodore Trafalis, 2012. "Examining Churn and Loyalty Using Support Vector Machine," Business and Management Research, Business and Management Research, Sciedu Press, vol. 1(4), pages 153-161, December.
    6. K. Coussement & D. Van Den Poel, 2008. "Improving Customer Attrition Prediction by Integrating Emotions from Client/Company Interaction Emails and Evaluating Multiple Classifiers," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 08/527, Ghent University, Faculty of Economics and Business Administration.
    7. Diblasi, Angela & Bowman, Adrian, 1997. "Testing for constant variance in a linear model," Statistics & Probability Letters, Elsevier, vol. 33(1), pages 95-103, April.
    8. Alvaro Arroyo & Alvaro Cartea & Fernando Moreno-Pino & Stefan Zohren, 2023. "Deep Attentive Survival Analysis in Limit Order Books: Estimating Fill Probabilities with Convolutional-Transformers," Papers 2306.05479, arXiv.org.
    9. Bowman, A. W. & Azzalini, A., 2003. "Computational aspects of nonparametric smoothing with illustrations from the sm library," Computational Statistics & Data Analysis, Elsevier, vol. 42(4), pages 545-560, April.
    10. Bram Janssens & Matthias Bogaert & Astrid Bagué & Dirk Van den Poel, 2024. "B2Boost: instance-dependent profit-driven modelling of B2B churn," Annals of Operations Research, Springer, vol. 341(1), pages 267-293, October.
    11. Prinzie, Anita & Van den Poel, Dirk, 2006. "Investigating purchasing-sequence patterns for financial services using Markov, MTD and MTDg models," European Journal of Operational Research, Elsevier, vol. 170(3), pages 710-734, May.
    12. Liu, Meijun & Hu, Xiao & Wang, Yuandi & Shi, Dongbo, 2018. "Survive or perish: Investigating the life cycle of academic journals from 1950 to 2013 using survival analysis methods," Journal of Informetrics, Elsevier, vol. 12(1), pages 344-364.
    13. Giovanni C. Porzio, 2002. "A simulated band to check binary regression models," Metron - International Journal of Statistics, Dipartimento di Statistica, Probabilità e Statistiche Applicate - University of Rome, vol. 0(1-2), pages 83-96.
    14. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    15. Vera Miguéis & Dirk Poel & Ana Camanho & João Falcão e Cunha, 2012. "Predicting partial customer churn using Markov for discrimination for modeling first purchase sequences," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 6(4), pages 337-353, December.
    16. Lin, Wei & Kulasekera, K.B., 2010. "Testing the equality of linear single-index models," Journal of Multivariate Analysis, Elsevier, vol. 101(5), pages 1156-1167, May.
    17. A. Prinzie & D. Van Den Poel, 2005. "Incorporating sequential information into traditional classification models by using an element/position- sensitive SAM," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 05/292, Ghent University, Faculty of Economics and Business Administration.
    18. M. Ballings & D. Van Den Poel & E. Verhagen, 2013. "Evaluating the Added Value of Pictorial Data for Customer Churn Prediction," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 13/869, Ghent University, Faculty of Economics and Business Administration.
    19. B. Larivière & D. Van Den Poel, 2005. "Investigating the post-complaint period by means of survival analysis," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 05/299, Ghent University, Faculty of Economics and Business Administration.
    20. B. Larivière & D. Van Den Poel, 2004. "Predicting Customer Retention and Profitability by Using Random Forests and Regression Forests Techniques," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 04/282, Ghent University, Faculty of Economics and Business Administration.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0315928. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.