IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2601.20533.html

Incorporating data drift to perform survival analysis on credit risk

Author

Listed:
  • Jianwei Peng

    (Humboldt-Universit\"at zu Berlin)

  • Stefan Lessmann

    (Humboldt-Universit\"at zu Berlin
    Bucharest University of Economic Studies)

Abstract

Survival analysis has become a standard approach for modelling time to default by time-varying covariates in credit risk. Unlike most existing methods that implicitly assume a stationary data-generating process, in practise, mortgage portfolios are exposed to various forms of data drift caused by changing borrower behaviour, macroeconomic conditions, policy regimes and so on. This study investigates the impact of data drift on survival-based credit risk models and proposes a dynamic joint modelling framework to improve robustness under non-stationary environments. The proposed model integrates a longitudinal behavioural marker derived from balance dynamics with a discrete-time hazard formulation, combined with landmark one-hot encoding and isotonic calibration. Three types of data drift (sudden, incremental and recurring) are simulated and analysed on mortgage loan datasets from Freddie Mac. Experiments and corresponding evidence show that the proposed landmark-based joint model consistently outperforms classical survival models, tree-based drift-adaptive learners and gradient boosting methods in terms of discrimination and calibration across all drift scenarios, which confirms the superiority of our model design.

Suggested Citation

  • Jianwei Peng & Stefan Lessmann, 2026. "Incorporating data drift to perform survival analysis on credit risk," Papers 2601.20533, arXiv.org.
  • Handle: RePEc:arx:papers:2601.20533
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2601.20533
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Ewout W Steyerberg & Karel G M Moons & Danielle A van der Windt & Jill A Hayden & Pablo Perel & Sara Schroter & Richard D Riley & Harry Hemingway & Douglas G Altman & for the PROGRESS Group, 2013. "Prognosis Research Strategy (PROGRESS) 3: Prognostic Model Research," PLOS Medicine, Public Library of Science, vol. 10(2), pages 1-9, February.
    2. Cristina Arellano, 2008. "Default Risk and Income Fluctuations in Emerging Economies," American Economic Review, American Economic Association, vol. 98(3), pages 690-712, June.
    3. Lore Dirick & Gerda Claeskens & Bart Baesens, 2017. "Time to default in credit scoring using survival analysis: a benchmark study," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 68(6), pages 652-665, June.
    4. Lore Dirick & Tony Bellotti & Gerda Claeskens & Bart Baesens, 2019. "Macro-Economic Factors in Credit Risk Calculations: Including Time-Varying Covariates in Mixture Cure Models," Journal of Business & Economic Statistics, Taylor & Francis Journals, vol. 37(1), pages 40-53, January.
    5. Djeundje, Viani Biatat & Crook, Jonathan, 2019. "Dynamic survival models with varying coefficients for credit risks," European Journal of Operational Research, Elsevier, vol. 275(1), pages 319-333.
    6. Medina-Olivares, Victor & Calabrese, Raffaella & Crook, Jonathan & Lindgren, Finn, 2023. "Joint models for longitudinal and discrete survival data in credit scoring," European Journal of Operational Research, Elsevier, vol. 307(3), pages 1457-1473.
    7. Bellotti, Tony & Crook, Jonathan, 2013. "Forecasting and stress testing credit card default using dynamic models," International Journal of Forecasting, Elsevier, vol. 29(4), pages 563-574.
    8. Dimitris Rizopoulos & Laura A. Hatfield & Bradley P. Carlin & Johanna J. M. Takkenberg, 2014. "Combining Dynamic Predictions From Joint Models for Longitudinal and Time-to-Event Data Using Bayesian Model Averaging," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 109(508), pages 1385-1397, December.
    9. Kamaryn T. Tanner & Linda D. Sharples & Rhian M. Daniel & Ruth H. Keogh, 2021. "Dynamic survival prediction combining landmarking with a machine learning ensemble: Methodology and empirical comparison," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 184(1), pages 3-30, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Medina-Olivares, Victor & Calabrese, Raffaella & Crook, Jonathan & Lindgren, Finn, 2023. "Joint models for longitudinal and discrete survival data in credit scoring," European Journal of Operational Research, Elsevier, vol. 307(3), pages 1457-1473.
    2. Arno Botha & Tanja Verster, 2025. "Approaches for modelling the term-structure of default risk under IFRS 9: A tutorial using discrete-time survival analysis," Papers 2507.15441, arXiv.org, revised Dec 2025.
    3. Bocchio, Cecilia & Crook, Jonathan & Andreeva, Galina, 2023. "The impact of macroeconomic scenarios on recurrent delinquency: A stress testing framework of multi-state models for mortgages," International Journal of Forecasting, Elsevier, vol. 39(4), pages 1655-1677.
    4. Oliver Blümke, 2022. "Multiperiod default probability forecasting," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(4), pages 677-696, July.
    5. Dirick, Lore & Claeskens, Gerda & Vasnev, Andrey & Baesens, Bart, 2022. "A hierarchical mixture cure model with unobserved heterogeneity for credit risk," Econometrics and Statistics, Elsevier, vol. 22(C), pages 39-55.
    6. Luong, Thi Mai & Scheule, Harald, 2022. "Benchmarking forecast approaches for mortgage credit risk for forward periods," European Journal of Operational Research, Elsevier, vol. 299(2), pages 750-767.
    7. Hu, Wenbin & Zhou, Junzi, 2025. "Measuring and forecasting financial system resilience under multiple shocks: A survival analysis approach," Pacific-Basin Finance Journal, Elsevier, vol. 94(C).
    8. Cedric H. A. Koffi & Viani Biatat Djeundje & Olivier Menoukeu Pamen, 2024. "Quantifying socio-temporal effects of loan delinquency drivers in microfinance," Papers 2410.13100, arXiv.org, revised Aug 2025.
    9. Medina-Olivares, Victor & Calabrese, Raffaella & Dong, Yizhe & Shi, Baofeng, 2022. "Spatial dependence in microfinance credit default," International Journal of Forecasting, Elsevier, vol. 38(3), pages 1071-1085.
    10. Victor Medina-Olivares & Finn Lindgren & Raffaella Calabrese & Jonathan Crook, 2023. "Joint model for longitudinal and spatio-temporal survival data," Papers 2311.04008, arXiv.org.
    11. Thi Mai Luong, 2020. "Selection Effects of Lender and Borrower Choices on Risk Measurement, Management and Prudential Regulation," PhD Thesis, Finance Discipline Group, UTS Business School, University of Technology, Sydney, number 3-2020, January-A.
    12. Djeundje, Viani Biatat & Crook, Jonathan, 2019. "Identifying hidden patterns in credit risk survival data using Generalised Additive Models," European Journal of Operational Research, Elsevier, vol. 277(1), pages 366-376.
    13. Sultan Amed & Tanmay Sen & Sayantan Banerjee, 2026. "FSL-BDP: Federated Survival Learning with Bayesian Differential Privacy for Credit Risk Modeling," Papers 2601.11134, arXiv.org.
    14. Arno Botha & Tanja Verster & Bernard Scheepers, 2025. "Exploring different subtypes of recurrent event Cox-regression models in modelling lifetime default risk: A tutorial," Papers 2505.01044, arXiv.org, revised Jan 2026.
    15. Li, Aimin & Li, Zhiyong & Bellotti, Anthony, 2023. "Predicting loss given default of unsecured consumer loans with time-varying survival scores," Pacific-Basin Finance Journal, Elsevier, vol. 78(C).
    16. Medina-Olivares, Victor & Lindgren, Finn & Calabrese, Raffaella & Crook, Jonathan, 2025. "Joint model for longitudinal and spatio-temporal survival data," European Journal of Operational Research, Elsevier, vol. 327(3), pages 892-904.
    17. Calabrese, Raffaella & Dombrowski, Timothy & Mandel, Antoine & Pace, R. Kelley & Zanin, Luca, 2024. "Impacts of extreme weather events on mortgage risks and their evolution under climate change: A case study on Florida," European Journal of Operational Research, Elsevier, vol. 314(1), pages 377-392.
    18. Wenbin Hu & Junzi Zhou, 2025. "Building Technical Analysis Strategies Using Multivariate Longitudinal and Time-to-Event Data in Stock Markets," Computational Economics, Springer;Society for Computational Economics, vol. 66(3), pages 1911-1942, September.
    19. Arno Botha & Tanja Verster & Roelinde Bester, 2024. "The TruEnd-procedure: Treating trailing zero-valued balances in credit data," Papers 2404.17008, arXiv.org, revised Nov 2025.
    20. Pei, Youquan & Peng, Heng & Xu, Jinfeng, 2024. "A latent class Cox model for heterogeneous time-to-event data," Journal of Econometrics, Elsevier, vol. 239(2).

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2601.20533. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.