IDEAS home Printed from https://ideas.repec.org/a/gam/jdataj/v10y2025i5p63-d1644662.html
   My bibliography  Save this article

From Crisis to Algorithm: Credit Delinquency Prediction in Peru Under Critical External Factors Using Machine Learning

Author

Listed:
  • Jomark Noriega

    (Facultad de Ingeniería de Sistemas e Informática, Escuela de Posgrado, Universidad Nacional Mayor de San Marcos, Campus Ciudad Universitaria, Calle Germán Amézaga 375, Lima 15801, Peru
    Financiera QAPAQ, Lima 150120, Peru
    These authors contributed equally to this work.)

  • Luis Rivera

    (Facultad de Ingeniería de Sistemas e Informática, Escuela de Posgrado, Universidad Nacional Mayor de San Marcos, Campus Ciudad Universitaria, Calle Germán Amézaga 375, Lima 15801, Peru
    Center of Sciences and Technology (CCT), Laboratory of Mathematical Sciences (LCMAT), State University of North Fluminense Darcy Ribeiro (UENF), Campos dos Goytacazes, Rio de Janeiro 28015-602, Brazil)

  • Jorge Castañeda

    (Facultad de Ingeniería, Escuela de Ingeniería Informática, Universidad San Ignacio de Loyola, Campus 2 La Molina, Av. Fontana 550, La Molina 15026, Peru
    These authors contributed equally to this work.)

  • José Herrera

    (Facultad de Ingeniería de Sistemas e Informática, Escuela de Posgrado, Universidad Nacional Mayor de San Marcos, Campus Ciudad Universitaria, Calle Germán Amézaga 375, Lima 15801, Peru
    Facultad de Ciencias Informáticas, Departamento de Lenguajes y Programación, Escuela de Postgrado, Universidad Pablo de Olavide, Campus Carretera de Utrera km 1, Andalucía, 41013 Sevilla, Spain)

Abstract

Robust credit risk prediction in emerging economies increasingly demands the integration of external factors (EFs) beyond borrowers’ control. This study introduces a scenario-based methodology to incorporate EF—namely COVID-19 severity (mortality and confirmed cases), climate anomalies (temperature deviations, weather-induced road blockages), and social unrest—into machine learning (ML) models for credit delinquency prediction. The approach is grounded in a CRISP-DM framework, combining stationarity testing (Dickey–Fuller), causality analysis (Granger), and post hoc explainability (SHAP, LIME), along with performance evaluation via AUC, ACC, KS, and F1 metrics. The empirical analysis uses nearly 8.2 million records compiled from multiple sources, including 367,000 credit operations granted to individuals and microbusiness owners by a regulated Peruvian financial institution (FMOD) between January 2020 and September 2023. These data also include time series of delinquency by economic activity, external factor indicators (e.g., mortality, climate disruptions, and protest events), and their dynamic interactions assessed through Granger causality to evaluate both the intensity and propagation of external shocks. The results confirm that EF inclusion significantly enhances model performance and robustness. Time-lagged mortality (COVID MOV) emerges as the most powerful single predictor of delinquency, while compound crises (climate and unrest) further intensify default risk—particularly in portfolios without public support. Among the evaluated models, CNN and XGB consistently demonstrate superior adaptability, defined as their ability to maintain strong predictive performance across diverse stress scenarios—including pandemic, climate, and unrest contexts—and to dynamically adjust to varying input distributions and portfolio conditions. Post hoc analyses reveal that EF effects dynamically interact with borrower income, indebtedness, and behavioral traits. This study provides a scalable, explainable framework for integrating systemic shocks into credit risk modeling. The findings contribute to more informed, adaptive, and transparent lending decisions in volatile economic contexts, relevant to financial institutions, regulators, and risk practitioners in emerging markets.

Suggested Citation

  • Jomark Noriega & Luis Rivera & Jorge Castañeda & José Herrera, 2025. "From Crisis to Algorithm: Credit Delinquency Prediction in Peru Under Critical External Factors Using Machine Learning," Data, MDPI, vol. 10(5), pages 1-53, April.
  • Handle: RePEc:gam:jdataj:v:10:y:2025:i:5:p:63-:d:1644662
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2306-5729/10/5/63/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2306-5729/10/5/63/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jdataj:v:10:y:2025:i:5:p:63-:d:1644662. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.