IDEAS home Printed from https://ideas.repec.org/a/gam/jrisks/v13y2025i2p20-d1573147.html
   My bibliography  Save this article

Data Mining for the Adjustment of Credit Scoring Models in Solidarity Economy Entities: A Methodology for Addressing Class Imbalances

Author

Listed:
  • Ivan Mauricio Bermudez Vera

    (School of Industrial Engineering, Universidad del Valle, Cali 760042, Colombia)

  • Jaime Mosquera Restrepo

    (School of Statistics, Universidad del Valle, Cali 760042, Colombia)

  • Diego Fernando Manotas-Duque

    (School of Industrial Engineering, Universidad del Valle, Cali 760042, Colombia)

Abstract

This study addresses the quantification of credit risk in solidarity economy entities, proposing a new methodology to redefine the concept of a “default” in the frequent situations of extreme class imbalances. The objective is to develop and evaluate credit scoring models that enhance risk management by incorporating internal and external data to assess default risk. Data mining techniques are applied to address class imbalances, redefining the term “default” to include external credit information and increasing the representation of the minority class. The effectiveness of machine learning and statistical models is evaluated using class-balancing methods such as under-sampling, over-sampling, and the Synthetic Minority Over-sampling Technique (SMOTE). The evaluation is based on the Balanced Accuracy metric and the holding power of the performance, ensuring a consistent predictive power of the model while avoiding overfitting. While machine learning methods can improve credit scoring, logistic regression-based models remain effective, especially when combined with class-balancing techniques. It is concluded that a balanced sample in a class size is essential to improve predictive performance.

Suggested Citation

  • Ivan Mauricio Bermudez Vera & Jaime Mosquera Restrepo & Diego Fernando Manotas-Duque, 2025. "Data Mining for the Adjustment of Credit Scoring Models in Solidarity Economy Entities: A Methodology for Addressing Class Imbalances," Risks, MDPI, vol. 13(2), pages 1-16, January.
  • Handle: RePEc:gam:jrisks:v:13:y:2025:i:2:p:20-:d:1573147
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-9091/13/2/20/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-9091/13/2/20/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. D. J. Hand & W. E. Henley, 1997. "Statistical Classification Methods in Consumer Credit Scoring: a Review," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 160(3), pages 523-541, September.
    2. Doris Fejza & Dritan Nace & Orjada Kulla, 2022. "The Credit Risk Problem—A Developing Country Case Study," Risks, MDPI, vol. 10(8), pages 1-11, July.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gregorio Izquierdo Llanes & Antonio Salcedo, 2025. "Fuzzy Non-Payment Risk Management Rooted in Optimized Household Consumption Units," Risks, MDPI, vol. 13(4), pages 1-13, April.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Dangxing Chen & Weicheng Ye & Jiahui Ye, 2022. "Interpretable Selective Learning in Credit Risk," Papers 2209.10127, arXiv.org.
    2. Jonathan K. Budd & Peter G. Taylor, 2015. "Calculating optimal limits for transacting credit card customers," Papers 1506.05376, arXiv.org, revised Aug 2015.
    3. Dinh, K. & Kleimeier, S., 2006. "Credit scoring for Vietnam's retail banking market : implementation and implications for transactional versus relationship lending," Research Memorandum 012, Maastricht University, Maastricht Research School of Economics of Technology and Organization (METEOR).
    4. Ulf Römer & Oliver Musshoff, 2017. "Can agricultural credit scoring for microfinance institutions be implemented and improved by weather data?," Agricultural Finance Review, Emerald Group Publishing Limited, vol. 78(1), pages 83-97, December.
    5. Christa Gibbs & Benedict Guttman-Kenney & Donghoon Lee & Scott Nelson & Wilbert van der Klaauw & Jialan Wang, 2025. "Consumer Credit Reporting Data," Journal of Economic Literature, American Economic Association, vol. 63(2), pages 598-636, June.
    6. Kraft, Holger & Kroisandt, Gerald & Müller, Marlene, 2002. "Assessing the discriminatory power of credit scores," SFB 373 Discussion Papers 2002,67, Humboldt University of Berlin, Interdisciplinary Research Project 373: Quantification and Simulation of Economic Processes.
    7. Chen Ying & Härdle Wolfgang K. & He Qiang & Majer Piotr, 2018. "Risk related brain regions detection and individual risk classification with 3D image FPCA," Statistics & Risk Modeling, De Gruyter, vol. 35(3-4), pages 89-110, July.
    8. Sun, Weixin & Zhang, Xuantao & Li, Minghao & Wang, Yong, 2023. "Interpretable high-stakes decision support system for credit default forecasting," Technological Forecasting and Social Change, Elsevier, vol. 196(C).
    9. Thomas Wainwright, 2011. "Elite Knowledges: Framing Risk and the Geographies of Credit," Environment and Planning A, , vol. 43(3), pages 650-665, March.
    10. Roy Cerqueti & Francesca Pampurini & Annagiulia Pezzola & Anna Grazia Quaranta, 2022. "Dangerous liasons and hot customers for banks," Review of Quantitative Finance and Accounting, Springer, vol. 59(1), pages 65-89, July.
    11. R T Stewart, 2011. "A profit-based scoring system in consumer credit: making acquisition decisions for credit cards," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 62(9), pages 1719-1725, September.
    12. Rasa Kanapickiene & Renatas Spicas, 2019. "Credit Risk Assessment Model for Small and Micro-Enterprises: The Case of Lithuania," Risks, MDPI, vol. 7(2), pages 1-23, June.
    13. Erwis Melchor Pérez & Moisés Emmanuel Ramírez Guzmán & Araceli Hernández Jiménez & Agustín Santiago Alvarado, 2024. "Predicción del riesgo crediticio a microfinanciera usando aprendizaje computacional," Remef - Revista Mexicana de Economía y Finanzas Nueva Época REMEF (The Mexican Journal of Economics and Finance), Instituto Mexicano de Ejecutivos de Finanzas, IMEF, vol. 19(4), pages 1-16, Octubre -.
    14. Rodrigo Alfaro A. & David Pacheco L. & Andrés Sagner T, 2011. "Dinámica de la Tasa de Incumplimiento de Créditos de Consumo en Cuotas," Notas de Investigación Journal Economía Chilena (The Chilean Economy), Central Bank of Chile, vol. 14(2), pages 119-124, August.
    15. Theuri, Joseph & Olukuru, John, 2022. "The impact of Artficial Intelligence and how it is shaping banking," KBA Centre for Research on Financial Markets and Policy Working Paper Series 61, Kenya Bankers Association (KBA).
    16. Zhiyong Li & Xinyi Hu & Ke Li & Fanyin Zhou & Feng Shen, 2020. "Inferring the outcomes of rejected loans: an application of semisupervised clustering," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 183(2), pages 631-654, February.
    17. K Rajaratnam & P Beling & G Overstreet, 2010. "Scoring decisions in the context of economic uncertainty," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 61(3), pages 421-429, March.
    18. Richard Chamboko & Jorge M. Bravo, 2016. "On the modelling of prognosis from delinquency to normal performance on retail consumer loans," Risk Management, Palgrave Macmillan, vol. 18(4), pages 264-287, December.
    19. Adnan Dželihodžić & Dženana Đonko & Jasmin Kevrić, 2018. "Improved Credit Scoring Model Based on Bagging Neural Network," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 17(06), pages 1725-1741, November.
    20. Manuel Hernandez & Maximo Torero, 2014. "Parametric versus nonparametric methods in risk scoring: an application to microcredit," Empirical Economics, Springer, vol. 46(3), pages 1057-1079, May.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jrisks:v:13:y:2025:i:2:p:20-:d:1573147. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.