IDEAS home Printed from https://ideas.repec.org/a/spr/annopr/v346y2025i3d10.1007_s10479-025-06516-9.html
   My bibliography  Save this article

Investigating the impact of undersampling and bagging: an empirical investigation for customer attrition modeling

Author

Listed:
  • Arno Caigny

    (Univ. Lille, CNRS, UMR 9221 - LEM - Lille Economie Management)

  • Kristof Coussement

    (Univ. Lille, CNRS, UMR 9221 - LEM - Lille Economie Management)

  • Matthijs Meire

    (Univ. Lille, CNRS, UMR 9221 - LEM - Lille Economie Management)

  • Steven Hoornaert

    (Univ. Lille, CNRS, UMR 9221 - LEM - Lille Economie Management)

Abstract

Given the growing interest in using AI and analytics to support CRM decision making, we discuss why undersampling and bagging are popular prediction techniques in customer churn prediction (CCP). The former helps in tackling the class imbalance problem and the latter improves model stability. However, extant CCP literature is unclear on the impact of undersampling on model stability and predictive performance, while bagging has difficulties in handling the class imbalance problem. Therefore, we extend existing CCP research to benchmark underbagging, which combines undersampling and bagging. Having both prediction techniques combined we recuperate customer data that would have been lost in undersampling by using them in multiple bags and passing an undersampled, more balanced training set to the classifier. In an extensive experiment including 11 real-life CCP datasets, underbagging is benchmarked against its constituents and other popular CCP classifiers in terms of predictive performance, profit and operational efficiency. Our results indicate that underbagging is a valid and reliable alternative framework for CCP prediction.

Suggested Citation

  • Arno Caigny & Kristof Coussement & Matthijs Meire & Steven Hoornaert, 2025. "Investigating the impact of undersampling and bagging: an empirical investigation for customer attrition modeling," Annals of Operations Research, Springer, vol. 346(3), pages 2401-2421, March.
  • Handle: RePEc:spr:annopr:v:346:y:2025:i:3:d:10.1007_s10479-025-06516-9
    DOI: 10.1007/s10479-025-06516-9
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10479-025-06516-9
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10479-025-06516-9?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:annopr:v:346:y:2025:i:3:d:10.1007_s10479-025-06516-9. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.