IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0278095.html
   My bibliography  Save this article

A novel customer churn prediction model for the telecommunication industry using data transformation methods and feature selection

Author

Listed:
  • Joydeb Kumar Sana
  • Mohammad Zoynul Abedin
  • M Sohel Rahman
  • M Saifur Rahman

Abstract

Customer churn is one of the most critical issues faced by the telecommunication industry (TCI). Researchers and analysts leverage customer relationship management (CRM) data through the use of various machine learning models and data transformation methods to identify the customers who are likely to churn. While several studies have been conducted in the customer churn prediction (CCP) context in TCI, a review of performance of the various models stemming from these studies show a clear room for improvement. Therefore, to improve the accuracy of customer churn prediction in the telecommunication industry, we have investigated several machine learning models, as well as, data transformation methods. To optimize the prediction models, feature selection has been performed using univariate technique and the best hyperparameters have been selected using the grid search method. Subsequently, experiments have been conducted on several publicly available TCI datasets to assess the performance of our models in terms of the widely used evaluation metrics, such as AUC, precision, recall, and F-measure. Through a rigorous experimental study, we have demonstrated the benefit of applying data transformation methods as well as feature selection while training an optimized CCP model. Our proposed technique improved the prediction performance by up to 26.2% and 17% in terms of AUC and F-measure, respectively.

Suggested Citation

  • Joydeb Kumar Sana & Mohammad Zoynul Abedin & M Sohel Rahman & M Saifur Rahman, 2022. "A novel customer churn prediction model for the telecommunication industry using data transformation methods and feature selection," PLOS ONE, Public Library of Science, vol. 17(12), pages 1-21, December.
  • Handle: RePEc:plo:pone00:0278095
    DOI: 10.1371/journal.pone.0278095
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0278095
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0278095&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0278095?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Kristof Coussement & Stefan Lessmann & Geert Verstraeten, 2017. "A comparative analysis of data preparation algorithms for customer churn prediction: A case study in the telecommunication industry," Post-Print hal-01745261, HAL.
    2. Denisa Maria Melian & Andreea Dumitrache & Stelian Stancu & Alexandra Nastu, 2022. "Customer Churn Prediction in Telecommunication Industry. A Data Analysis Techniques Approach," Postmodern Openings, Editura Lumen, Department of Economics, vol. 13(1Sup1), pages 78-104, March.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    2. Arno de Caigny & Kristof Coussement & Koen W. de Bock & Stefan Lessmann, 2019. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," Post-Print hal-02275958, HAL.
    3. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    4. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W. & Lessmann, Stefan, 2020. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1563-1578.
    5. Liu, Zhenkun & Zhang, Ying & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Hufang & Gao, Yuyang & Chen, Yinghao, 2024. "Profit-driven fusion framework based on bagging and boosting classifiers for potential purchaser prediction," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    6. Liu, Zhenkun & Jiang, Ping & De Bock, Koen W. & Wang, Jianzhou & Zhang, Lifang & Niu, Xinsong, 2024. "Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
    7. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    8. Lessmann, Stefan & Coussement, Kristof & De Bock, Koen W. & Haupt, Johannes, 2018. "Targeting customers for profit: An ensemble learning framework to support marketing decision making," IRTG 1792 Discussion Papers 2018-012, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    9. Lewlisa Saha & Hrudaya Kumar Tripathy & Tarek Gaber & Hatem El-Gohary & El-Sayed M. El-kenawy, 2023. "Deep Churn Prediction Method for Telecommunication Industry," Sustainability, MDPI, vol. 15(5), pages 1-21, March.
    10. Hui Zhu, 2023. "Oil Demand Forecasting in Importing and Exporting Countries: AI-Based Analysis of Endogenous and Exogenous Factors," Sustainability, MDPI, vol. 15(18), pages 1-19, September.
    11. Chen, Shuixia & Wang, Jian-qiang & Zhang, Hong-yu, 2019. "A hybrid PSO-SVM model based on clustering algorithm for short-term atmospheric pollutant concentration forecasting," Technological Forecasting and Social Change, Elsevier, vol. 146(C), pages 41-54.
    12. Kanellos, Nikolaos & Katsianis, Dimitrios & Varoutas, Dimitrios, 2022. "Forecasting a telecommunications provider's market share," 31st European Regional ITS Conference, Gothenburg 2022: Reining in Digital Platforms? Challenging monopolies, promoting competition and developing regulatory regimes 265639, International Telecommunications Society (ITS).
    13. Ebru Pekel Ozmen & Tuncay Ozcan, 2022. "A novel deep learning model based on convolutional neural networks for employee churn prediction," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 41(3), pages 539-550, April.
    14. Seema & Gaurav Gupta, 2024. "Development of fading channel patch based convolutional neural network models for customer churn prediction," International Journal of System Assurance Engineering and Management, Springer;The Society for Reliability, Engineering Quality and Operations Management (SREQOM),India, and Division of Operation and Maintenance, Lulea University of Technology, Sweden, vol. 15(1), pages 391-411, January.
    15. Feng, Yi & Yin, Yunqiang & Wang, Dujuan & Ignatius, Joshua & Cheng, T.C.E. & Marra, Marianna & Guo, Yihan, 2024. "Enhancing e-commerce customer churn management with a profit- and AUC-focused prescriptive analytics approach," Journal of Business Research, Elsevier, vol. 184(C).
    16. De Bock, Koen W. & Coussement, Kristof & Caigny, Arno De & Słowiński, Roman & Baesens, Bart & Boute, Robert N. & Choi, Tsan-Ming & Delen, Dursun & Kraus, Mathias & Lessmann, Stefan & Maldonado, Sebast, 2024. "Explainable AI for Operational Research: A defining framework, methods, applications, and a research agenda," European Journal of Operational Research, Elsevier, vol. 317(2), pages 249-272.
    17. Jean Robert Kala Kamdjoug & Hyacinthe Djanan Sando & Jules Raymond Kala & Arielle Ornela Ndassi Teutio & Sunil Tiwari & Samuel Fosso Wamba, 2024. "Data analytics-based auditing: a case study of fraud detection in the banking context," Annals of Operations Research, Springer, vol. 340(2), pages 1161-1188, September.
    18. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    19. Latifah Almuqren & Fatma S. Alrayes & Alexandra I. Cristea, 2021. "An Empirical Study on Customer Churn Behaviours Prediction Using Arabic Twitter Mining Approach," Future Internet, MDPI, vol. 13(7), pages 1-19, July.
    20. Arno Caigny & Kristof Coussement & Matthijs Meire & Steven Hoornaert, 2025. "Investigating the impact of undersampling and bagging: an empirical investigation for customer attrition modeling," Annals of Operations Research, Springer, vol. 346(3), pages 2401-2421, March.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0278095. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.