IDEAS home Printed from https://ideas.repec.org/a/eee/ejores/v284y2020i3p920-933.html
   My bibliography  Save this article

Profit driven decision trees for churn prediction

Author

Listed:
  • Höppner, Sebastiaan
  • Stripling, Eugen
  • Baesens, Bart
  • Broucke, Seppe vanden
  • Verdonck, Tim

Abstract

Customer retention campaigns increasingly rely on predictive models to detect potential churners in a vast customer base. From the perspective of machine learning, the task of predicting customer churn can be presented as a binary classification problem. Using data on historic behavior, classification algorithms are built with the purpose of accurately predicting the probability of a customer defecting. The predictive churn models are then commonly selected based on accuracy related performance measures such as the area under the ROC curve (AUC). However, these models are often not well aligned with the core business requirement of profit maximization, in the sense that, the models fail to take into account not only misclassification costs, but also the benefits originating from a correct classification. Therefore, the aim is to construct churn prediction models that are profitable and preferably interpretable too. The recently developed expected maximum profit measure for customer churn (EMPC) has been proposed in order to select the most profitable churn model. We present a new classifier that integrates the EMPC metric directly into the model construction. Our technique, called ProfTree, uses an evolutionary algorithm for learning profit driven decision trees. In a benchmark study with real-life datasets from various telecommunication service providers, we show that ProfTree achieves significant profit improvements compared to classic accuracy driven tree-based methods.

Suggested Citation

  • Höppner, Sebastiaan & Stripling, Eugen & Baesens, Bart & Broucke, Seppe vanden & Verdonck, Tim, 2020. "Profit driven decision trees for churn prediction," European Journal of Operational Research, Elsevier, vol. 284(3), pages 920-933.
  • Handle: RePEc:eee:ejores:v:284:y:2020:i:3:p:920-933
    DOI: 10.1016/j.ejor.2018.11.072
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0377221718310166
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.ejor.2018.11.072?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Verbraken, Thomas & Bravo, Cristián & Weber, Richard & Baesens, Bart, 2014. "Development and application of consumer credit scoring models using profit-based classification measures," European Journal of Operational Research, Elsevier, vol. 238(2), pages 505-513.
    2. Grubinger, Thomas & Zeileis, Achim & Pfeiffer, Karl-Peter, 2014. "evtree: Evolutionary Learning of Globally Optimal Classification and Regression Trees in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 61(i01).
    3. Chen, Zhen-Yu & Fan, Zhi-Ping & Sun, Minghe, 2012. "A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data," European Journal of Operational Research, Elsevier, vol. 223(2), pages 461-472.
    4. van Wezel, Michiel & Potharst, Rob, 2007. "Improved customer choice predictions using ensemble methods," European Journal of Operational Research, Elsevier, vol. 181(1), pages 436-452, August.
    5. Glady, Nicolas & Baesens, Bart & Croux, Christophe, 2009. "Modeling churn using customer lifetime value," European Journal of Operational Research, Elsevier, vol. 197(1), pages 402-411, August.
    6. Verbeke, Wouter & Dejaeger, Karel & Martens, David & Hur, Joon & Baesens, Bart, 2012. "New insights into churn prediction in the telecommunication sector: A profit driven data mining approach," European Journal of Operational Research, Elsevier, vol. 218(1), pages 211-229.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    2. Miikka Blomster & Timo Koivumäki, 2022. "Exploring the resources, competencies, and capabilities needed for successful machine learning projects in digital marketing," Information Systems and e-Business Management, Springer, vol. 20(1), pages 123-169, March.
    3. Emilio Carrizosa & Cristina Molero-Río & Dolores Romero Morales, 2021. "Mathematical optimization in classification and regression trees," TOP: An Official Journal of the Spanish Society of Statistics and Operations Research, Springer;Sociedad de Estadística e Investigación Operativa, vol. 29(1), pages 5-33, April.
    4. Lewlisa Saha & Hrudaya Kumar Tripathy & Tarek Gaber & Hatem El-Gohary & El-Sayed M. El-kenawy, 2023. "Deep Churn Prediction Method for Telecommunication Industry," Sustainability, MDPI, vol. 15(5), pages 1-21, March.
    5. Chen, Claire Y.T. & Sun, Edward W. & Miao, Wanyu & Lin, Yi-Bing, 2024. "Reconciling business analytics with graphically initialized subspace clustering for optimal nonlinear pricing," European Journal of Operational Research, Elsevier, vol. 312(3), pages 1086-1107.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    2. Maldonado, Sebastián & Domínguez, Gonzalo & Olaya, Diego & Verbeke, Wouter, 2021. "Profit-driven churn prediction for the mutual fund industry: A multisegment approach," Omega, Elsevier, vol. 100(C).
    3. Mahajan, Pravar Dilip & Maurya, Abhinav & Megahed, Aly & Elwany, Alaa & Strong, Ray & Blomberg, Jeanette, 2020. "Optimizing predictive precision in imbalanced datasets for actionable revenue change prediction," European Journal of Operational Research, Elsevier, vol. 285(3), pages 1095-1113.
    4. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    5. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    6. Lessmann, Stefan & Coussement, Kristof & De Bock, Koen W. & Haupt, Johannes, 2018. "Targeting customers for profit: An ensemble learning framework to support marketing decision making," IRTG 1792 Discussion Papers 2018-012, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    7. Maldonado, Sebastián & López, Julio & Vairetti, Carla, 2020. "Profit-based churn prediction based on Minimax Probability Machines," European Journal of Operational Research, Elsevier, vol. 284(1), pages 273-284.
    8. Clemente-Císcar, M. & San Matías, S. & Giner-Bosch, V., 2014. "A methodology based on profitability criteria for defining the partial defection of customers in non-contractual settings," European Journal of Operational Research, Elsevier, vol. 239(1), pages 276-285.
    9. Tang, Leilei & Thomas, Lyn & Fletcher, Mary & Pan, Jiazhu & Marshall, Andrew, 2014. "Assessing the impact of derived behavior information on customer attrition in the financial service industry," European Journal of Operational Research, Elsevier, vol. 236(2), pages 624-633.
    10. K. W. De Bock & D. Van Den Poel, 2011. "An empirical evaluation of rotation-based ensemble classifiers for customer churn prediction," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 11/717, Ghent University, Faculty of Economics and Business Administration.
    11. Lessmann, Stefan & Baesens, Bart & Seow, Hsin-Vonn & Thomas, Lyn C., 2015. "Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research," European Journal of Operational Research, Elsevier, vol. 247(1), pages 124-136.
    12. Höppner, Sebastiaan & Baesens, Bart & Verbeke, Wouter & Verdonck, Tim, 2022. "Instance-dependent cost-sensitive learning for detecting transfer fraud," European Journal of Operational Research, Elsevier, vol. 297(1), pages 291-300.
    13. Uner, M.Mithat & Guven, Faruk & Cavusgil, S.Tamer, 2020. "Churn and loyalty behavior of Turkish digital natives: Empirical insights and managerial implications," Telecommunications Policy, Elsevier, vol. 44(4).
    14. Aurélie Lemmens & Sunil Gupta, 2020. "Managing Churn to Maximize Profits," Marketing Science, INFORMS, vol. 39(5), pages 956-973, September.
    15. Johannes Haupt & Stefan Lessmann, 2020. "Targeting customers under response-dependent costs," Papers 2003.06271, arXiv.org, revised Aug 2021.
    16. Haupt, Johannes & Lessmann, Stefan, 2020. "Targeting Cutsomers Under Response-Dependent Costs," IRTG 1792 Discussion Papers 2020-005, Humboldt University of Berlin, International Research Training Group 1792 "High Dimensional Nonstationary Time Series".
    17. Martínez, Andrés & Schmuck, Claudia & Pereverzyev, Sergiy & Pirker, Clemens & Haltmeier, Markus, 2020. "A machine learning framework for customer purchase prediction in the non-contractual setting," European Journal of Operational Research, Elsevier, vol. 281(3), pages 588-596.
    18. K. Coussement & K. W. Bock & S. Geuens, 2022. "A decision-analytic framework for interpretable recommendation systems with multiple input data sources: a case study for a European e-tailer," Annals of Operations Research, Springer, vol. 315(2), pages 671-694, August.
    19. Fan, Zhi-Ping & Sun, Minghe, 2015. "Behavior-aware user response modeling in social media: Learning from diverse heterogeneous dataAuthor-Name: Chen, Zhen-Yu," European Journal of Operational Research, Elsevier, vol. 241(2), pages 422-434.
    20. Aimée Backiel & Bart Baesens & Gerda Claeskens, 2016. "Predicting time-to-churn of prepaid mobile telephone customers using social network analysis," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 67(9), pages 1135-1145, September.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:ejores:v:284:y:2020:i:3:p:920-933. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/eor .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.