IDEAS home Printed from https://ideas.repec.org/a/pal/jorsoc/v60y2009i8d10.1057_jors.2008.161.html
   My bibliography  Save this article

Domain knowledge integration in data mining using decision tables: case studies in churn prediction

Author

Listed:
  • E Lima

    (University of Southampton)

  • C Mues

    (University of Southampton)

  • B Baesens

    (University of Southampton)

Abstract

Companies' interest in customer relationship modelling and key issues such as customer lifetime value and churn has substantially increased over the years. However, the complexity of building, interpreting and applying these models creates obstacles for their implementation. The main contribution of this paper is to show how domain knowledge can be incorporated in the data mining process for churn prediction, viz. through the evaluation of coefficient signs in a logistic regression model, and secondly, by analysing a decision table (DT) extracted from a decision tree or rule-based classifier. An algorithm to check DTs for violations of monotonicity constraints is presented, which involves the repeated application of condition reordering and table contraction to detect counter-intuitive patterns. Both approaches are applied to two telecom data sets to empirically demonstrate how domain knowledge can be used to ensure the interpretability of the resulting models.

Suggested Citation

  • E Lima & C Mues & B Baesens, 2009. "Domain knowledge integration in data mining using decision tables: case studies in churn prediction," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(8), pages 1096-1106, August.
  • Handle: RePEc:pal:jorsoc:v:60:y:2009:i:8:d:10.1057_jors.2008.161
    DOI: 10.1057/jors.2008.161
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/jors.2008.161
    File Function: Abstract
    Download Restriction: Access to full text is restricted to subscribers.

    File URL: https://libkey.io/10.1057/jors.2008.161?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Van Gestel, Tony & Martens, David & Baesens, Bart & Feremans, Daniel & Huysmans, Johan & Vanthienen, Jan, 2007. "Forecasting and analyzing insurance companies' ratings," International Journal of Forecasting, Elsevier, vol. 23(3), pages 513-529.
    2. Bart Baesens & Rudy Setiono & Christophe Mues & Jan Vanthienen, 2003. "Using Neural Network Rule Extraction and Decision Tables for Credit-Risk Evaluation," Management Science, INFORMS, vol. 49(3), pages 312-329, March.
    3. B. Larivière & D. Van Den Poel, 2004. "Investigating the role of product features in preventing customer churn, by using survival analysis and choice modeling: The case of financial services," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 04/223, Ghent University, Faculty of Economics and Business Administration.
    4. Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July.
    5. B Baesens & T Van Gestel & S Viaene & M Stepanova & J Suykens & J Vanthienen, 2003. "Benchmarking state-of-the-art classification algorithms for credit scoring," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 54(6), pages 627-635, June.
    6. Marina Velikova & Hennie Daniels, 2004. "Decision trees for monotone price models," Computational Management Science, Springer, vol. 1(3), pages 231-244, October.
    7. K. Coussement & D. Van Den Poel, 2006. "Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparing Two Parameter-Selection Techniques," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 06/412, Ghent University, Faculty of Economics and Business Administration.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Gerpott, Torsten J. & Meinert, Phil, 2018. "Termination notice of mobile network operator customers after a tariff switch: An empirical study of postpaid subscribers in Germany," Telecommunications Policy, Elsevier, vol. 42(3), pages 212-226.
    2. Verbeke, Wouter & Dejaeger, Karel & Martens, David & Hur, Joon & Baesens, Bart, 2012. "New insights into churn prediction in the telecommunication sector: A profit driven data mining approach," European Journal of Operational Research, Elsevier, vol. 218(1), pages 211-229.
    3. Gerpott, Torsten J. & Ahmadi, Nima & Weimar, Daniel, 2015. "Who is (not) convinced to withdraw a contract termination announcement? – A discriminant analysis of mobile communications customers in Germany," Telecommunications Policy, Elsevier, vol. 39(1), pages 38-52.
    4. Ballings, Michel & Van den Poel, Dirk, 2015. "CRM in social media: Predicting increases in Facebook usage frequency," European Journal of Operational Research, Elsevier, vol. 244(1), pages 248-260.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Lessmann, Stefan & Voß, Stefan, 2009. "A reference model for customer-centric data mining with support vector machines," European Journal of Operational Research, Elsevier, vol. 199(2), pages 520-530, December.
    2. Verbeke, Wouter & Dejaeger, Karel & Martens, David & Hur, Joon & Baesens, Bart, 2012. "New insights into churn prediction in the telecommunication sector: A profit driven data mining approach," European Journal of Operational Research, Elsevier, vol. 218(1), pages 211-229.
    3. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    4. K. Coussement & D. Van Den Poel, 2008. "Improving Customer Attrition Prediction by Integrating Emotions from Client/Company Interaction Emails and Evaluating Multiple Classifiers," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 08/527, Ghent University, Faculty of Economics and Business Administration.
    5. Vera Miguéis & Dirk Poel & Ana Camanho & João Falcão e Cunha, 2012. "Predicting partial customer churn using Markov for discrimination for modeling first purchase sequences," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 6(4), pages 337-353, December.
    6. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    7. M. Ballings & D. Van Den Poel & E. Verhagen, 2013. "Evaluating the Added Value of Pictorial Data for Customer Churn Prediction," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 13/869, Ghent University, Faculty of Economics and Business Administration.
    8. Risselada, Hans & Verhoef, Peter C. & Bijmolt, Tammo H.A., 2010. "Staying Power of Churn Prediction Models," Journal of Interactive Marketing, Elsevier, vol. 24(3), pages 198-208.
    9. Hoffmann, F. & Baesens, B. & Mues, C. & Van Gestel, T. & Vanthienen, J., 2007. "Inferring descriptive and approximate fuzzy rules for credit scoring using evolutionary algorithms," European Journal of Operational Research, Elsevier, vol. 177(1), pages 540-555, February.
    10. Martens, David & Baesens, Bart & Van Gestel, Tony & Vanthienen, Jan, 2007. "Comprehensible credit scoring models using rule extraction from support vector machines," European Journal of Operational Research, Elsevier, vol. 183(3), pages 1466-1476, December.
    11. Loterman, Gert & Brown, Iain & Martens, David & Mues, Christophe & Baesens, Bart, 2012. "Benchmarking regression algorithms for loss given default modeling," International Journal of Forecasting, Elsevier, vol. 28(1), pages 161-170.
    12. Slãvescu Ecaterina Oana & Panait Iulian, 2012. "Improving Customer Churn Models as one of Customer Relationship Management Business Solutions for the Telecommunication Industry," Ovidius University Annals, Economic Sciences Series, Ovidius University of Constantza, Faculty of Economic Sciences, vol. 0(1), pages 1156-1160, May.
    13. Casado Yusta, Silvia & Nœ–ez Letamendía, Laura & Pacheco Bonrostro, Joaqu’n Antonio, 2018. "Predicting Corporate Failure: The GRASP-LOGIT Model || Predicci—n de la quiebra empresarial: el modelo GRASP-LOGIT," Revista de Métodos Cuantitativos para la Economía y la Empresa = Journal of Quantitative Methods for Economics and Business Administration, Universidad Pablo de Olavide, Department of Quantitative Methods for Economics and Business Administration, vol. 26(1), pages 294-314, Diciembre.
    14. Johannes Habel & Sascha Alavi & Nicolas Heinitz, 2023. "A theory of predictive sales analytics adoption," AMS Review, Springer;Academy of Marketing Science, vol. 13(1), pages 34-54, June.
    15. Dejaeger, Karel & Goethals, Frank & Giangreco, Antonio & Mola, Lapo & Baesens, Bart, 2012. "Gaining insight into student satisfaction using comprehensible data mining techniques," European Journal of Operational Research, Elsevier, vol. 218(2), pages 548-562.
    16. K. W. De Bock & D. Van Den Poel, 2012. "Reconciling Performance and Interpretability in Customer Churn Prediction using Ensemble Learning based on Generalized Additive Models," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/805, Ghent University, Faculty of Economics and Business Administration.
    17. Gestel, Tony Van & Baesens, Bart & Suykens, Johan A.K. & Van den Poel, Dirk & Baestaens, Dirk-Emma & Willekens, Marleen, 2006. "Bayesian kernel based classification for financial distress detection," European Journal of Operational Research, Elsevier, vol. 172(3), pages 979-1003, August.
    18. V. L. Miguéis & D. Van Den Poel & A.S. Camanho & J. Falcao E Cunha, 2012. "Modeling Partial Customer Churn: On the Value of First Product-Category Purchase Sequences," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/790, Ghent University, Faculty of Economics and Business Administration.
    19. B Baesens & C Mues & D Martens & J Vanthienen, 2009. "50 years of data mining and OR: upcoming trends and challenges," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 60(1), pages 16-23, May.
    20. Coussement, Kristof & Benoit, Dries Frederik & Van den Poel, Dirk, 2009. "Improved Marketing Decision Making in a Customer Churn Prediction Context Using Generalized Additive Models," Working Papers 2009/18, Hogeschool-Universiteit Brussel, Faculteit Economie en Management.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:jorsoc:v:60:y:2009:i:8:d:10.1057_jors.2008.161. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.palgrave-journals.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.