This file is part of IDEAS, which uses RePEc data


[ Papers | Articles | Software | Books | Chapters | Authors | Institutions | JEL Classification | NEP reports | Search | New papers by email | Author registration | Rankings | Volunteers | FAQ | Blog | Help! ]

Predicting Customer Retention and Profitability by Using Random Forests and Regression Forests Techniques

Author info | Abstract | Publisher info | Download info | Related research | Statistics
Author Info
B. LARIVIÈRE
D. VAN DEN POEL ()

Additional information is available for the following registered author(s):

Abstract

In an era of strong customer relationship management (CRM) emphasis, firms strive to build valuable relationships with their existing customer base. In this study we attempt to better understand three important measures of customer outcome: next buy, partial defection and customers’ profitability evolution. By means of random forests techniques we investigate a broad set of explanatory variables, including past customer behavior, observed customer heterogeneity and some typical variables related to intermediaries. We analyze a real-life sample of 100,000 customers taken from the data warehouse of a large European financial services company. Two types of random forests techniques are employed to analyze the data: random forests are used for binary classification, whereas regression forests are applied for the models with linear dependent variables. Our research findings demonstrate that both random forests techniques provide better fit for the estimation and validation sample compared to ordinary linear regression and logistic regression models. Furthermore, we find evidence that the same set of variables have a different impact on buying versus defection versus profitability behavior. Our findings suggest that past customer behavior is more important to generate repeat purchasing and favorable profitability evolutions, while the intermediary’s role has a greater impact on the customers’ defection proneness. Finally, our results demonstrate the benefits of analyzing different customer outcome variables simultaneously, since an extended investigation of the next buy - partial defection - customer profitability triad indicates that one cannot fully understand a particular outcome without understanding the other related behavioral outcome variables.

Download Info
To download:

If you experience problems downloading a file, check if you have the proper application to view it first. Information about this may be contained in the File-Format links below. In case of further problems read the IDEAS help page. Note that these files are not on the IDEAS site. Please be patient as the files may be large.

File URL: http://www.FEB.UGent.be/nl/Ondz/wp/Papers/wp_04_282.pdf
File Format: application/pdf
File Function:
Download Restriction: no

Publisher Info
Paper provided by Ghent University, Faculty of Economics and Business Administration in its series Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium with number 04/282.

Download reference. The following formats are available: HTML (with abstract), plain text (with abstract), BibTeX, RIS (EndNote, RefMan, ProCite), ReDIF
Length: 35 pages
Date of creation: Dec 2004
Date of revision:
Handle: RePEc:rug:rugwps:04/282

Contact details of provider:
Postal: Hoveniersberg 4, B-9000 Gent
Phone: ++ 32 (0) 9 264 34 61
Fax: ++ 32 (0) 9 264 35 92
Web page: http://www.feb.ugent.be/
More information through EDIRC

For technical questions regarding this item, or to correct its listing, contact: (Nathalie Verhaeghe).

Related research
Keywords: Data mining; Customer relationship management; Customer retention and profitability; Random forests and regression forests.;

Other versions of this item:

This paper has been announced in the following NEP Reports: References listed on IDEAS
Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.:
  1. Van den Poel, Dirk & Lariviere, Bart, 2004. "Customer attrition analysis for financial services using proportional hazard models," European Journal of Operational Research, Elsevier, vol. 157(1), pages 196-217, August. [Downloadable!] (restricted)
    Other versions:
  2. Dudoit S. & Fridlyand J. & Speed T. P, 2002. "Comparison of Discrimination Methods for the Classification of Tumors Using Gene Expression Data," Journal of the American Statistical Association, American Statistical Association, vol. 97, pages 77-87, March. [Downloadable!] (restricted)
  3. Hemant Ishwaran & Eugene H. Blackstone & Claire E. Pothier & Michael S. Lauer, 2004. "Relative Risk Forests for Exercise Heart Rate Recovery as a Predictor of Mortality," Journal of the American Statistical Association, American Statistical Association, vol. 99, pages 591-600, January. [Downloadable!] (restricted)
  4. B. Larivière & D. Van Den Poel, 2004. "Investigating the role of product features in preventing customer churn, by using survival analysis and choice modeling: The case of financial services," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 04/223, Ghent University, Faculty of Economics and Business Administration. [Downloadable!]
  5. Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July. [Downloadable!] (restricted)
    Other versions:
  6. B. Baesens & G. Verstraeten & D. Van Den Poel & M. Egmont-Petersen & P. Van Kenhove & J. Vanthienen, 2002. "Bayesian Network Classifiers for Identifying the Slope of the Customer - Lifecycle of Long-Life Customers," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 02/154, Ghent University, Faculty of Economics and Business Administration. [Downloadable!]
    Other versions:
  7. Athanassopoulos, Antreas D., 2000. "Customer Satisfaction Cues To Support Market Segmentation and Explain Switching Behavior," Journal of Business Research, Elsevier, vol. 47(3), pages 191-207, March. [Downloadable!] (restricted)
  8. Baesens, Bart & Viaene, Stijn & Van den Poel, Dirk & Vanthienen, Jan & Dedene, Guido, 2002. "Bayesian neural network learning for repeat purchase modelling in direct marketing," European Journal of Operational Research, Elsevier, vol. 138(1), pages 191-211, April. [Downloadable!] (restricted)
Full references

Cited by:
(explanations, Please report citation or reference errors to , or , if you are the registered author of the cited work, log in to your RePEc Author Service profile, click on "citations" and make appropriate adjustments.)

  1. J. Burez & D. Van Den Poel, 2008. "Handling class imbalance in customer churn prediction," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 08/517, Ghent University, Faculty of Economics and Business Administration. [Downloadable!]
  2. K. Coussement & D. Van Den Poel, 2006. "Churn Prediction in Subscription Services: an Application of Support Vector Machines While Comparing Two Parameter-Selection Techniques," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 06/412, Ghent University, Faculty of Economics and Business Administration. [Downloadable!]
  3. J. Burez & D. Van Den Poel, 2005. "CRM at a Pay-TV Company: Using Analytical Models to Reduce Customer Attrition by Targeted Marketing for Subscription Services," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 05/348, Ghent University, Faculty of Economics and Business Administration. [Downloadable!]
Statistics
Access and download statistics

Did you know? There are NEP reports in over 80 fields that deliver new research to your email.

This page was last updated on 2009-11-26.


This information is provided to you by IDEAS at the Department of Economics, College of Liberal Arts and Sciences, University of Connecticut using RePEc data on a server sponsored by the Society for Economic Dynamics.