IDEAS home Printed from https://ideas.repec.org/a/zag/market/v35y2023i1p7-22.html
   My bibliography  Save this article

Using Supervised Machine Learning Methods for RFM Segmentation: A Casino Direct Marketing Communication Case

Author

Listed:
  • Danijel Bratina

    (University of Primorska, Faculty of Management)

  • Armand Faganel

    (University of Primorska, Faculty of Management)

Abstract

Purpose – This paper explores various supervised machine learning algorithms as an additional classification method to RFM (recency, frequency, and monetary) models with the aim of improving the accuracy in predicting target groups of customers for direct marketing response campaigns conducted by a casino. The purpose of this paper is twofold – first, to test how the addition of demographic variables increases the accuracy of the basic RFM model and second, to assess if and how machine learning algorithms improve the initial model. Ultimately, we propose a model for direct marketing response at individual level using RFM scores and customer demographic and behavioral data as endogenous variables to be used by the company. The findings can be used as an alternative to the simpler RFM model when approaching customer response modeling for large datasets and can be generalized to other industries. Design/Methodology/Approach – Our research employed supervised machine learning methods tuned on historical responses to a casino’s direct marketing activities to improve the company’s RFM segmentation model. Demographic variables were also included with the aim of improving the power of the models employed. Finally, we attempted to improve the best-performing model by hypertuning its algorithm parameters. Findings and Implications – The best and most intuitive model was found to be that using decision trees with Recency (from RFM) together with age and the awarded amount (from the demographic element) as independent variables. Surprisingly, the company’s own RFM segmentation was also found to perform well. Limitations – Not all machine learning methods used for classification were included in our research nor did we use ensemble methods to improve the models’ power. While all models developed are applicable to similar data, they could lose their accuracy when applied to data from a different industry. The company’s own RFM model was not analyzed but was included in the model as is. Further insight could be gained by determining its optimal parameters. Originality – This study contributes to the existing literature by showing how direct marketing efficiency modeling using standard RFM could be improved with the addition of a company’s customer property. It also provides insight into how classification algorithms perform on a casino database of direct marketing activities.

Suggested Citation

  • Danijel Bratina & Armand Faganel, 2023. "Using Supervised Machine Learning Methods for RFM Segmentation: A Casino Direct Marketing Communication Case," Tržište/Market, Faculty of Economics and Business, University of Zagreb, vol. 35(1), pages 7-22.
  • Handle: RePEc:zag:market:v:35:y:2023:i:1:p:7-22
    as

    Download full text from publisher

    File URL: https://hrcak.srce.hr/file/437958
    Download Restriction: None
    ---><---

    References listed on IDEAS

    as
    1. Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July.
    2. Nauman Zaheer & Mihael Kline, 2018. "Use of Lifestyle Segmentation for Assessing Consumers’ Attitudes and Behavioral Outcomes towards Mobile Advertising," Tržište/Market, Faculty of Economics and Business, University of Zagreb, vol. 30(2), pages 213-229.
    3. Gitae Kim & Bongsug Chae & David Olson, 2013. "A support vector machine (SVM) approach to imbalanced datasets of customer responses: comparison with other customer response models," Service Business, Springer;Pan-Pacific Business Association, vol. 7(1), pages 167-182, March.
    4. McCarty, John A. & Hastak, Manoj, 2007. "Segmentation approaches in data-mining: A comparison of RFM, CHAID, and logistic regression," Journal of Business Research, Elsevier, vol. 60(6), pages 656-662, June.
    5. Tjasa Tabaj & Danijel Bratina, 2018. "Improving Direct Marketing Activities Effectiveness Using Analytical Models: RFM vs. Logit Model on a Casino Case," Management, University of Primorska, Faculty of Management Koper, vol. 13(4), pages 323-334.
    6. G. V. Kass, 1980. "An Exploratory Technique for Investigating Large Quantities of Categorical Data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 29(2), pages 119-127, June.
    7. Jan Roelf Bult & Tom Wansbeek, 1995. "Optimal Selection for Direct Mail," Marketing Science, INFORMS, vol. 14(4), pages 378-394.
    8. Rahim, Mussadiq Abdul & Mushafiq, Muhammad & Khan, Salabat & Arain, Zulfiqar Ali, 2021. "RFM-based repurchase behavior for customer classification and segmentation," Journal of Retailing and Consumer Services, Elsevier, vol. 61(C).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. M. Ballings & D. Van Den Poel, 2012. "The Relevant Length of Customer Event History for Churn Prediction: How long is long enough?," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/804, Ghent University, Faculty of Economics and Business Administration.
    2. Hache, Emmanuel & Leboullenger, Déborah & Mignon, Valérie, 2017. "Beyond average energy consumption in the French residential housing market: A household classification approach," Energy Policy, Elsevier, vol. 107(C), pages 82-95.
    3. I. Albarrán & P. Alonso-González & J. M. Marin, 2017. "Some criticism to a general model in Solvency II: an explanation from a clustering point of view," Empirical Economics, Springer, vol. 52(4), pages 1289-1308, June.
    4. Bas Donkers & Richard Paap & Jedid‐Jah Jonker & Philip Hans Franses, 2006. "Deriving target selection rules from endogenously selected samples," Journal of Applied Econometrics, John Wiley & Sons, Ltd., vol. 21(5), pages 549-562, July.
    5. Coussement, Kristof & De Bock, Koen W., 2013. "Customer churn prediction in the online gambling industry: The beneficial effect of ensemble learning," Journal of Business Research, Elsevier, vol. 66(9), pages 1629-1636.
    6. R Fildes & K Nikolopoulos & S F Crone & A A Syntetos, 2008. "Forecasting and operational research: a review," Journal of the Operational Research Society, Palgrave Macmillan;The OR Society, vol. 59(9), pages 1150-1172, September.
    7. Yingqiu Zhu & Qiong Deng & Danyang Huang & Bingyi Jing & Bo Zhang, 2021. "Clustering based on Kolmogorov–Smirnov statistic with application to bank card transaction data," Journal of the Royal Statistical Society Series C, Royal Statistical Society, vol. 70(3), pages 558-578, June.
    8. Coussement, Kristof & Van den Bossche, Filip A.M. & De Bock, Koen W., 2014. "Data accuracy's impact on segmentation performance: Benchmarking RFM analysis, logistic regression, and decision trees," Journal of Business Research, Elsevier, vol. 67(1), pages 2751-2758.
    9. Marko Sarstedt & Sebastian Scharf & Alexander Thamm & Michael Wolff, 2010. "Die Prognose von Serviceintervallen mit der Hazard-Raten-Analyse – Ergebnisse einer empirischen Studie im Automobilmarkt," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 20(3), pages 269-283, April.
    10. Rocío G. Martínez & Ramon A. Carrasco & Cristina Sanchez-Figueroa & Diana Gavilan, 2021. "An RFM Model Customizable to Product Catalogues and Marketing Criteria Using Fuzzy Linguistic Models: Case Study of a Retail Business," Mathematics, MDPI, vol. 9(16), pages 1-31, August.
    11. Philippe Baecke & Dirk Van Den Poel, 2010. "Improving Purchasing Behavior Predictions By Data Augmentation With Situational Variables," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 9(06), pages 853-872.
    12. Albarrán Lozano, Irene & Marín Díazaraque, Juan Miguel & Alonso, Pablo J., 2011. "Why using a general model in Solvency II is not a good idea : an explanation from a Bayesian point of view," DES - Working Papers. Statistics and Econometrics. WS ws113729, Universidad Carlos III de Madrid. Departamento de Estadística.
    13. Vera Miguéis & Dirk Poel & Ana Camanho & João Falcão e Cunha, 2012. "Predicting partial customer churn using Markov for discrimination for modeling first purchase sequences," Advances in Data Analysis and Classification, Springer;German Classification Society - Gesellschaft für Klassifikation (GfKl);Japanese Classification Society (JCS);Classification and Data Analysis Group of the Italian Statistical Society (CLADAG);International Federation of Classification Societies (IFCS), vol. 6(4), pages 337-353, December.
    14. V. L. Miguéis & D. Van Den Poel & A.S. Camanho & J. Falcao E Cunha, 2012. "Modeling Partial Customer Churn: On the Value of First Product-Category Purchase Sequences," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/790, Ghent University, Faculty of Economics and Business Administration.
    15. Potharst, R. & Kaymak, U. & Pijls, W.H.L.M., 2001. "Neural Networks for Target Selection in Direct Marketing," ERIM Report Series Research in Management ERS-2001-14-LIS, Erasmus Research Institute of Management (ERIM), ERIM is the joint research institute of the Rotterdam School of Management, Erasmus University and the Erasmus School of Economics (ESE) at Erasmus University Rotterdam.
    16. Legohérel, Patrick & Hsu, Cathy H.C. & Daucé, Bruno, 2015. "Variety-seeking: Using the CHAID segmentation approach in analyzing the international traveler market," Tourism Management, Elsevier, vol. 46(C), pages 359-366.
    17. Hayk Manucharyan, 2020. "How do managers actually choose suppliers? Evidence from revealed preference data," Working Papers 2020-12, Faculty of Economic Sciences, University of Warsaw.
    18. Gabriel Marín Díaz & Ramón Alberto Carrasco & Daniel Gómez, 2021. "RFID: A Fuzzy Linguistic Model to Manage Customers from the Perspective of Their Interactions with the Contact Center," Mathematics, MDPI, vol. 9(19), pages 1-27, September.
    19. Duncan Simester & Artem Timoshenko & Spyros I. Zoumpoulis, 2020. "Targeting Prospective Customers: Robustness of Machine-Learning Methods to Typical Data Challenges," Management Science, INFORMS, vol. 66(6), pages 2495-2522, June.
    20. A. Prinzie & D. Van Den Poel, 2005. "Constrained optimization of data-mining problems to improve model performance: A direct-marketing application," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 05/298, Ghent University, Faculty of Economics and Business Administration.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:zag:market:v:35:y:2023:i:1:p:7-22. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tanja Komarac (email available below). General contact details of provider: https://edirc.repec.org/data/fefzghr.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.