IDEAS home Printed from https://ideas.repec.org/a/eee/joreco/v85y2025ics0969698925000670.html

Improving customer retention in taxi industry using travel data analytics: A churn prediction study

Author

Listed:
  • Loureiro, A.L.D.
  • Miguéis, V.L.
  • Costa, Ã lvaro
  • Ferreira, Michel

Abstract

The retention of public transport users is widely acknowledged as a paramount challenge in the path towards the establishment of more sustainable cities and societies. In this setting, in which no contractual relationship with customers exists, an early and accurate prediction of whether a customer will remain with the company or leave, assumes great significance for businesses to develop effective retention strategies. This work focuses on this topic by identifying potential churners based on their past travel behavior. To achieve this, we developed a set of classification models using various machine learning techniques. These models were then employed as base learners within a stacking ensemble. All classifiers were developed with a profit-driven approach, optimizing for expected maximum profit. Finally, we calculated Shapley Additive Explanation values to enhance the interpretability of the proposed classifiers. The performance of the predictive models was evaluated using the data of taxi services recorded in a Portuguese city for 52 months. A broad range of predictors is proposed, including recency and frequency measures of taxi usage as well as others related to customers' satisfaction level. The predictive power of the models was also assessed for specific proportions of higher risk customers. All models have shown the capability to identify churners accurately. This study innovates in evaluating the one-to-one service provider company-customer relationship in the context of taxi industry. Retention actions to promote customers loyalty and enhance retention are also suggested.

Suggested Citation

  • Loureiro, A.L.D. & Miguéis, V.L. & Costa, à lvaro & Ferreira, Michel, 2025. "Improving customer retention in taxi industry using travel data analytics: A churn prediction study," Journal of Retailing and Consumer Services, Elsevier, vol. 85(C).
  • Handle: RePEc:eee:joreco:v:85:y:2025:i:c:s0969698925000670
    DOI: 10.1016/j.jretconser.2025.104288
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0969698925000670
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jretconser.2025.104288?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July.
    2. Ning Huan & Stephane Hess & Enjian Yao, 2022. "Understanding the effects of travel demand management on metro commuters’ behavioural loyalty: a hybrid choice modelling approach," Transportation, Springer, vol. 49(2), pages 343-372, April.
    3. Oyebayo Ridwan Olaniran & Ali Rashash R. Alzahrani, 2023. "On the Oracle Properties of Bayesian Random Forest for Sparse High-Dimensional Gaussian Regression," Mathematics, MDPI, vol. 11(24), pages 1-29, December.
    4. Sukhov, Alexandre & Friman, Margareta & Olsson, Lars E., 2023. "Unlocking potential: An integrated approach using PLS-SEM, NCA, and fsQCA for informed decision making," Journal of Retailing and Consumer Services, Elsevier, vol. 74(C).
    5. Saleh, Wafaa & Farrell, Séona, 2007. "Investigation and analysis of evidence of asymmetric churn in travel demand models," Transportation Research Part A: Policy and Practice, Elsevier, vol. 41(7), pages 691-702, August.
    6. Arno de Caigny & Kristof Coussement & Koen W. de Bock, 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," Post-Print hal-01741661, HAL.
    7. Wang, Jiangbo & Yamamoto, Toshiyuki & Liu, Kai, 2022. "Exploring the subscribing behavior of customized bus passengers: Active users versus inactive users," Journal of choice modelling, Elsevier, vol. 43(C).
    8. V. L. Migu Is & D. Van Den Poel & A.S. Camanho & J. Falcao E Cunha, 2012. "Modeling Partial Customer Churn: On the Value of First Product-Category Purchase Sequences," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/790, Ghent University, Faculty of Economics and Business Administration.
    9. Maldonado, Sebastián & López, Julio & Vairetti, Carla, 2020. "Profit-based churn prediction based on Minimax Probability Machines," European Journal of Operational Research, Elsevier, vol. 284(1), pages 273-284.
    10. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    11. Dea van Lierop & Madhav G. Badami & Ahmed M. El-Geneidy, 2018. "What influences satisfaction and loyalty in public transport? A review of the literature," Transport Reviews, Taylor & Francis Journals, vol. 38(1), pages 52-72, January.
    12. Efthymiou, Dimitrios & Antoniou, Constantinos & Tyrinopoulos, Yannis & Skaltsogianni, Eleana, 2018. "Factors affecting bus users’ satisfaction in times of economic crisis," Transportation Research Part A: Policy and Practice, Elsevier, vol. 114(PA), pages 3-12.
    13. Yang, Shuo & Fan, Yingling & Deng, Wei & Cheng, Long, 2019. "Do built environment effects on travel behavior differ between household members? A case study of Nanjing, China," Transport Policy, Elsevier, vol. 81(C), pages 360-370.
    14. Höppner, Sebastiaan & Stripling, Eugen & Baesens, Bart & Broucke, Seppe vanden & Verdonck, Tim, 2020. "Profit driven decision trees for churn prediction," European Journal of Operational Research, Elsevier, vol. 284(3), pages 920-933.
    15. Ma, Zhenliang & Koutsopoulos, Haris N. & Liu, Tianyou & Basu, Abhishek Arunasis, 2020. "Behavioral response to promotion-based public transport demand management: Longitudinal analysis and implications for optimal promotion design," Transportation Research Part A: Policy and Practice, Elsevier, vol. 141(C), pages 356-372.
    16. Zhenkun Liu & Koen de Bock & Lifang Zhang, 2025. "Explainable Profit-Driven Hotel Booking Cancellation Prediction based on Heterogeneous Stacking-Based Ensemble Classification," Post-Print hal-04679539, HAL.
    17. Jiang, Ping & Liu, Zhenkun & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Wendong & Dong, Qingli, 2024. "Profit-driven weighted classifier with interpretable ability for customer churn prediction," Omega, Elsevier, vol. 125(C).
    18. Liu, Zhenkun & Zhang, Ying & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Hufang & Gao, Yuyang & Chen, Yinghao, 2024. "Profit-driven fusion framework based on bagging and boosting classifiers for potential purchaser prediction," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    19. Zhenkun Liu & Ping Jiang & Koen W. de Bock & Jianzhou Wang & Lifang Zhang & Xinsong Niu, 2024. "Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction," Post-Print hal-04273578, HAL.
    20. Dongil Kim & Seokho Kang, 2019. "Effect of Irrelevant Variables on Faulty Wafer Detection in Semiconductor Manufacturing," Energies, MDPI, vol. 12(13), pages 1-11, July.
    21. Clemente-Císcar, M. & San Matías, S. & Giner-Bosch, V., 2014. "A methodology based on profitability criteria for defining the partial defection of customers in non-contractual settings," European Journal of Operational Research, Elsevier, vol. 239(1), pages 276-285.
    22. Liu, Zhenkun & Jiang, Ping & De Bock, Koen W. & Wang, Jianzhou & Zhang, Lifang & Niu, Xinsong, 2024. "Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
    23. Thiago Carvalho dos Reis Silveira & Cezar Augusto Romano & Tatiana Maria Cecy Gadda, 2022. "Loyalty and public transit: a quantitative systematic review of the literature," Transport Reviews, Taylor & Francis Journals, vol. 42(3), pages 362-383, May.
    24. Liu, Zhenkun & De Bock, Koen W. & Zhang, Lifang, 2025. "Explainable profit-driven hotel booking cancellation prediction based on heterogeneous stacking-based ensemble classification," European Journal of Operational Research, Elsevier, vol. 321(1), pages 284-301.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Ghazaleh Motamedi & Alireza Sheikh & Alireza Hashemi Golpayegani & Samira Khodabandehlou, 2026. "A hybrid approach for customer churn prediction and prevention in tourism," Information Technology & Tourism, Springer, vol. 28(1), pages 1-36, June.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liu, Zhenkun & Zhang, Ying & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Hufang & Gao, Yuyang & Chen, Yinghao, 2024. "Profit-driven fusion framework based on bagging and boosting classifiers for potential purchaser prediction," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    2. Rahman, Shimanto & Janssens, Bram & Bogaert, Matthias, 2025. "Profit-driven pre-processing in B2B customer churn modeling using fairness techniques," Journal of Business Research, Elsevier, vol. 189(C).
    3. Feng, Yi & Yin, Yunqiang & Wang, Dujuan & Ignatius, Joshua & Cheng, T.C.E. & Marra, Marianna & Guo, Yihan, 2024. "Enhancing e-commerce customer churn management with a profit- and AUC-focused prescriptive analytics approach," Journal of Business Research, Elsevier, vol. 184(C).
    4. Bram Janssens & Matthias Bogaert & Astrid Bagué & Dirk Van den Poel, 2024. "B2Boost: instance-dependent profit-driven modelling of B2B churn," Annals of Operations Research, Springer, vol. 341(1), pages 267-293, October.
    5. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    6. Liu, Zhenkun & Jiang, Ping & De Bock, Koen W. & Wang, Jianzhou & Zhang, Lifang & Niu, Xinsong, 2024. "Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
    7. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    8. Usama Arshad & Gohar Khan & Fawaz Khaled Alarfaj & Zahid Halim & Sajid Anwar, 2025. "Q-ensemble learning for customer churn prediction with blockchain-enabled data transparency," Annals of Operations Research, Springer, vol. 353(2), pages 607-633, October.
    9. Arno Caigny & Kristof Coussement & Matthijs Meire & Steven Hoornaert, 2025. "Investigating the impact of undersampling and bagging: an empirical investigation for customer attrition modeling," Annals of Operations Research, Springer, vol. 346(3), pages 2401-2421, March.
    10. Koen W. De Bock & Matthias Bogaert & Philippe Jardin, 2025. "Ensemble learning for operations research and business analytics," Annals of Operations Research, Springer, vol. 353(2), pages 419-448, October.
    11. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    12. Wang, Lei & Wang, Xinyu & Zhao, Zhongchao, 2024. "Mid-term electricity demand forecasting using improved multi-mode reconstruction and particle swarm-enhanced support vector regression," Energy, Elsevier, vol. 304(C).
    13. Arno de Caigny & Kristof Coussement & Koen de Bock, 2020. "Leveraging fine-grained transaction data for customer life event predictions," Post-Print hal-02507998, HAL.
    14. Vasileios Gkonis & Ioannis Tsakalos, 2025. "Deep Dive Into Churn Prediction in the Banking Sector: The Challenge of Hyperparameter Selection and Imbalanced Learning," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 44(2), pages 281-296, March.
    15. Hamed Jahani & Richa Jain & Dmitry Ivanov, 2026. "Data science and big data analytics: a systematic review of methodologies used in the supply chain and logistics research," Annals of Operations Research, Springer, vol. 359(2), pages 1297-1354, April.
    16. Zengyuan Wu & Lizheng Jing & Bei Wu & Lingmin Jin, 2025. "A PCA-AdaBoost model for E-commerce customer churn prediction," Annals of Operations Research, Springer, vol. 350(2), pages 537-554, July.
    17. Lewlisa Saha & Hrudaya Kumar Tripathy & Tarek Gaber & Hatem El-Gohary & El-Sayed M. El-kenawy, 2023. "Deep Churn Prediction Method for Telecommunication Industry," Sustainability, MDPI, vol. 15(5), pages 1-21, March.
    18. De Bock, Koen W. & Coussement, Kristof & Caigny, Arno De & Słowiński, Roman & Baesens, Bart & Boute, Robert N. & Choi, Tsan-Ming & Delen, Dursun & Kraus, Mathias & Lessmann, Stefan & Maldonado, Sebast, 2024. "Explainable AI for Operational Research: A defining framework, methods, applications, and a research agenda," European Journal of Operational Research, Elsevier, vol. 317(2), pages 249-272.
    19. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    20. Liu, Zhenkun & De Bock, Koen W. & Zhang, Lifang, 2025. "Explainable profit-driven hotel booking cancellation prediction based on heterogeneous stacking-based ensemble classification," European Journal of Operational Research, Elsevier, vol. 321(1), pages 284-301.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:joreco:v:85:y:2025:i:c:s0969698925000670. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: https://www.journals.elsevier.com/journal-of-retailing-and-consumer-services .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.