IDEAS home Printed from https://ideas.repec.org/a/eee/joreco/v85y2025ics0969698925000670.html

Improving customer retention in taxi industry using travel data analytics: A churn prediction study

Author

Listed:
  • Loureiro, A.L.D.
  • Miguéis, V.L.
  • Costa, Álvaro
  • Ferreira, Michel

Abstract

The retention of public transport users is widely acknowledged as a paramount challenge on the path towards more sustainable cities and societies. In this setting, in which no contractual relationship with customers exists, an early and accurate prediction of whether a customer will remain with the company or leave is of great significance for businesses developing effective retention strategies. This work addresses this topic by identifying potential churners based on their past travel behavior. To achieve this, we developed a set of classification models using various machine learning techniques. These models were then employed as base learners within a stacking ensemble. All classifiers were developed with a profit-driven approach, optimizing for expected maximum profit. Finally, we calculated Shapley Additive Explanation values to enhance the interpretability of the proposed classifiers. The performance of the predictive models was evaluated using taxi service data recorded in a Portuguese city over 52 months. A broad range of predictors is proposed, including recency and frequency measures of taxi usage as well as others related to customers' satisfaction level. The predictive power of the models was also assessed for specific proportions of higher-risk customers. All models were able to identify churners accurately. This study innovates in evaluating the one-to-one relationship between service provider and customer in the context of the taxi industry. Actions to promote customer loyalty and enhance retention are also suggested.
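
As a rough illustration of two ideas mentioned in the abstract (recency and frequency predictors, and evaluation on a chosen proportion of higher-risk customers), the sketch below uses plain Python on hypothetical toy data. It is not the authors' pipeline: the function names, the trip records, the churn labels, and the naive risk score (recency divided by frequency) are all assumptions made for illustration, and the stacking ensemble, profit-driven optimization, and SHAP analysis are not reproduced here.

```python
from datetime import date


def recency_frequency(trips, cutoff):
    """Map each customer to (days since last trip, number of trips).

    `trips` is an iterable of (customer_id, trip_date) pairs; trips dated
    after `cutoff` are ignored.
    """
    last_trip, counts = {}, {}
    for customer_id, trip_date in trips:
        if trip_date > cutoff:
            continue  # outside the observation window
        counts[customer_id] = counts.get(customer_id, 0) + 1
        if customer_id not in last_trip or trip_date > last_trip[customer_id]:
            last_trip[customer_id] = trip_date
    return {c: ((cutoff - last_trip[c]).days, counts[c]) for c in counts}


def top_fraction_lift(scores, churned, fraction):
    """Churn rate in the top `fraction` of scored customers over the base rate."""
    ranked = sorted(scores, key=scores.get, reverse=True)
    k = max(1, round(len(ranked) * fraction))
    top_rate = sum(churned[c] for c in ranked[:k]) / k
    base_rate = sum(churned.values()) / len(churned)
    return top_rate / base_rate


# Hypothetical toy data, not taken from the study.
trips = [
    ("a", date(2024, 1, 5)), ("a", date(2024, 3, 20)),
    ("b", date(2024, 1, 10)),
    ("c", date(2024, 2, 1)), ("c", date(2024, 3, 28)), ("c", date(2024, 3, 30)),
    ("d", date(2023, 12, 15)),
]
cutoff = date(2024, 3, 31)
features = recency_frequency(trips, cutoff)

# Naive illustrative score: long recency and few trips suggest higher risk.
scores = {c: recency / frequency for c, (recency, frequency) in features.items()}
churned = {"a": 0, "b": 1, "c": 0, "d": 1}  # assumed labels: 1 = churned
lift = top_fraction_lift(scores, churned, 0.5)
```

In practice the scores would come from a trained classifier rather than a hand-made ratio; the lift calculation stays the same whichever model produces them.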

Suggested Citation

  • Loureiro, A.L.D. & Miguéis, V.L. & Costa, Álvaro & Ferreira, Michel, 2025. "Improving customer retention in taxi industry using travel data analytics: A churn prediction study," Journal of Retailing and Consumer Services, Elsevier, vol. 85(C).
  • Handle: RePEc:eee:joreco:v:85:y:2025:i:c:s0969698925000670
    DOI: 10.1016/j.jretconser.2025.104288

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0969698925000670
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jretconser.2025.104288?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item

    References listed on IDEAS

    1. Ning Huan & Stephane Hess & Enjian Yao, 2022. "Understanding the effects of travel demand management on metro commuters’ behavioural loyalty: a hybrid choice modelling approach," Transportation, Springer, vol. 49(2), pages 343-372, April.
    2. Yang, Shuo & Fan, Yingling & Deng, Wei & Cheng, Long, 2019. "Do built environment effects on travel behavior differ between household members? A case study of Nanjing, China," Transport Policy, Elsevier, vol. 81(C), pages 360-370.
    3. Z. Liu & K. de Bock & L. Zhang, 2025. "Explainable Profit-Driven Hotel Booking Cancellation Prediction based on Heterogeneous Stacking-Based Ensemble Classification," Post-Print hal-04679539, HAL.
    4. Jiang, Ping & Liu, Zhenkun & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Wendong & Dong, Qingli, 2024. "Profit-driven weighted classifier with interpretable ability for customer churn prediction," Omega, Elsevier, vol. 125(C).
    5. Zhenkun Liu & Ping Jiang & Koen W. de Bock & Jianzhou Wang & Lifang Zhang & Xinsong Niu, 2024. "Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction," Post-Print hal-04273578, HAL.
    6. Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July.
    7. Sukhov, Alexandre & Friman, Margareta & Olsson, Lars E., 2023. "Unlocking potential: An integrated approach using PLS-SEM, NCA, and fsQCA for informed decision making," Journal of Retailing and Consumer Services, Elsevier, vol. 74(C).
    8. Arno de Caigny & Kristof Coussement & Koen W. de Bock, 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," Post-Print hal-01741661, HAL.
    9. Wang, Jiangbo & Yamamoto, Toshiyuki & Liu, Kai, 2022. "Exploring the subscribing behavior of customized bus passengers: Active users versus inactive users," Journal of choice modelling, Elsevier, vol. 43(C).
    10. Maldonado, Sebastián & López, Julio & Vairetti, Carla, 2020. "Profit-based churn prediction based on Minimax Probability Machines," European Journal of Operational Research, Elsevier, vol. 284(1), pages 273-284.
    11. Ma, Zhenliang & Koutsopoulos, Haris N. & Liu, Tianyou & Basu, Abhishek Arunasis, 2020. "Behavioral response to promotion-based public transport demand management: Longitudinal analysis and implications for optimal promotion design," Transportation Research Part A: Policy and Practice, Elsevier, vol. 141(C), pages 356-372.
    12. Dongil Kim & Seokho Kang, 2019. "Effect of Irrelevant Variables on Faulty Wafer Detection in Semiconductor Manufacturing," Energies, MDPI, vol. 12(13), pages 1-11, July.
    13. Liu, Zhenkun & Jiang, Ping & De Bock, Koen W. & Wang, Jianzhou & Zhang, Lifang & Niu, Xinsong, 2024. "Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
    14. Liu, Zhenkun & De Bock, Koen W. & Zhang, Lifang, 2025. "Explainable profit-driven hotel booking cancellation prediction based on heterogeneous stacking-based ensemble classification," European Journal of Operational Research, Elsevier, vol. 321(1), pages 284-301.
    15. Oyebayo Ridwan Olaniran & Ali Rashash R. Alzahrani, 2023. "On the Oracle Properties of Bayesian Random Forest for Sparse High-Dimensional Gaussian Regression," Mathematics, MDPI, vol. 11(24), pages 1-29, December.
    16. Dea van Lierop & Madhav G. Badami & Ahmed M. El-Geneidy, 2018. "What influences satisfaction and loyalty in public transport? A review of the literature," Transport Reviews, Taylor & Francis Journals, vol. 38(1), pages 52-72, January.
    17. Liu, Zhenkun & Zhang, Ying & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Hufang & Gao, Yuyang & Chen, Yinghao, 2024. "Profit-driven fusion framework based on bagging and boosting classifiers for potential purchaser prediction," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    18. Clemente-Císcar, M. & San Matías, S. & Giner-Bosch, V., 2014. "A methodology based on profitability criteria for defining the partial defection of customers in non-contractual settings," European Journal of Operational Research, Elsevier, vol. 239(1), pages 276-285.
    19. Thiago Carvalho dos Reis Silveira & Cezar Augusto Romano & Tatiana Maria Cecy Gadda, 2022. "Loyalty and public transit: a quantitative systematic review of the literature," Transport Reviews, Taylor & Francis Journals, vol. 42(3), pages 362-383, May.
    20. Saleh, Wafaa & Farrell, Séona, 2007. "Investigation and analysis of evidence of asymmetric churn in travel demand models," Transportation Research Part A: Policy and Practice, Elsevier, vol. 41(7), pages 691-702, August.
    21. V. L. Miguéis & D. Van Den Poel & A.S. Camanho & J. Falcao E Cunha, 2012. "Modeling Partial Customer Churn: On the Value of First Product-Category Purchase Sequences," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/790, Ghent University, Faculty of Economics and Business Administration.
    22. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    23. Efthymiou, Dimitrios & Antoniou, Constantinos & Tyrinopoulos, Yannis & Skaltsogianni, Eleana, 2018. "Factors affecting bus users’ satisfaction in times of economic crisis," Transportation Research Part A: Policy and Practice, Elsevier, vol. 114(PA), pages 3-12.
    24. Höppner, Sebastiaan & Stripling, Eugen & Baesens, Bart & Broucke, Seppe vanden & Verdonck, Tim, 2020. "Profit driven decision trees for churn prediction," European Journal of Operational Research, Elsevier, vol. 284(3), pages 920-933.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Liu, Zhenkun & Zhang, Ying & Abedin, Mohammad Zoynul & Wang, Jianzhou & Yang, Hufang & Gao, Yuyang & Chen, Yinghao, 2024. "Profit-driven fusion framework based on bagging and boosting classifiers for potential purchaser prediction," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    2. Feng, Yi & Yin, Yunqiang & Wang, Dujuan & Ignatius, Joshua & Cheng, T.C.E. & Marra, Marianna & Guo, Yihan, 2024. "Enhancing e-commerce customer churn management with a profit- and AUC-focused prescriptive analytics approach," Journal of Business Research, Elsevier, vol. 184(C).
    3. Rahman, Shimanto & Janssens, Bram & Bogaert, Matthias, 2025. "Profit-driven pre-processing in B2B customer churn modeling using fairness techniques," Journal of Business Research, Elsevier, vol. 189(C).
    4. Bram Janssens & Matthias Bogaert & Astrid Bagué & Dirk Van den Poel, 2024. "B2Boost: instance-dependent profit-driven modelling of B2B churn," Annals of Operations Research, Springer, vol. 341(1), pages 267-293, October.
    5. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    6. Liu, Zhenkun & Jiang, Ping & De Bock, Koen W. & Wang, Jianzhou & Zhang, Lifang & Niu, Xinsong, 2024. "Extreme gradient boosting trees with efficient Bayesian optimization for profit-driven customer churn prediction," Technological Forecasting and Social Change, Elsevier, vol. 198(C).
    7. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    8. Arno Caigny & Kristof Coussement & Matthijs Meire & Steven Hoornaert, 2025. "Investigating the impact of undersampling and bagging: an empirical investigation for customer attrition modeling," Annals of Operations Research, Springer, vol. 346(3), pages 2401-2421, March.
    9. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    10. Wang, Lei & Wang, Xinyu & Zhao, Zhongchao, 2024. "Mid-term electricity demand forecasting using improved multi-mode reconstruction and particle swarm-enhanced support vector regression," Energy, Elsevier, vol. 304(C).
    11. Arno de Caigny & Kristof Coussement & Koen de Bock, 2020. "Leveraging fine-grained transaction data for customer life event predictions," Post-Print hal-02507998, HAL.
    12. Lewlisa Saha & Hrudaya Kumar Tripathy & Tarek Gaber & Hatem El-Gohary & El-Sayed M. El-kenawy, 2023. "Deep Churn Prediction Method for Telecommunication Industry," Sustainability, MDPI, vol. 15(5), pages 1-21, March.
    13. De Bock, Koen W. & Coussement, Kristof & Caigny, Arno De & Słowiński, Roman & Baesens, Bart & Boute, Robert N. & Choi, Tsan-Ming & Delen, Dursun & Kraus, Mathias & Lessmann, Stefan & Maldonado, Sebast, 2024. "Explainable AI for Operational Research: A defining framework, methods, applications, and a research agenda," European Journal of Operational Research, Elsevier, vol. 317(2), pages 249-272.
    14. Koen W. de Bock & Kristof Coussement & Arno De Caigny & Roman Slowiński & Bart Baesens & Robert N Boute & Tsan-Ming Choi & Dursun Delen & Mathias Kraus & Stefan Lessmann & Sebastián Maldonado & David , 2023. "Explainable AI for Operational Research: A Defining Framework, Methods, Applications, and a Research Agenda," Post-Print hal-04219546, HAL.
    15. Gómez-Vargas, Nuria & Maldonado, Sebastián & Vairetti, Carla, 2025. "A predict-and-optimize approach to profit-driven churn prevention," European Journal of Operational Research, Elsevier, vol. 324(2), pages 555-566.
    16. Niu, Zhewen & Han, Xiaoqing & Zhang, Dongxia & Wu, Yuxiang & Lan, Songyan, 2024. "Interpretable wind power forecasting combining seasonal-trend representations learning with temporal fusion transformers architecture," Energy, Elsevier, vol. 306(C).
    17. Vasileios Gkonis & Ioannis Tsakalos, 2025. "Deep Dive Into Churn Prediction in the Banking Sector: The Challenge of Hyperparameter Selection and Imbalanced Learning," Journal of Forecasting, John Wiley & Sons, Ltd., vol. 44(2), pages 281-296, March.
    18. Liu, Zhenkun & De Bock, Koen W. & Zhang, Lifang, 2025. "Explainable profit-driven hotel booking cancellation prediction based on heterogeneous stacking-based ensemble classification," European Journal of Operational Research, Elsevier, vol. 321(1), pages 284-301.
    19. Chandrasekhar Valluri & Sudhakar Raju & Vivek H. Patil, 2022. "Customer determinants of used auto loan churn: comparing predictive performance using machine learning techniques," Journal of Marketing Analytics, Palgrave Macmillan, vol. 10(3), pages 279-296, September.
    20. Koen W. de Bock & Arno de Caigny, 2021. "Spline-rule ensemble classifiers with structured sparsity regularization for interpretable customer churn modeling," Post-Print hal-03391564, HAL.

