IDEAS home Printed from https://ideas.repec.org/a/eee/csdana/v181y2023ics0167947322002638.html
   My bibliography  Save this article

On the optimal binary classifier with an application

Author

Listed:
  • López-Díaz, María Concepción
  • López-Díaz, Miguel
  • Martínez-Fernández, Sergio

Abstract

The alternative accumulated improvement curve stochastic order is a criterion for the comparison of the performance of classifiers that predict binary responses. An explicit optimal classifier for this criterion is obtained. That optimal classifier has the largest ROC and CAP curves and indexes, that is, it is also optimal for the criteria based on the comparison of such curves and indexes. An application of the results to the search of the best classifier to predict clients of a bank which will make a transaction in the future is developed.

Suggested Citation

  • López-Díaz, María Concepción & López-Díaz, Miguel & Martínez-Fernández, Sergio, 2023. "On the optimal binary classifier with an application," Computational Statistics & Data Analysis, Elsevier, vol. 181(C).
  • Handle: RePEc:eee:csdana:v:181:y:2023:i:c:s0167947322002638
    DOI: 10.1016/j.csda.2022.107683
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0167947322002638
    Download Restriction: Full text for ScienceDirect subscribers only.

    File URL: https://libkey.io/10.1016/j.csda.2022.107683?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Buckinx, Wouter & Van den Poel, Dirk, 2005. "Customer base analysis: partial defection of behaviourally loyal clients in a non-contractual FMCG retail setting," European Journal of Operational Research, Elsevier, vol. 164(1), pages 252-268, July.
    2. Yousef, Waleed A., 2013. "Assessing classifiers in terms of the partial area under the ROC curve," Computational Statistics & Data Analysis, Elsevier, vol. 64(C), pages 51-70.
    3. Yuxin Zhu & Mei‐Cheng Wang, 2022. "Obtaining optimal cutoff values for tree classifiers using multiple biomarkers," Biometrics, The International Biometric Society, vol. 78(1), pages 128-140, March.
    4. David J. Hand, 2012. "Assessing the Performance of Classification Methods," International Statistical Review, International Statistical Institute, vol. 80(3), pages 400-414, December.
    5. Jiayin Qi & Li Zhang & Yanping Liu & Ling Li & Yongpin Zhou & Yao Shen & Liang Liang & Huaizu Li, 2009. "ADTreesLogit model for customer churn prediction," Annals of Operations Research, Springer, vol. 168(1), pages 247-265, April.
    6. Sabri Boughorbel & Fethi Jarray & Mohammed El-Anbari, 2017. "Optimal classifier for imbalanced data using Matthews Correlation Coefficient metric," PLOS ONE, Public Library of Science, vol. 12(6), pages 1-17, June.
    7. Clara-Cecilie Günther & Ingunn Tvete & Kjersti Aas & Geir Sandnes & Ørnulf Borgan, 2014. "Modelling and predicting customer churn from an insurance company," Scandinavian Actuarial Journal, Taylor & Francis Journals, vol. 2014(1), pages 58-71.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Brandner, Hubertus & Lessmann, Stefan & Voß, Stefan, 2013. "A memetic approach to construct transductive discrete support vector machines," European Journal of Operational Research, Elsevier, vol. 230(3), pages 581-595.
    2. K. W. De Bock & D. Van Den Poel, 2012. "Reconciling Performance and Interpretability in Customer Churn Prediction using Ensemble Learning based on Generalized Additive Models," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 12/805, Ghent University, Faculty of Economics and Business Administration.
    3. Gigliarano, Chiara & Figini, Silvia & Muliere, Pietro, 2014. "Making classifier performance comparisons when ROC curves intersect," Computational Statistics & Data Analysis, Elsevier, vol. 77(C), pages 300-312.
    4. Chen, Zhen-Yu & Fan, Zhi-Ping & Sun, Minghe, 2012. "A hierarchical multiple kernel support vector machine for customer churn prediction using longitudinal behavioral data," European Journal of Operational Research, Elsevier, vol. 223(2), pages 461-472.
    5. Risselada, Hans & Verhoef, Peter C. & Bijmolt, Tammo H.A., 2010. "Staying Power of Churn Prediction Models," Journal of Interactive Marketing, Elsevier, vol. 24(3), pages 198-208.
    6. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    7. Koen W. de Bock & Arno de Caigny, 2021. "Spline-rule ensemble classifiers with structured sparsity regularization for interpretable customer churn modeling," Post-Print hal-03391564, HAL.
    8. Jerath, Kinshuk & Fader, Peter S. & Hardie, Bruce G.S., 2016. "Customer-base analysis using repeated cross-sectional summary (RCSS) data," European Journal of Operational Research, Elsevier, vol. 249(1), pages 340-350.
    9. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    10. Johannes Habel & Sascha Alavi & Nicolas Heinitz, 2023. "A theory of predictive sales analytics adoption," AMS Review, Springer;Academy of Marketing Science, vol. 13(1), pages 34-54, June.
    11. Mitrović, Sandra & Baesens, Bart & Lemahieu, Wilfried & De Weerdt, Jochen, 2018. "On the operational efficiency of different feature types for telco Churn prediction," European Journal of Operational Research, Elsevier, vol. 267(3), pages 1141-1155.
    12. Arno de Caigny & Kristof Coussement & Koen de Bock, 2020. "Leveraging fine-grained transaction data for customer life event predictions," Post-Print hal-02507998, HAL.
    13. Wang Chamont & Gevertz Jana L., 2016. "Finding causative genes from high-dimensional data: an appraisal of statistical and machine learning approaches," Statistical Applications in Genetics and Molecular Biology, De Gruyter, vol. 15(4), pages 321-347, August.
    14. Christian Kauten & Ashish Gupta & Xiao Qin & Glenn Richey, 2022. "Predicting Blood Donors Using Machine Learning Techniques," Information Systems Frontiers, Springer, vol. 24(5), pages 1547-1562, October.
    15. K. Coussement & D. Van Den Poel, 2008. "Improving Customer Attrition Prediction by Integrating Emotions from Client/Company Interaction Emails and Evaluating Multiple Classifiers," Working Papers of Faculty of Economics and Business Administration, Ghent University, Belgium 08/527, Ghent University, Faculty of Economics and Business Administration.
    16. Bazargan, Amirhossein & Karray, Salma & Zolfaghari, Saeed, 2017. "Modeling reward expiry for loyalty programs in a competitive market," International Journal of Production Economics, Elsevier, vol. 193(C), pages 352-364.
    17. Marko Sarstedt & Sebastian Scharf & Alexander Thamm & Michael Wolff, 2010. "Die Prognose von Serviceintervallen mit der Hazard-Raten-Analyse – Ergebnisse einer empirischen Studie im Automobilmarkt," Metrika: International Journal for Theoretical and Applied Statistics, Springer, vol. 20(3), pages 269-283, April.
    18. Tan, Pei Jie & Corsi, Armando & Cohen, Justin & Sharp, Anne & Lockshin, Larry & Caruso, William & Bogomolova, Svetlana, 2018. "Assessing the sales effectiveness of differently located endcaps in a supermarket," Journal of Retailing and Consumer Services, Elsevier, vol. 43(C), pages 200-208.
    19. David Cemernek & Sandra Cemernek & Heimo Gursch & Ashwini Pandeshwar & Thomas Leitner & Matthias Berger & Gerald Klösch & Roman Kern, 2022. "Machine learning in continuous casting of steel: a state-of-the-art survey," Journal of Intelligent Manufacturing, Springer, vol. 33(6), pages 1561-1579, August.
    20. David J. Hand, 2022. "Trustworthiness of statistical inference," Journal of the Royal Statistical Society Series A, Royal Statistical Society, vol. 185(1), pages 329-347, January.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:csdana:v:181:y:2023:i:c:s0167947322002638. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/csda .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.