IDEAS home Printed from https://ideas.repec.org/a/spr/trosos/v16y2022i2d10.1007_s12626-022-00127-7.html
   My bibliography  Save this article

Enhancing the Predictive Performance of Credibility-Based Fake News Detection Using Ensemble Learning

Author

Listed:
  • Amit Neil Ramkissoon

    (The University of the West Indies at St Augustine)

  • Wayne Goodridge

    (The University of the West Indies at St Augustine)

Abstract

Fake news detection continues to be a major problem that affects our society today. Fake news can be classified using a variety of methods. Predicting and detecting fake news has proven to be challenging even for machine learning algorithms. This research employs Legitimacy, a unique ensemble machine learning model to accomplish the task of Credibility-Based Fake News Detection. The Legitimacy ensemble combines the learning potential of a Two-Class Boosted Decision Tree and a Two-Class Neural Network. The ensemble technique follows a pseudo-mixture-of-experts methodology. For the gating model, an instance of Two-Class Logistic Regression is implemented. This study validates Legitimacy using a standard dataset with features relating to the credibility of news publishers to predict fake news. These features are analysed using the ensemble algorithm. The results of these experiments are examined using four evaluation methodologies. The analysis of the results reveals positive performance with the use of the ensemble ML method with an accuracy of 96.9%. This ensemble’s performance is compared with the performance of the two base machine learning models of the ensemble. The performance of the ensemble surpasses that of the two base models. The performance of Legitimacy is also analysed as the size of the dataset increases to demonstrate its scalability. Hence, based on our selected dataset, the Legitimacy ensemble model has proven to be most appropriate for Credibility-Based Fake News Detection.

Suggested Citation

  • Amit Neil Ramkissoon & Wayne Goodridge, 2022. "Enhancing the Predictive Performance of Credibility-Based Fake News Detection Using Ensemble Learning," The Review of Socionetwork Strategies, Springer, vol. 16(2), pages 259-289, October.
  • Handle: RePEc:spr:trosos:v:16:y:2022:i:2:d:10.1007_s12626-022-00127-7
    DOI: 10.1007/s12626-022-00127-7
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s12626-022-00127-7
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s12626-022-00127-7?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Amit Neil Ramkissoon & Shareeda Mohammed & Wayne Goodridge, 2021. "Determining an Optimal Data Classification Model for Credibility-Based Fake News Detection," The Review of Socionetwork Strategies, Springer, vol. 15(2), pages 347-380, November.
    2. Arno de Caigny & Kristof Coussement & Koen W. de Bock, 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," Post-Print hal-01741661, HAL.
    3. Iftikhar Ahmad & Muhammad Yousaf & Suhail Yousaf & Muhammad Ovais Ahmad, 2020. "Fake News Detection Using Machine Learning Ensemble Methods," Complexity, Hindawi, vol. 2020, pages 1-11, October.
    4. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W., 2018. "A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees," European Journal of Operational Research, Elsevier, vol. 269(2), pages 760-772.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chou, Ping & Chuang, Howard Hao-Chun & Chou, Yen-Chun & Liang, Ting-Peng, 2022. "Predictive analytics for customer repurchase: Interdisciplinary integration of buy till you die modeling and machine learning," European Journal of Operational Research, Elsevier, vol. 296(2), pages 635-651.
    2. Koen W. de Bock & Arno de Caigny, 2021. "Spline-rule ensemble classifiers with structured sparsity regularization for interpretable customer churn modeling," Post-Print hal-03391564, HAL.
    3. Louis Geiler & Séverine Affeldt & Mohamed Nadif, 2022. "A survey on machine learning methods for churn prediction," Post-Print hal-03824873, HAL.
    4. Arno de Caigny & Kristof Coussement & Koen W. de Bock & Stefan Lessmann, 2019. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," Post-Print hal-02275958, HAL.
    5. Matthias Bogaert & Lex Delaere, 2023. "Ensemble Methods in Customer Churn Prediction: A Comparative Analysis of the State-of-the-Art," Mathematics, MDPI, vol. 11(5), pages 1-28, February.
    6. De Caigny, Arno & Coussement, Kristof & De Bock, Koen W. & Lessmann, Stefan, 2020. "Incorporating textual information in customer churn prediction models based on a convolutional neural network," International Journal of Forecasting, Elsevier, vol. 36(4), pages 1563-1578.
    7. Yuan, Kunpeng & Chi, Guotai & Zhou, Ying & Yin, Hailei, 2022. "A novel two-stage hybrid default prediction model with k-means clustering and support vector domain description," Research in International Business and Finance, Elsevier, vol. 59(C).
    8. Youngkeun Choi & Jae W. Choi, 2023. "Assessing the Predictive Performance of Machine Learning in Direct Marketing Response," International Journal of E-Business Research (IJEBR), IGI Global, vol. 19(1), pages 1-12, January.
    9. Arno de Caigny & Kristof Coussement & Koen de Bock, 2020. "Leveraging fine-grained transaction data for customer life event predictions," Post-Print hal-02507998, HAL.
    10. Chen, Yan & Zhang, Lei & Zhao, Yulu & Xu, Bing, 2022. "Implementation of penalized survival models in churn prediction of vehicle insurance," Journal of Business Research, Elsevier, vol. 153(C), pages 162-171.
    11. Dumitrescu, Elena & Hué, Sullivan & Hurlin, Christophe & Tokpavi, Sessi, 2022. "Machine learning for credit scoring: Improving logistic regression with non-linear decision-tree effects," European Journal of Operational Research, Elsevier, vol. 297(3), pages 1178-1192.
    12. Li, Mengyu & Vanberkel, Peter & Zhong, Xiang, 2022. "Predicting ambulance offload delay using a hybrid decision tree model," Socio-Economic Planning Sciences, Elsevier, vol. 80(C).
    13. Narendra Singh & Pushpa Singh & Mukul Gupta, 2020. "An inclusive survey on machine learning for CRM: a paradigm shift," DECISION: Official Journal of the Indian Institute of Management Calcutta, Springer;Indian Institute of Management Calcutta, vol. 47(4), pages 447-457, December.
    14. Petra P. Šimović & Claire Y. T. Chen & Edward W. Sun, 2023. "Classifying the Variety of Customers’ Online Engagement for Churn Prediction with a Mixed-Penalty Logistic Regression," Computational Economics, Springer;Society for Computational Economics, vol. 61(1), pages 451-485, January.
    15. Ni, Ji & Chen, Bowei & Allinson, Nigel M. & Ye, Xujiong, 2020. "A hybrid model for predicting human physical activity status from lifelogging data," European Journal of Operational Research, Elsevier, vol. 281(3), pages 532-542.
    16. Gattermann-Itschert, Theresa & Thonemann, Ulrich W., 2021. "How training on multiple time slices improves performance in churn prediction," European Journal of Operational Research, Elsevier, vol. 295(2), pages 664-674.
    17. Elena Ivona DUMITRESCU & Sullivan HUE & Christophe HURLIN & Sessi TOKPAVI, 2020. "Machine Learning or Econometrics for Credit Scoring: Let’s Get the Best of Both Worlds," LEO Working Papers / DR LEO 2839, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    18. Amin, Adnan & Shah, Babar & Khattak, Asad Masood & Lopes Moreira, Fernando Joaquim & Ali, Gohar & Rocha, Alvaro & Anwar, Sajid, 2019. "Cross-company customer churn prediction in telecommunication: A comparison of data transformation methods," International Journal of Information Management, Elsevier, vol. 46(C), pages 304-319.
    19. Lewlisa Saha & Hrudaya Kumar Tripathy & Tarek Gaber & Hatem El-Gohary & El-Sayed M. El-kenawy, 2023. "Deep Churn Prediction Method for Telecommunication Industry," Sustainability, MDPI, vol. 15(5), pages 1-21, March.
    20. Wen Zhang & Andrew Dunkley & Urvi Kanabar & David Elliott & Henry P. Wynn, 2022. "A decision support system for liability in civil litigation: a case study from an insurance company," Annals of Operations Research, Springer, vol. 315(2), pages 695-706, August.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:trosos:v:16:y:2022:i:2:d:10.1007_s12626-022-00127-7. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.