IDEAS home Printed from https://ideas.repec.org/a/spr/infosf/v27y2025i2d10.1007_s10796-023-10446-x.html
   My bibliography  Save this article

Comparing Machine Learning and Deep Learning Techniques for Text Analytics: Detecting the Severity of Hate Comments Online

Author

Listed:
  • Alaa Marshan

    (University of Surrey)

  • Farah Nasreen Mohamed Nizar

    (Brunel University)

  • Athina Ioannou

    (University of Surrey)

  • Konstantina Spanaki

    (Audencia Business School)

Abstract

Social media platforms have become an increasingly popular tool for individuals to share their thoughts and opinions with other people. However, very often people tend to misuse social media posting abusive comments. Abusive and harassing behaviours can have adverse effects on people's lives. This study takes a novel approach to combat harassment in online platforms by detecting the severity of abusive comments, that has not been investigated before. The study compares the performance of machine learning models such as Naïve Bayes, Random Forest, and Support Vector Machine, with deep learning models such as Convolutional Neural Network (CNN) and Bi-directional Long Short-Term Memory (Bi-LSTM). Moreover, in this work we investigate the effect of text pre-processing on the performance of the machine and deep learning models, the feature set for the abusive comments was made using unigrams and bigrams for the machine learning models and word embeddings for the deep learning models. The comparison of the models’ performances showed that the Random Forest with bigrams achieved the best overall performance with an accuracy of (0.94), a precision of (0.91), a recall of (0.94), and an F1 score of (0.92). The study develops an efficient model to detect severity of abusive language in online platforms, offering important implications both to theory and practice.

Suggested Citation

  • Alaa Marshan & Farah Nasreen Mohamed Nizar & Athina Ioannou & Konstantina Spanaki, 2025. "Comparing Machine Learning and Deep Learning Techniques for Text Analytics: Detecting the Severity of Hate Comments Online," Information Systems Frontiers, Springer, vol. 27(2), pages 487-505, April.
  • Handle: RePEc:spr:infosf:v:27:y:2025:i:2:d:10.1007_s10796-023-10446-x
    DOI: 10.1007/s10796-023-10446-x
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s10796-023-10446-x
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s10796-023-10446-x?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Jyoti Prakash Singh & Abhinav Kumar & Nripendra P. Rana & Yogesh K. Dwivedi, 2022. "Attention-Based LSTM Network for Rumor Veracity Estimation of Tweets," Information Systems Frontiers, Springer, vol. 24(2), pages 459-474, April.
    2. Bandeh Ali Talpur & Declan O’Sullivan, 2020. "Cyberbullying severity detection: A machine learning approach," PLOS ONE, Public Library of Science, vol. 15(10), pages 1-19, October.
    3. Minh Phan & Arno de Caigny & Kristof Coussement, 2023. "A decision support framework to incorporate textual data for early student dropout prediction in higher education," Post-Print hal-04274684, HAL.
    4. Amgad Muneer & Suliman Mohamed Fati, 2020. "A Comparative Analysis of Machine Learning Techniques for Cyberbullying Detection on Twitter," Future Internet, MDPI, vol. 12(11), pages 1-20, October.
    5. Gabriel Ferrettini & Elodie Escriva & Julien Aligon & Jean-Baptiste Excoffier & Chantal Soulé-Dupuy, 2022. "Coalitional Strategies for Efficient Individual Prediction Explanation," Information Systems Frontiers, Springer, vol. 24(1), pages 49-75, February.
    6. Alice Tontodimamma & Eugenia Nissi & Annalina Sarra & Lara Fontanella, 2021. "Thirty years of research into hate speech: topics of interest and their evolution," Scientometrics, Springer;Akadémiai Kiadó, vol. 126(1), pages 157-179, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Pablo Madriaza & Ghayda Hassan & Sébastien Brouillette‐Alarie & Aoudou Njingouo Mounchingam & Loïc Durocher‐Corfa & Eugene Borokhovski & David Pickup & Sabrina Paillé, 2025. "Exposure to hate in online and traditional media: A systematic review and meta‐analysis of the impact of this exposure on individuals and communities," Campbell Systematic Reviews, John Wiley & Sons, vol. 21(1), March.
    2. JongSerl Chun & Serim Lee & Jinyung Kim, 2024. "Ontology Development for Cyber Violence Victimization in Korean Adolescents," SAGE Open, , vol. 14(2), pages 21582440241, May.
    3. Dursun Delen & Behrooz Davazdahemami & Elham Rasouli Dezfouli, 2024. "Predicting and Mitigating Freshmen Student Attrition: A Local-Explainable Machine Learning Framework," Information Systems Frontiers, Springer, vol. 26(2), pages 641-662, April.
    4. Zhang, Xi & Cheng, Yihang & Chen, Aoshuang & Lytras, Miltiadis & de Pablos, Patricia Ordóñez & Zhang, Renyu, 2022. "How rumors diffuse in the infodemic: Evidence from the healthy online social change in China," Technological Forecasting and Social Change, Elsevier, vol. 185(C).
    5. Suliman Mohamed Fati & Amgad Muneer & Ayed Alwadain & Abdullateef O. Balogun, 2023. "Cyberbullying Detection on Twitter Using Deep Learning-Based Attention Mechanisms and Continuous Bag of Words Feature Extraction," Mathematics, MDPI, vol. 11(16), pages 1-21, August.
    6. Jérôme Darmont & Boris Novikov & Robert Wrembel & Ladjel Bellatreche, 2022. "Advances on Data Management and Information Systems," Information Systems Frontiers, Springer, vol. 24(1), pages 1-10, February.
    7. Badiee, Aghdas & Moshtari, Mohammad & Berenguer, Gemma, 2024. "A systematic review of operations research and management science modeling techniques in the study of higher education institutions," Socio-Economic Planning Sciences, Elsevier, vol. 93(C).
    8. Hongzhe Kang & Yao Wang & Min Wang & Megat Imran Yasin & Mohd Nizam Osman & Lay Hoon Ang, 2024. "Navigating Digital Network: Mindfulness as a Shield Against Cyberbullying in the Knowledge Economy Era," Journal of the Knowledge Economy, Springer;Portland International Center for Management of Engineering and Technology (PICMET), vol. 15(3), pages 13233-13271, September.
    9. Emmanuel W. Ayaburi & Francis Kofi Andoh-Baidoo & Yogesh K. Dwivedi & Banita Lal, 2022. "Editorial: Special Issue on “Bright ICT: Security, Privacy and Risk Issues”," Information Systems Frontiers, Springer, vol. 24(2), pages 371-373, April.
    10. Ebrahim A. A. Ghaleb & P. D. D. Dominic & Suliman Mohamed Fati & Amgad Muneer & Rao Faizan Ali, 2021. "The Assessment of Big Data Adoption Readiness with a Technology–Organization–Environment Framework: A Perspective towards Healthcare Employees," Sustainability, MDPI, vol. 13(15), pages 1-33, July.
    11. Jung Ryeol Park & Yituo Feng, 2023. "Trajectory tracking of changes digital divide prediction factors in the elderly through machine learning," PLOS ONE, Public Library of Science, vol. 18(2), pages 1-20, February.
    12. José Manuel Ortiz-Marcos & María Tomé-Fernández & Christian Fernández-Leyva, 2021. "Cyberbullying Analysis in Intercultural Educational Environments Using Binary Logistic Regressions," Future Internet, MDPI, vol. 13(1), pages 1-15, January.
    13. Thuy, Arthur & Benoit, Dries F., 2024. "Explainability through uncertainty: Trustworthy decision-making with neural networks," European Journal of Operational Research, Elsevier, vol. 317(2), pages 330-340.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:infosf:v:27:y:2025:i:2:d:10.1007_s10796-023-10446-x. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.