IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0302583.html
   My bibliography  Save this article

HAPI: An efficient Hybrid Feature Engineering-based Approach for Propaganda Identification in social media

Author

Listed:
  • Akib Mohi Ud Din Khanday
  • Mudasir Ahmad Wani
  • Syed Tanzeel Rabani
  • Qamar Rayees Khan
  • Ahmed A Abd El-Latif

Abstract

Social media platforms serve as communication tools where users freely share information regardless of its accuracy. Propaganda on these platforms refers to the dissemination of biased or deceptive information aimed at influencing public opinion, encompassing various forms such as political campaigns, fake news, and conspiracy theories. This study introduces a Hybrid Feature Engineering Approach for Propaganda Identification (HAPI), designed to detect propaganda in text-based content like news articles and social media posts. HAPI combines conventional feature engineering methods with machine learning techniques to achieve high accuracy in propaganda detection. This study is conducted on data collected from Twitter via its API, and an annotation scheme is proposed to categorize tweets into binary classes (propaganda and non-propaganda). Hybrid feature engineering entails the amalgamation of various features, including Term Frequency-Inverse Document Frequency (TF-IDF), Bag of Words (BoW), Sentimental features, and tweet length, among others. Multiple Machine Learning classifiers undergo training and evaluation utilizing the proposed methodology, leveraging a selection of 40 pertinent features identified through the hybrid feature selection technique. All the selected algorithms including Multinomial Naive Bayes (MNB), Support Vector Machine (SVM), Decision Tree (DT), and Logistic Regression (LR) achieved promising results. The SVM-based HaPi (SVM-HaPi) exhibits superior performance among traditional algorithms, achieving precision, recall, F-Measure, and overall accuracy of 0.69, 0.69, 0.69, and 69.2%, respectively. Furthermore, the proposed approach is compared to well-known existing approaches where it overperformed most of the studies on several evaluation metrics. This research contributes to the development of a comprehensive system tailored for propaganda identification in textual content. Nonetheless, the purview of propaganda detection transcends textual data alone. Deep learning algorithms like Artificial Neural Networks (ANN) offer the capability to manage multimodal data, incorporating text, images, audio, and video, thereby considering not only the content itself but also its presentation and contextual nuances during dissemination.

Suggested Citation

  • Akib Mohi Ud Din Khanday & Mudasir Ahmad Wani & Syed Tanzeel Rabani & Qamar Rayees Khan & Ahmed A Abd El-Latif, 2024. "HAPI: An efficient Hybrid Feature Engineering-based Approach for Propaganda Identification in social media," PLOS ONE, Public Library of Science, vol. 19(7), pages 1-29, July.
  • Handle: RePEc:plo:pone00:0302583
    DOI: 10.1371/journal.pone.0302583
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0302583
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0302583&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0302583?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. repec:cup:apsrev:v:21:y:1927:i:03:p:627-631_02 is not listed on IDEAS
    2. Gianpietro Mazzoleni & Roberta Bracciale, 2018. "Socially mediated populism: the communicative strategies of political leaders on Facebook," Palgrave Communications, Palgrave Macmillan, vol. 4(1), pages 1-10, December.
    3. Gerard Salton & Chris Buckley, 1990. "Improving retrieval performance by relevance feedback," Journal of the American Society for Information Science, Association for Information Science & Technology, vol. 41(4), pages 288-297, June.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Michel Zitt, 2015. "Meso-level retrieval: IR-bibliometrics interplay and hybrid citation-words methods in scientific fields delineation," Scientometrics, Springer;Akadémiai Kiadó, vol. 102(3), pages 2223-2245, March.
    2. Isabella Mingo & Maria Paola Faggiano, 2020. "Trust in Institutions Between Objective and Subjective Determinants: A Multilevel Analysis in European Countries," Social Indicators Research: An International and Interdisciplinary Journal for Quality-of-Life Measurement, Springer, vol. 151(3), pages 815-839, October.
    3. Fischer, Agneta & Brands, Charlotte & Abadi, David, 2019. "The Expression of Right-Wing Populism in the Netherlands across Facebook Posts," OSF Preprints 35puf, Center for Open Science.
    4. Sigurd Hilmo Lundheim & Giuseppe Pellegrini-Masini & Christian A. Klöckner & Stefan Geiss, 2022. "Developing a Theoretical Framework to Explain the Social Acceptability of Wind Energy," Energies, MDPI, vol. 15(14), pages 1-24, July.
    5. Zhixiang Chen & Bin Fu & John Abraham, 2010. "A quadratic lower bound for Rocchio’s similarity-based relevance feedback algorithm with a fixed query updating factor," Journal of Combinatorial Optimization, Springer, vol. 19(2), pages 134-157, February.
    6. Roland Graef & Mathias Klier & Kilian Kluge & Jan Felix Zolitschka, 2021. "Human-machine collaboration in online customer service – a long-term feedback-based approach," Electronic Markets, Springer;IIM University of St. Gallen, vol. 31(2), pages 319-341, June.
    7. Asim Roy & Patrick Mackin & Jyrki Wallenius & James Corner & Mark Keith & Gregory Schymik & Hina Arora, 2008. "An Interactive Search Method Based on User Preferences," Decision Analysis, INFORMS, vol. 5(4), pages 203-229, December.
    8. Mariam Daoud & Jimmy Xiangji Huang, 2013. "Modeling geographic, temporal, and proximity contexts for improving geotemporal search," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 64(1), pages 190-212, January.
    9. Veda C. Storey & Andrew Burton-Jones & Vijayan Sugumaran & Sandeep Purao, 2008. "CONQUER: A Methodology for Context-Aware Query Processing on the World Wide Web," Information Systems Research, INFORMS, vol. 19(1), pages 3-25, March.
    10. Hélder Prior, 2024. "Social media and the rise of radical right populism in Portugal: the communicative strategies of André Ventura on X in the 2022 elections," Palgrave Communications, Palgrave Macmillan, vol. 11(1), pages 1-10, December.
    11. Piergiuseppe Fortunato & Marco Pecoraro, 2022. "Social media, education, and the rise of populist Euroscepticism," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-13, December.
    12. repec:osf:osfxxx:35puf_v1 is not listed on IDEAS
    13. Yousif A. Alhaj & Abdelghani Dahou & Mohammed A. A. Al-qaness & Laith Abualigah & Aaqif Afzaal Abbasi & Nasser Ahmed Obad Almaweri & Mohamed Abd Elaziz & Robertas Damaševičius, 2022. "A Novel Text Classification Technique Using Improved Particle Swarm Optimization: A Case Study of Arabic Language," Future Internet, MDPI, vol. 14(7), pages 1-18, June.
    14. Hoppenbrouwers, J.J.A.C., 1998. "Advanced conceptual network usage in library database queries," Other publications TiSEM 711b739d-edc9-4f72-8fb1-2, Tilburg University, School of Economics and Management.
    15. Piergiuseppe Fortunato & Marco Pecoraro, 2020. "Yes, The Medium Matters: How Facebook and Twitter boost Populism in Europe," IRENE Working Papers 20-01, IRENE Institute of Economic Research.
    16. Mario Datts, 2020. "Social Media, Populism, and Migration," Media and Communication, Cogitatio Press, vol. 8(4), pages 73-83.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0302583. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.