IDEAS home Printed from https://ideas.repec.org/a/wsi/jikmxx/v19y2020i01ns0219649220400183.html
   My bibliography  Save this article

Application of Support Vector Machine for Arabic Sentiment Classification Using Twitter-Based Dataset

Author

Listed:
  • Sarah N. Alyami

    (College of Computer Science and Information Technology, Community College Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia)

  • Sunday O. Olatunji

    (College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia)

Abstract

Sentiment classification is the process of classifying emotions and opinions in texts. In this study, the problem of Arabic sentiment analysis was addressed. A support vector machine (SVM) model was proposed to classify opinions in Arabic micro-texts as being positive or negative. To evaluate the performance of the SVM model, a dataset was built from tweets discussing several social issues in Saudi Arabia. These issues include changes that were implemented by the country as part of a newly established vision, known as Saudi Arabia Vision 2030. The constructed dataset was manually annotated according to the sentiment conveyed in the text. To achieve the best sentiment classification accuracy, several procedures were implemented within the proposed framework including light stemming, feature extraction (Ngrams, emoji and tweet-topic features), parameter optimisation and feature-set reduction. The experimental results revealed excellent outcomes. An accuracy of 89.83% was achieved using the proposed SVM model.

Suggested Citation

  • Sarah N. Alyami & Sunday O. Olatunji, 2020. "Application of Support Vector Machine for Arabic Sentiment Classification Using Twitter-Based Dataset," Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 19(01), pages 1-13, April.
  • Handle: RePEc:wsi:jikmxx:v:19:y:2020:i:01:n:s0219649220400183
    DOI: 10.1142/S0219649220400183
    as

    Download full text from publisher

    File URL: https://www.worldscientific.com/doi/abs/10.1142/S0219649220400183
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219649220400183?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wang, Haifeng & Zheng, Bichen & Yoon, Sang Won & Ko, Hoo Sang, 2018. "A support vector machine-based ensemble algorithm for breast cancer diagnosis," European Journal of Operational Research, Elsevier, vol. 267(2), pages 687-699.
    2. Mohammed Rushdi-Saleh & M. Teresa Martín-Valdivia & L. Alfonso Ureña-López & José M. Perea-Ortega, 2011. "OCA: Opinion corpus for Arabic," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 62(10), pages 2045-2054, October.
    3. Petra Kralj Novak & Jasmina Smailović & Borut Sluban & Igor Mozetič, 2015. "Sentiment of Emojis," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-22, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Béatrice BOULU-RESHEF & Catherine BRUNEAU & Maxime NICOLAS & Thomas RENAULT, 2022. "An Experimental Analysis of Investor Sentiment," LEO Working Papers / DR LEO 2940, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.
    2. Abdur Rasool & Chayut Bunterngchit & Luo Tiejian & Md. Ruhul Islam & Qiang Qu & Qingshan Jiang, 2022. "Improved Machine Learning-Based Predictive Models for Breast Cancer Diagnosis," IJERPH, MDPI, vol. 19(6), pages 1-19, March.
    3. Sumayh S. Aljameel & Dina A. Alabbad & Norah A. Alzahrani & Shouq M. Alqarni & Fatimah A. Alamoudi & Lana M. Babili & Somiah K. Aljaafary & Fatima M. Alshamrani, 2020. "A Sentiment Analysis Approach to Predict an Individual’s Awareness of the Precautionary Procedures to Prevent COVID-19 Outbreaks in Saudi Arabia," IJERPH, MDPI, vol. 18(1), pages 1-12, December.
    4. Joanna Błajda & Edyta Barnaś & Anna Kucab, 2022. "Application of Personalized Education in the Mobile Medical App for Breast Self-Examination," IJERPH, MDPI, vol. 19(8), pages 1-21, April.
    5. Tiziana CARPI & Airo HINO & Stefano Maria IACUS & Giuseppe PORRO, 2022. "A Japanese Subjective Well-Being Indicator Based on Twitter Data [‘Collective Smile: Measuring Societal Happiness from Geolocated Images’]," Social Science Japan Journal, University of Tokyo and Oxford University Press, vol. 25(2), pages 273-296.
    6. Astorino, Annabella & Avolio, Matteo & Fuduli, Antonio, 2022. "A maximum-margin multisphere approach for binary Multiple Instance Learning," European Journal of Operational Research, Elsevier, vol. 299(2), pages 642-652.
    7. Meshwa Rameshbhai Savalia & Jaiprakash Vinodkumar Verma, 2023. "Classifying Malignant and Benign Tumors of Breast Cancer: A Comparative Investigation Using Machine Learning Techniques," International Journal of Reliable and Quality E-Healthcare (IJRQEH), IGI Global, vol. 12(1), pages 1-19, January.
    8. Baldomero-Naranjo, Marta & Martínez-Merino, Luisa I. & Rodríguez-Chía, Antonio M., 2020. "Tightening big Ms in integer programming formulations for support vector machines with ramp loss," European Journal of Operational Research, Elsevier, vol. 286(1), pages 84-100.
    9. Onur Demiray & Evrim D. Gunes & Ercan Kulak & Emrah Dogan & Seyma Gorcin Karaketir & Serap Cifcili & Mehmet Akman & Sibel Sakarya, 2023. "Classification of patients with chronic disease by activation level using machine learning methods," Health Care Management Science, Springer, vol. 26(4), pages 626-650, December.
    10. Blanquero, R. & Carrizosa, E. & Jiménez-Cordero, A. & Martín-Barragán, B., 2019. "Functional-bandwidth kernel for Support Vector Machine with Functional Data: An alternating optimization algorithm," European Journal of Operational Research, Elsevier, vol. 275(1), pages 195-207.
    11. Anand, Abhinav & Pathak, Jalaj, 2022. "The role of Reddit in the GameStop short squeeze," Economics Letters, Elsevier, vol. 211(C).
    12. Carlos Henríquez Miranda & German Sanchez-Torres & Dixon Salcedo, 2023. "Exploring the Evolution of Sentiment in Spanish Pandemic Tweets: A Data Analysis Based on a Fine-Tuned BERT Architecture," Data, MDPI, vol. 8(6), pages 1-18, May.
    13. P. K. Viswanathan & Sandeep Srivathsan & Wayne L. Winston, 2022. "Multiclass Discriminant Analysis using Ensemble Technique: Case Illustration from the Banking Industry," Journal of Emerging Market Finance, Institute for Financial Management and Research, vol. 21(1), pages 92-115, March.
    14. Oriol J. Bosch & Melanie Revilla, 2021. "Using emojis in mobile web surveys for Millennials? A study in Spain and Mexico," Quality & Quantity: International Journal of Methodology, Springer, vol. 55(1), pages 39-61, February.
    15. Das, Gopal & Wiener, Hillary J.D. & Kareklas, Ioannis, 2019. "To emoji or not to emoji? Examining the influence of emoji on consumer reactions to advertising," Journal of Business Research, Elsevier, vol. 96(C), pages 147-156.
    16. Ganga S. Urumutta Hewage & Yue Liu & Ze Wang & Huifang Mao, 2021. "Consumer responses toward symmetric versus asymmetric facial expression emojis," Marketing Letters, Springer, vol. 32(2), pages 219-230, June.
    17. Ebaa Fayyoumi & Sahar Idwan, 2021. "Semantic Partitioning and Machine Learning in Sentiment Analysis," Data, MDPI, vol. 6(6), pages 1-17, June.
    18. Christina Bannier & Thomas Pauls & Andreas Walter, 2019. "Content analysis of business communication: introducing a German dictionary," Journal of Business Economics, Springer, vol. 89(1), pages 79-123, February.
    19. Benhoumane Ahmed, 2020. "What Makes You "Like", "Retweet" or "Comment" a Fundraising Content on Social Media? Exploring the Characteristics of Fundraising Messages on Social Networks [Qu'est-c," Post-Print hal-03390922, HAL.
    20. Golmohammadi, Davood & Zhao, Lingyu & Dreyfus, David, 2023. "Using machine learning techniques to reduce uncertainty for outpatient appointment scheduling practices in outpatient clinics," Omega, Elsevier, vol. 120(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:jikmxx:v:19:y:2020:i:01:n:s0219649220400183. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/jikm/jikm.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.