IDEAS home Printed from https://ideas.repec.org/a/wsi/jikmxx/v19y2020i01ns0219649220400183.html
   My bibliography  Save this article

Application of Support Vector Machine for Arabic Sentiment Classification Using Twitter-Based Dataset

Author

Listed:
  • Sarah N. Alyami

    (College of Computer Science and Information Technology, Community College Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia)

  • Sunday O. Olatunji

    (College of Computer Science and Information Technology, Imam Abdulrahman Bin Faisal University, Dammam, Saudi Arabia)

Abstract

Sentiment classification is the process of classifying emotions and opinions in texts. In this study, the problem of Arabic sentiment analysis was addressed. A support vector machine (SVM) model was proposed to classify opinions in Arabic micro-texts as being positive or negative. To evaluate the performance of the SVM model, a dataset was built from tweets discussing several social issues in Saudi Arabia. These issues include changes that were implemented by the country as part of a newly established vision, known as Saudi Arabia Vision 2030. The constructed dataset was manually annotated according to the sentiment conveyed in the text. To achieve the best sentiment classification accuracy, several procedures were implemented within the proposed framework including light stemming, feature extraction (Ngrams, emoji and tweet-topic features), parameter optimisation and feature-set reduction. The experimental results revealed excellent outcomes. An accuracy of 89.83% was achieved using the proposed SVM model.

Suggested Citation

  • Sarah N. Alyami & Sunday O. Olatunji, 2020. "Application of Support Vector Machine for Arabic Sentiment Classification Using Twitter-Based Dataset," Journal of Information & Knowledge Management (JIKM), World Scientific Publishing Co. Pte. Ltd., vol. 19(01), pages 1-13, April.
  • Handle: RePEc:wsi:jikmxx:v:19:y:2020:i:01:n:s0219649220400183
    DOI: 10.1142/S0219649220400183
    as

    Download full text from publisher

    File URL: https://www.worldscientific.com/doi/abs/10.1142/S0219649220400183
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219649220400183?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Wang, Haifeng & Zheng, Bichen & Yoon, Sang Won & Ko, Hoo Sang, 2018. "A support vector machine-based ensemble algorithm for breast cancer diagnosis," European Journal of Operational Research, Elsevier, vol. 267(2), pages 687-699.
    2. Mohammed Rushdi-Saleh & M. Teresa Martín-Valdivia & L. Alfonso Ureña-López & José M. Perea-Ortega, 2011. "OCA: Opinion corpus for Arabic," Journal of the Association for Information Science & Technology, Association for Information Science & Technology, vol. 62(10), pages 2045-2054, October.
    3. Petra Kralj Novak & Jasmina Smailović & Borut Sluban & Igor Mozetič, 2015. "Sentiment of Emojis," PLOS ONE, Public Library of Science, vol. 10(12), pages 1-22, December.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Astorino, Annabella & Avolio, Matteo & Fuduli, Antonio, 2022. "A maximum-margin multisphere approach for binary Multiple Instance Learning," European Journal of Operational Research, Elsevier, vol. 299(2), pages 642-652.
    2. Meshwa Rameshbhai Savalia & Jaiprakash Vinodkumar Verma, 2023. "Classifying Malignant and Benign Tumors of Breast Cancer: A Comparative Investigation Using Machine Learning Techniques," International Journal of Reliable and Quality E-Healthcare (IJRQEH), IGI Global, vol. 12(1), pages 1-19, January.
    3. Onur Demiray & Evrim D. Gunes & Ercan Kulak & Emrah Dogan & Seyma Gorcin Karaketir & Serap Cifcili & Mehmet Akman & Sibel Sakarya, 2023. "Classification of patients with chronic disease by activation level using machine learning methods," Health Care Management Science, Springer, vol. 26(4), pages 626-650, December.
    4. Blanquero, R. & Carrizosa, E. & Jiménez-Cordero, A. & Martín-Barragán, B., 2019. "Functional-bandwidth kernel for Support Vector Machine with Functional Data: An alternating optimization algorithm," European Journal of Operational Research, Elsevier, vol. 275(1), pages 195-207.
    5. Anand, Abhinav & Pathak, Jalaj, 2022. "The role of Reddit in the GameStop short squeeze," Economics Letters, Elsevier, vol. 211(C).
    6. Carlos Henríquez Miranda & German Sanchez-Torres & Dixon Salcedo, 2023. "Exploring the Evolution of Sentiment in Spanish Pandemic Tweets: A Data Analysis Based on a Fine-Tuned BERT Architecture," Data, MDPI, vol. 8(6), pages 1-18, May.
    7. Oriol J. Bosch & Melanie Revilla, 2021. "Using emojis in mobile web surveys for Millennials? A study in Spain and Mexico," Quality & Quantity: International Journal of Methodology, Springer, vol. 55(1), pages 39-61, February.
    8. Das, Gopal & Wiener, Hillary J.D. & Kareklas, Ioannis, 2019. "To emoji or not to emoji? Examining the influence of emoji on consumer reactions to advertising," Journal of Business Research, Elsevier, vol. 96(C), pages 147-156.
    9. Ganga S. Urumutta Hewage & Yue Liu & Ze Wang & Huifang Mao, 2021. "Consumer responses toward symmetric versus asymmetric facial expression emojis," Marketing Letters, Springer, vol. 32(2), pages 219-230, June.
    10. Christina Bannier & Thomas Pauls & Andreas Walter, 2019. "Content analysis of business communication: introducing a German dictionary," Journal of Business Economics, Springer, vol. 89(1), pages 79-123, February.
    11. Liang, Xijun & Zhang, Zhipeng & Song, Yunquan & Jian, Ling, 2022. "Kernel-based online regression with canal loss," European Journal of Operational Research, Elsevier, vol. 297(1), pages 268-279.
    12. Kamyab Karimi & Ali Ghodratnama & Reza Tavakkoli-Moghaddam, 2023. "Two new feature selection methods based on learn-heuristic techniques for breast cancer prediction: a comprehensive analysis," Annals of Operations Research, Springer, vol. 328(1), pages 665-700, September.
    13. Klostermann, Jan & Plumeyer, Anja & Böger, Daniel & Decker, Reinhold, 2018. "Extracting brand information from social networks: Integrating image, text, and social tagging data," International Journal of Research in Marketing, Elsevier, vol. 35(4), pages 538-556.
    14. Che Xu & Wenjun Chang & Weiyong Liu, 2023. "Data-driven decision model based on local two-stage weighted ensemble learning," Annals of Operations Research, Springer, vol. 325(2), pages 995-1028, June.
    15. Jennifer Fane & Colin MacDougall & Jessie Jovanovic & Gerry Redmond & Lisa Gibbs, 2020. "Preschool Aged Children’s Accounts of their Own Wellbeing: are Current Wellbeing Indicators Applicable to Young Children?," Child Indicators Research, Springer;The International Society of Child Indicators (ISCI), vol. 13(6), pages 1893-1920, December.
    16. Adolfina Pérez Garcias & Gemma Tur & Antònia Darder Mesquida & Victoria I. Marín, 2020. "Reflexive Skills in Teacher Education: A Tweet a Week," Sustainability, MDPI, vol. 12(8), pages 1-18, April.
    17. Li, Yanying & Che, Jinxing & Yang, Youlong, 2018. "Subsampled support vector regression ensemble for short term electric load forecasting," Energy, Elsevier, vol. 164(C), pages 160-170.
    18. Jiamin Pei & Le Cheng, 2022. "Deciphering emoji variation in courts: a social semiotic perspective," Palgrave Communications, Palgrave Macmillan, vol. 9(1), pages 1-8, December.
    19. Chen, Weiyi & Zhang, Limao, 2022. "An automated machine learning approach for earthquake casualty rate and economic loss prediction," Reliability Engineering and System Safety, Elsevier, vol. 225(C).
    20. Béatrice BOULU-RESHEF & Catherine BRUNEAU & Maxime NICOLAS & Thomas RENAULT, 2022. "An Experimental Analysis of Investor Sentiment," LEO Working Papers / DR LEO 2940, Orleans Economics Laboratory / Laboratoire d'Economie d'Orleans (LEO), University of Orleans.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:jikmxx:v:19:y:2020:i:01:n:s0219649220400183. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/jikm/jikm.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.