Printed from https://ideas.repec.org/a/gam/jsusta/v15y2023i16p12539-d1219822.html

Hybrid Feature Extraction for Multi-Label Emotion Classification in English Text Messages

Author

Listed:
  • Zahra Ahanin

    (Department of Information Systems, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia)

  • Maizatul Akmar Ismail

    (Department of Information Systems, Faculty of Computer Science and Information Technology, University of Malaya, Kuala Lumpur 50603, Malaysia)

  • Narinderjit Singh Sawaran Singh

    (Faculty of Data Science and Information Technology, INTI International University, Nilai 71800, Malaysia)

  • Ammar AL-Ashmori

    (Department of Computer and Information Sciences, University Technology PETRONAS, Seri Iskandar 32610, Malaysia)

Abstract

Emotions are vital for identifying an individual’s attitude and mental condition. Detecting and classifying emotions in Natural Language Processing applications can improve Human–Computer Interaction systems, leading to effective decision making in organizations. Several studies on emotion classification have employed word embedding as a feature extraction method, but they do not consider the sentiment polarity of words. Moreover, relying exclusively on deep learning models to extract linguistic features may result in misclassifications when the training dataset is small. In this paper, we present a hybrid feature extraction model that combines human-engineered features with deep-learning-based features for emotion classification in English text. The proposed model uses data augmentation, captures contextual information, integrates knowledge from lexical resources, and employs deep learning models, including Bidirectional Long Short-Term Memory (Bi-LSTM) and Bidirectional Encoder Representations from Transformers (BERT), to address the issues mentioned above. The proposed model with hybrid features attained the highest Jaccard accuracy on two of the benchmark datasets, with 68.40% on SemEval-2018 and 53.45% on the GoEmotions dataset. The results show the significance of the proposed technique, and we can conclude that the incorporation of the hybrid features improves the performance of the baseline models.
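
The Jaccard accuracy reported above is a standard multi-label metric: for each message, the size of the intersection of the gold and predicted label sets divided by the size of their union, averaged over all messages. The following is a minimal sketch of that calculation, not the authors' evaluation code. The function name, the example labels, and the convention of scoring a message as 1.0 when both label sets are empty are assumptions made for illustration.

```python
from typing import List, Set


def jaccard_accuracy(gold: List[Set[str]], pred: List[Set[str]]) -> float:
    """Sample-averaged Jaccard index for multi-label predictions.

    Assumption: a message whose gold and predicted label sets are both
    empty counts as a perfect match (score 1.0).
    """
    if len(gold) != len(pred):
        raise ValueError("gold and pred must have the same length")
    if not gold:
        raise ValueError("at least one example is required")

    total = 0.0
    for g, p in zip(gold, pred):
        union = g | p
        # Intersection over union for this message's label sets.
        total += 1.0 if not union else len(g & p) / len(union)
    return total / len(gold)


# Hypothetical labels for three messages (not taken from the datasets).
gold = [{"joy", "optimism"}, {"anger"}, set()]
pred = [{"joy"}, {"anger", "disgust"}, set()]
score = jaccard_accuracy(gold, pred)
print(f"Jaccard accuracy: {score:.4f}")
```

In this toy example the per-message scores are 1/2, 1/2, and 1, so the average is 2/3. Unlike exact-match accuracy, the metric gives partial credit when only some of a message's emotions are predicted correctly.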

Suggested Citation

  • Zahra Ahanin & Maizatul Akmar Ismail & Narinderjit Singh Sawaran Singh & Ammar AL-Ashmori, 2023. "Hybrid Feature Extraction for Multi-Label Emotion Classification in English Text Messages," Sustainability, MDPI, vol. 15(16), pages 1-24, August.
  • Handle: RePEc:gam:jsusta:v:15:y:2023:i:16:p:12539-:d:1219822

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/15/16/12539/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/15/16/12539/
    Download Restriction: no

    References listed on IDEAS

    1. Anwer Mustafa Hilal & Dalia H. Elkamchouchi & Saud S. Alotaibi & Mohammed Maray & Mahmoud Othman & Amgad Atta Abdelmageed & Abu Sarwar Zamani & Mohamed I. Eldesouki, 2022. "Manta Ray Foraging Optimization with Transfer Learning Driven Facial Emotion Recognition," Sustainability, MDPI, vol. 14(21), pages 1-18, November.
    2. Hassan Adamu & Syaheerah Lebai Lutfi & Nurul Hashimah Ahamed Hassain Malim & Rohail Hassan & Assunta Di Vaio & Ahmad Sufril Azlan Mohamed, 2021. "Framing Twitter Public Sentiment on Nigerian Government COVID-19 Palliatives Distribution Using Machine Learning," Sustainability, MDPI, vol. 13(6), pages 1-21, March.
    3. Yahe Huang & Dongying Bo, 2023. "Emotion Classification and Achievement of Students in Distance Learning Based on the Knowledge State Model," Sustainability, MDPI, vol. 15(3), pages 1-14, January.
    4. Xudong Zhang & Zejun Yan & Qianfeng Wu & Ke Wang & Kelei Miao & Zhangquan Wang & Yourong Chen, 2023. "Community Governance Based on Sentiment Analysis: Towards Sustainable Management and Development," Sustainability, MDPI, vol. 15(3), pages 1-17, February.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Richard J. Butler, 2023. "Isaiah’s Structure from Random Forest Regression Analysis," Asian Culture and History, Canadian Center of Science and Education, vol. 15(1), pages 1-34, June.
    2. Ahmad Ali Jan & Fong-Woon Lai & Muhammad Umar Draz & Muhammad Tahir & Syed Emad Azhar Ali & Muhammad Zahid & Muhammad Kashif Shad, 2022. "Integrating sustainability practices into islamic corporate governance for sustainable firm performance: from the lens of agency and stakeholder theories," Quality & Quantity: International Journal of Methodology, Springer, vol. 56(5), pages 2989-3012, October.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.