IDEAS home Printed from https://ideas.repec.org/a/wsi/ijitdm/v22y2023i01ns0219622022500584.html
   My bibliography  Save this article

Opinion Mining Using Enriched Joint Sentiment-Topic Model

Author

Listed:
  • Amjad Osmani

    (Department of Computer Engineering, Islamic Azad University, Urmia Branch, Urmia, Iran)

  • Jamshid Bagherzadeh Mohasefi

    (��Department of Computer Engineering, Urmia University, Urmia, Iran)

Abstract

Sentiment analysis has the potential to significantly impact several fields, such as trade, politics, and opinion extraction. Topic modeling is an intriguing concept used in emotion detection. Latent Dirichlet Allocation is an important algorithm in this subject. It investigates the semantic associations between terms in a text document and takes into account the influence of a subject on a word. Joint Sentiment-Topic model is a framework based on Latent Dirichlet Allocation method that investigates the influence of subjects and emotions on words. The emotion parameter is insufficient, and additional factors may be valuable in performance enhancement. This study presents two novel topic models that extend and improve Joint Sentiment-Topic model through a new parameter (the author’s view). The proposed methods care about the author’s inherent characteristics, which is the most important factor in writing a comment. The proposed models consider the effect of the author’s view on words in a text document. The author’s view means that the author creates an opinion in his mind about a product/thing before selecting the words for expressing the opinion. The new parameter has an immense effect on model accuracy regarding evaluation results. The first proposed method is author’s View-based Joint Sentiment-Topic model for Multi-domain. According to the evaluation results, the highest accuracy value in the first method is equal to 85%. It also has a lower perplexity value than other methods. The second proposed method is Author’s View-based Joint Sentiment-Topic model for Single-domain. According to the evaluation results, it achieves the highest accuracy with 95%. The proposed methods perform better than baseline methods with different topic number settings, especially the second method with 95% accuracy. The second method is a version of the first one, which outperforms baseline methods in terms of accuracy. These results demonstrate that the parameter of the author’s view improves sentiment classification at the document level. While not requiring labeled data, the proposed methods are more accurate than discriminative models such as Support Vector Machine (SVM) and logistic regression, based on the evaluation section’s outcomes. The proposed methods are simple with a low number of parameters. While providing a broad perception of connections between different words in documents of a single collection (single-domain) or multiple collections (multi-domain), the proposed methods have prepared solutions for two different situations (single-domain and multi-domain). The first proposed method is suitable for multi-domain datasets, but the second proposed method is suitable for single-domain datasets. While detecting emotion at the document level, the proposed models improve evaluation results compared to the baseline models. Eight datasets with different sizes have been used in implementations. For evaluations, this study uses sentiment analysis at the document level, perplexity, and topic coherency. Also, to see if the outcomes of the suggested models are statistically different from those of other algorithms, the Friedman test, a statistical analysis, is employed.

Suggested Citation

  • Amjad Osmani & Jamshid Bagherzadeh Mohasefi, 2023. "Opinion Mining Using Enriched Joint Sentiment-Topic Model," International Journal of Information Technology & Decision Making (IJITDM), World Scientific Publishing Co. Pte. Ltd., vol. 22(01), pages 313-375, January.
  • Handle: RePEc:wsi:ijitdm:v:22:y:2023:i:01:n:s0219622022500584
    DOI: 10.1142/S0219622022500584
    as

    Download full text from publisher

    File URL: http://www.worldscientific.com/doi/abs/10.1142/S0219622022500584
    Download Restriction: Access to full text is restricted to subscribers

    File URL: https://libkey.io/10.1142/S0219622022500584?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:wsi:ijitdm:v:22:y:2023:i:01:n:s0219622022500584. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Tai Tone Lim (email available below). General contact details of provider: http://www.worldscinet.com/ijitdm/ijitdm.shtml .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.