IDEAS home Printed from https://ideas.repec.org/a/gam/jsusta/v10y2018i9p3313-d170239.html
   My bibliography  Save this article

Tourism Review Sentiment Classification Using a Bidirectional Recurrent Neural Network with an Attention Mechanism and Topic-Enriched Word Vectors

Author

Listed:
  • Qin Li

    (Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu 610041, China
    University of Chinese Academy of Sciences, Beijing 100049, China)

  • Shaobo Li

    (School of Mechanical Engineering, Guizhou University, Guiyang 550025, China
    Guizhou Provincial Key Laboratory of Public Big Data, Guizhou University, Guiyang 550025, China)

  • Jie Hu

    (School of Mechanical Engineering, Guizhou University, Guiyang 550025, China)

  • Sen Zhang

    (Chengdu Institute of Computer Application, Chinese Academy of Sciences, Chengdu 610041, China
    University of Chinese Academy of Sciences, Beijing 100049, China)

  • Jianjun Hu

    (School of Mechanical Engineering, Guizhou University, Guiyang 550025, China
    Department of Computer Science and Engineering, University of South Carolina, Columbia, SC 29208, USA)

Abstract

Sentiment analysis of online tourist reviews is playing an increasingly important role in tourism. Accurately capturing the attitudes of tourists regarding different aspects of the scenic sites or the overall polarity of their online reviews is key to tourism analysis and application. However, the performances of current document sentiment analysis methods are not satisfactory as they either neglect the topics of the document or do not consider that not all words contribute equally to the meaning of the text. In this work, we propose a bidirectional gated recurrent unit neural network model (BiGRULA) for sentiment analysis by combining a topic model (lda2vec) and an attention mechanism. Lda2vec is used to discover all the main topics of review corpus, which are then used to enrich the word vector representation of words with context. The attention mechanism is used to learn to attribute different weights of the words to the overall meaning of the text. Experiments over 20 NewsGroup and IMDB datasets demonstrate the effectiveness of our model. Furthermore, we applied our model to hotel review data analysis, which allows us to get more coherent topics from these reviews and achieve good performance in sentiment classification.

Suggested Citation

  • Qin Li & Shaobo Li & Jie Hu & Sen Zhang & Jianjun Hu, 2018. "Tourism Review Sentiment Classification Using a Bidirectional Recurrent Neural Network with an Attention Mechanism and Topic-Enriched Word Vectors," Sustainability, MDPI, vol. 10(9), pages 1-15, September.
  • Handle: RePEc:gam:jsusta:v:10:y:2018:i:9:p:3313-:d:170239
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2071-1050/10/9/3313/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2071-1050/10/9/3313/
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Gang Ren & Taeho Hong, 2017. "Investigating Online Destination Images Using a Topic-Based Sentiment Analysis Approach," Sustainability, MDPI, vol. 9(10), pages 1-18, September.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Seungju Nam & Hyun Cheol Lee, 2019. "A Text Analytics-Based Importance Performance Analysis and Its Application to Airline Service," Sustainability, MDPI, vol. 11(21), pages 1-24, November.
    2. Sergei Mikhailov & Alexey Kashevnik, 2020. "Tourist Behaviour Analysis Based on Digital Pattern of Life—An Approach and Case Study," Future Internet, MDPI, vol. 12(10), pages 1-16, September.
    3. Yuguo Tao & Feng Zhang & Chunyun Shi & Yun Chen, 2019. "Social Media Data-Based Sentiment Analysis of Tourists’ Air Quality Perceptions," Sustainability, MDPI, vol. 11(18), pages 1-23, September.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Berny Carrera & Jae-Yoon Jung, 2018. "SentiFlow: An Information Diffusion Process Discovery Based on Topic and Sentiment from Online Social Networks," Sustainability, MDPI, vol. 10(8), pages 1-16, August.
    2. Ziye Shang & Jian Ming Luo, 2022. "Topic Modeling for Hiking Trail Online Reviews: Analysis of the Mutianyu Great Wall," Sustainability, MDPI, vol. 14(6), pages 1-16, March.
    3. Chen Liu & Li Tang & Wei Shan, 2018. "An Extended HITS Algorithm on Bipartite Network for Features Extraction of Online Customer Reviews," Sustainability, MDPI, vol. 10(5), pages 1-15, May.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jsusta:v:10:y:2018:i:9:p:3313-:d:170239. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.