IDEAS home Printed from https://ideas.repec.org/a/gam/jmathe/v11y2023i19p4106-d1250035.html
   My bibliography  Save this article

Research on a Hotel Collaborative Filtering Recommendation Algorithm Based on the Probabilistic Language Term Set

Author

Listed:
  • Erwei Wang

    (Zhuhai Campus, Beijing Institute of Technology, Zhuhai 519088, China)

  • Yingyin Chen

    (Faculty of Business Administration, University of Macau, Macau, China)

  • Yumin Li

    (Zhuhai Campus, Beijing Institute of Technology, Zhuhai 519088, China)

Abstract

In the face of problems such as information overload and the information cocoon resulting from big data, it is a key point of current research to solve the problem of semantic fuzziness of online reviews and improve the accuracy of personalized recommendation algorithms by using online reviews. Based on the advantage of the probabilistic language term set to deal with fuzzy information and the historical data of online hotel reviews, this paper proposes a collaborative filtering recommendation algorithm for hotels. Firstly, the text data of hotel online reviews are crawled by a crawler and processed by jieba and TF-IDF tools. Secondly, the hotel evaluation attribute set is constructed, and the sentiment analysis of the review statements is carried out with the help of the HowNet sentiment dictionary and manual annotation method. The probabilistic language term set is used to classify the data and derive statistics, and the maximum deviation method is used to determine the weight of each attribute. Then, the cosine similarity formula is fused with the modified cosine similarity formula to calculate the similarity and construct the decision matrix. Finally, combined with the historical data of the user’s hotel selection, the hotel recommendation results are generated. This paper collected review data from 10 hotels in Macau from the official “Ctrip” website. The proposed recommendation algorithm model was then applied to process and analyze the data, resulting in the generation of a ranked list of hotel recommendations. To validate the accuracy and effectiveness of this research, the recommendation results were compared with those produced by other algorithms.

Suggested Citation

  • Erwei Wang & Yingyin Chen & Yumin Li, 2023. "Research on a Hotel Collaborative Filtering Recommendation Algorithm Based on the Probabilistic Language Term Set," Mathematics, MDPI, vol. 11(19), pages 1-15, September.
  • Handle: RePEc:gam:jmathe:v:11:y:2023:i:19:p:4106-:d:1250035
    as

    Download full text from publisher

    File URL: https://www.mdpi.com/2227-7390/11/19/4106/pdf
    Download Restriction: no

    File URL: https://www.mdpi.com/2227-7390/11/19/4106/
    Download Restriction: no
    ---><---

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:gam:jmathe:v:11:y:2023:i:19:p:4106-:d:1250035. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    We have no bibliographic references for this item. You can help adding them by using this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: MDPI Indexing Manager (email available below). General contact details of provider: https://www.mdpi.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.