IDEAS home Printed from https://ideas.repec.org/a/sae/sagope/v10y2020i4p2158244020983316.html
   My bibliography  Save this article

Using XGBoost and Skip-Gram Model to Predict Online Review Popularity

Author

Listed:
  • Lien Thi Kim Nguyen
  • Hao-Hsuan Chung
  • Kristine Velasquez Tuliao
  • Tom M. Y. Lin

Abstract

Review popularity is similar to awareness and information accessibility components: Both have a profound effect on customer purchase decisions. Therefore, this study proposes a new method for predicting online review popularity that combines the extreme gradient boosting tree algorithm (XGBoost), to extract key features on the bases of ranking scores and the skip-gram model, which can subsequently identify semantic words according to key textual terms. Findings revealed that written reviews had higher review popularity than non-textual reviews (reviewer and product factors). Moreover, the proposed method achieved higher prediction accuracy than the traditional ridge regression technique of Root Mean Squared Logarithmic Error (RMSLE). The main factors affecting review popularity and key reviewers for specific textual terms were also identified. Findings could help vendors identify key influencers for their product promotion and then support the design of word-suggestion systems for online reviews.

Suggested Citation

  • Lien Thi Kim Nguyen & Hao-Hsuan Chung & Kristine Velasquez Tuliao & Tom M. Y. Lin, 2020. "Using XGBoost and Skip-Gram Model to Predict Online Review Popularity," SAGE Open, , vol. 10(4), pages 21582440209, December.
  • Handle: RePEc:sae:sagope:v:10:y:2020:i:4:p:2158244020983316
    DOI: 10.1177/2158244020983316
    as

    Download full text from publisher

    File URL: https://journals.sagepub.com/doi/10.1177/2158244020983316
    Download Restriction: no

    File URL: https://libkey.io/10.1177/2158244020983316?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Wang, Fang & Karimi, Sahar, 2019. "This product works well (for me): The impact of first-person singular pronouns on online review helpfulness," Journal of Business Research, Elsevier, vol. 104(C), pages 283-294.
    2. Brian S. Butler & Xiaoqing Wang, 2012. "The Cross-Purposes of Cross-Posting: Boundary Reshaping Behavior in Online Discussion Communities," Information Systems Research, INFORMS, vol. 23(3-part-2), pages 993-1010, September.
    3. Liu, Zhiwei & Park, Sangwon, 2015. "What makes a useful online review? Implication for travel product websites," Tourism Management, Elsevier, vol. 47(C), pages 140-151.
    4. Nanda Kumar & Izak Benbasat, 2006. "Research Note: The Influence of Recommendations and Consumer Reviews on Evaluations of Websites," Information Systems Research, INFORMS, vol. 17(4), pages 425-439, December.
    5. Yi Zhao & Sha Yang & Vishal Narayan & Ying Zhao, 2013. "Modeling Consumer Learning from Online Product Reviews," Marketing Science, INFORMS, vol. 32(1), pages 153-169, May.
    6. Zablocki, Agnieszka & Makri, Katerina & Houston, Michael J., 2019. "Emotions Within Online Reviews and their Influence on Product Attitudes in Austria, USA and Thailand," Journal of Interactive Marketing, Elsevier, vol. 46(C), pages 20-39.
    7. David Godes & Dina Mayzlin, 2004. "Using Online Conversations to Study Word-of-Mouth Communication," Marketing Science, INFORMS, vol. 23(4), pages 545-560, June.
    8. Srivastava, Vartika & Kalro, Arti D., 2019. "Enhancing the Helpfulness of Online Consumer Reviews: The Role of Latent (Content) Factors," Journal of Interactive Marketing, Elsevier, vol. 48(C), pages 33-50.
    9. Cheng, Yi-Hsiu & Ho, Hui-Yi, 2015. "Social influence's impact on reader perceptions of online reviews," Journal of Business Research, Elsevier, vol. 68(4), pages 883-887.
    10. Chris Forman & Anindya Ghose & Batia Wiesenfeld, 2008. "Examining the Relationship Between Reviews and Sales: The Role of Reviewer Identity Disclosure in Electronic Markets," Information Systems Research, INFORMS, vol. 19(3), pages 291-313, September.
    11. Anindya Ghose & Panagiotis G. Ipeirotis & Beibei Li, 2012. "Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowdsourced Content," Marketing Science, INFORMS, vol. 31(3), pages 493-520, May.
    12. Paul A. Pavlou & Angelika Dimoka, 2006. "The Nature and Role of Feedback Text Comments in Online Marketplaces: Implications for Trust Building, Price Premiums, and Seller Differentiation," Information Systems Research, INFORMS, vol. 17(4), pages 392-414, December.
    13. Park, Sangwon & Nicolau, Juan L., 2015. "Asymmetric effects of online consumer reviews," Annals of Tourism Research, Elsevier, vol. 50(C), pages 67-83.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Moradi, Masoud & Dass, Mayukh & Kumar, Piyush, 2023. "Differential effects of analytical versus emotional rhetorical style on review helpfulness," Journal of Business Research, Elsevier, vol. 154(C).
    2. Xian Wang & Huixian Li & Qingyi Wang & Alison Noble, 2023. "Consumers’ Concerns Regarding Product Quality: Evidence From Chinese Online Reviews," SAGE Open, , vol. 13(1), pages 21582440231, March.
    3. Zheng, Lili, 2021. "The classification of online consumer reviews: A systematic literature review and integrative framework," Journal of Business Research, Elsevier, vol. 135(C), pages 226-251.
    4. Lutz, Bernhard & Pröllochs, Nicolas & Neumann, Dirk, 2022. "Are longer reviews always more helpful? Disentangling the interplay between review length and line of argumentation," Journal of Business Research, Elsevier, vol. 144(C), pages 888-901.
    5. Khim-Yong Goh & Cheng-Suang Heng & Zhijie Lin, 2013. "Social Media Brand Community and Consumer Behavior: Quantifying the Relative Impact of User- and Marketer-Generated Content," Information Systems Research, INFORMS, vol. 24(1), pages 88-107, March.
    6. Angela Aerry Choi & Daegon Cho & Dobin Yim & Jae Yun Moon & Wonseok Oh, 2019. "When Seeing Helps Believing: The Interactive Effects of Previews and Reviews on E-Book Purchases," Information Systems Research, INFORMS, vol. 30(4), pages 1164-1183, December.
    7. Guo, Yue & Barnes, Stuart J. & Jia, Qiong, 2017. "Mining meaning from online ratings and reviews: Tourist satisfaction analysis using latent dirichlet allocation," Tourism Management, Elsevier, vol. 59(C), pages 467-483.
    8. Yani Wang & Jun Wang & Tang Yao, 2019. "What makes a helpful online review? A meta-analysis of review characteristics," Electronic Commerce Research, Springer, vol. 19(2), pages 257-284, June.
    9. Dominik Gutt & Jürgen Neumann & Steffen Zimmermann & Dennis Kundisch & Jianqing Chen, 2018. "Design of Review Systems - A Strategic Instrument to shape Online Review Behavior and Economic Outcomes," Working Papers Dissertations 42, Paderborn University, Faculty of Business Administration and Economics.
    10. Guha Majumder, Madhumita & Dutta Gupta, Sangita & Paul, Justin, 2022. "Perceived usefulness of online customer reviews: A review mining approach using machine learning & exploratory data analysis," Journal of Business Research, Elsevier, vol. 150(C), pages 147-164.
    11. Raoofpanah, Iman & Zamudio, César & Groening, Christopher, 2023. "Review reader segmentation based on the heterogeneous impacts of review and reviewer attributes on review helpfulness: A study involving ZIP code data," Journal of Retailing and Consumer Services, Elsevier, vol. 72(C).
    12. Yanni Ping & Chelsey Hill & Yun Zhu & Jorge Fresneda, 2023. "Antecedents and consequences of the key opinion leader status: an econometric and machine learning approach," Electronic Commerce Research, Springer, vol. 23(3), pages 1459-1484, September.
    13. Ana Babić Rosario & Kristine Valck & Francesca Sotgiu, 2020. "Conceptualizing the electronic word-of-mouth process: What we know and need to know about eWOM creation, exposure, and evaluation," Journal of the Academy of Marketing Science, Springer, vol. 48(3), pages 422-448, May.
    14. Yi Feng & Yunqiang Yin & Dujuan Wang & Lalitha Dhamotharan & Joshua Ignatius & Ajay Kumar, 2023. "Diabetic patient review helpfulness: unpacking online drug treatment reviews by text analytics and design science approach," Annals of Operations Research, Springer, vol. 328(1), pages 387-418, September.
    15. Christoph Schneider & Markus Weinmann & Peter N.C. Mohr & Jan vom Brocke, 2021. "When the Stars Shine Too Bright: The Influence of Multidimensional Ratings on Online Consumer Ratings," Management Science, INFORMS, vol. 67(6), pages 3871-3898, June.
    16. Colmekcioglu, Nazan & Marvi, Reza & Foroudi, Pantea & Okumus, Fevzi, 2022. "Generation, susceptibility, and response regarding negativity: An in-depth analysis on negative online reviews," Journal of Business Research, Elsevier, vol. 153(C), pages 235-250.
    17. Srikanth Parameswaran & Pubali Mukherjee & Rohit Valecha, 2023. "I Like My Anonymity: An Empirical Investigation of the Effect of Multidimensional Review Text and Role Anonymity on Helpfulness of Employer Reviews," Information Systems Frontiers, Springer, vol. 25(2), pages 853-870, April.
    18. Pei-Yu Chen & Yili Hong & Ying Liu, 2018. "The Value of Multidimensional Rating Systems: Evidence from a Natural Experiment and Randomized Experiments," Management Science, INFORMS, vol. 64(10), pages 4629-4647, October.
    19. Yen-Liang Chen & Chia-Ling Chang & An-Qiao Sung, 2021. "Predicting eWOM’s Influence on Purchase Intention Based on Helpfulness, Credibility, Information Quality and Professionalism," Sustainability, MDPI, vol. 13(13), pages 1-19, July.
    20. Meek, Stephanie & Wilk, Violetta & Lambert, Claire, 2021. "A big data exploration of the informational and normative influences on the helpfulness of online restaurant reviews," Journal of Business Research, Elsevier, vol. 125(C), pages 354-367.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:sae:sagope:v:10:y:2020:i:4:p:2158244020983316. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: SAGE Publications (email available below). General contact details of provider: .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.