IDEAS home Printed from https://ideas.repec.org/p/cwl/cwldpp/2176r2.html
   My bibliography  Save this paper

Attribute Sentiment Scoring With Online Text Reviews : Accounting for Language Structure and Attribute Self-Selection

Author

Listed:

Abstract

The authors address two significant challenges in using online text reviews to obtain finegrained attribute level sentiment ratings. First, in contrast to methods that rely on word frequency, they develop a deep learning convolutional-LSTM hybrid model to account for language structure. The convolutional layer accounts for spatial structure (adjacent word groups or phrases) and LSTM accounts for sequential structure of language (sentiment distributed and modified across non-adjacent phrases). Second, they address the problem of missing attributes in text in constructing attribute sentiment scores'as reviewers write only about a subset of attributes and remain silent on others. They develop a model-based imputation strategy using a structural model of heterogeneous rating behavior. Using Yelp restaurant review data, they show superior attribute sentiment scoring accuracy with their model. They find three reviewer segments with different motivations: status seeking, altruism/want voice, and need to vent/praise. Reviewers write to inform and vent/praise, but not based on attribute importance. The heterogeneous model-based imputation performs better than other common imputations; and importantly leads to managerially significant corrections in restaurant attribute ratings. More broadly, our results suggest that social science research should pay more attention to reduce measurement error in variables constructed from text.

Suggested Citation

  • Ishita Chakraborty & Minkyung Kim & K. Sudhir, 2019. "Attribute Sentiment Scoring With Online Text Reviews : Accounting for Language Structure and Attribute Self-Selection," Cowles Foundation Discussion Papers 2176R2, Cowles Foundation for Research in Economics, Yale University, revised Jun 2021.
  • Handle: RePEc:cwl:cwldpp:2176r2
    as

    Download full text from publisher

    File URL: https://cowles.yale.edu/sites/default/files/files/pub/d21/d2176-r2.pdf
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Matthew Gentzkow & Bryan Kelly & Matt Taddy, 2019. "Text as Data," Journal of Economic Literature, American Economic Association, vol. 57(3), pages 535-574, September.
    2. Dina Mayzlin & Yaniv Dover & Judith Chevalier, 2014. "Promotional Reviews: An Empirical Investigation of Online Review Manipulation," American Economic Review, American Economic Association, vol. 104(8), pages 2421-2455, August.
    3. Dhar, Vasant & Chang, Elaine A., 2009. "Does Chatter Matter? The Impact of User-Generated Content on Music Sales," Journal of Interactive Marketing, Elsevier, vol. 23(4), pages 300-307.
    4. Peter Arcidiacono & John Bailey Jones, 2003. "Finite Mixture Distributions, Sequential Likelihood and the EM Algorithm," Econometrica, Econometric Society, vol. 71(3), pages 933-946, May.
    5. Susan Athey & Mohsen Bayati & Nikolay Doudchenko & Guido Imbens & Khashayar Khosravi, 2021. "Matrix Completion Methods for Causal Panel Data Models," Journal of the American Statistical Association, Taylor & Francis Journals, vol. 116(536), pages 1716-1730, October.
    6. Michael Luca, 2011. "Reviews, Reputation, and Revenue: The Case of Yelp.com," Harvard Business School Working Papers 12-016, Harvard Business School, revised Mar 2016.
    7. Oded Netzer & Ronen Feldman & Jacob Goldenberg & Moshe Fresko, 2012. "Mine Your Own Business: Market-Structure Surveillance Through Text Mining," Marketing Science, INFORMS, vol. 31(3), pages 521-543, May.
    8. David Godes & Dina Mayzlin, 2004. "Using Online Conversations to Study Word-of-Mouth Communication," Marketing Science, INFORMS, vol. 23(4), pages 545-560, June.
    9. Jonah Berger & Alan T. Sorensen & Scott J. Rasmussen, 2010. "Positive Effects of Negative Publicity: When Negative Reviews Increase Sales," Marketing Science, INFORMS, vol. 29(5), pages 815-827, 09-10.
    10. Onishi, Hiroshi & Manchanda, Puneet, 2012. "Marketing activity, blogging and sales," International Journal of Research in Marketing, Elsevier, vol. 29(3), pages 221-234.
    11. Nikolay Archak & Anindya Ghose & Panagiotis G. Ipeirotis, 2011. "Deriving the Pricing Power of Product Features by Mining Consumer Reviews," Management Science, INFORMS, vol. 57(8), pages 1485-1509, August.
    12. Aron Culotta & Jennifer Cutler, 2016. "Mining Brand Perceptions from Twitter Social Networks," Marketing Science, INFORMS, vol. 35(3), pages 343-362, May.
    13. Hollenbeck, Brett, 2018. "Online Reputation Mechanisms and the Decreasing Value of Chain Affliation," MPRA Paper 91573, University Library of Munich, Germany.
    14. Joachim Büschken & Greg M. Allenby, 2020. "Improving Text Analysis Using Sentence Conjunctions and Punctuation," Marketing Science, INFORMS, vol. 39(4), pages 727-742, July.
    15. Peter Arcidiacono & Robert A. Miller, 2011. "Conditional Choice Probability Estimation of Dynamic Discrete Choice Models With Unobserved Heterogeneity," Econometrica, Econometric Society, vol. 79(6), pages 1823-1867, November.
    16. Dinesh Puranam & Vishal Narayan & Vrinda Kadiyali, 2017. "The Effect of Calorie Posting Regulation on Consumer Opinion: A Flexible Latent Dirichlet Allocation Model with Informative Priors," Marketing Science, INFORMS, vol. 36(5), pages 726-746, September.
    17. Xinxin Li & Lorin M. Hitt, 2008. "Self-Selection and Information Role of Online Product Reviews," Information Systems Research, INFORMS, vol. 19(4), pages 456-474, December.
    18. Michael Luca & Georgios Zervas, 2016. "Fake It Till You Make It: Reputation, Competition, and Yelp Review Fraud," Management Science, INFORMS, vol. 62(12), pages 3412-3427, December.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Roelen-Blasberg, Tobias & Habel, Johannes & Klarmann, Martin, 2023. "Automated inference of product attributes and their importance from user-generated content: Can we replace traditional market research?," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 164-188.
    2. Alantari, Huwail J. & Currim, Imran S. & Deng, Yiting & Singh, Sameer, 2022. "An empirical comparison of machine learning methods for text-based sentiment analysis of online consumer reviews," International Journal of Research in Marketing, Elsevier, vol. 39(1), pages 1-19.
    3. Carlson, Keith & Kopalle, Praveen K. & Riddell, Allen & Rockmore, Daniel & Vana, Prasad, 2023. "Complementing human effort in online reviews: A deep learning approach to automatic content generation and review synthesis," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 54-74.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ishita Chakraborty & Minkyung Kim & K. Sudhir, 2019. "Attribute Sentiment Scoring With Online Text Reviews : Accounting for Language Structure and Attribute Self-Selection," Cowles Foundation Discussion Papers 2176R, Cowles Foundation for Research in Economics, Yale University, revised Sep 2020.
    2. Ishita Chakraborty & Minkyung Kim & K. Sudhir, 2019. "Attribute Sentiment Scoring With Online Text Reviews : Accounting for Language Structure and Attribute Self-Selection," Cowles Foundation Discussion Papers 2176, Cowles Foundation for Research in Economics, Yale University.
    3. Dominik Gutt & Jürgen Neumann & Steffen Zimmermann & Dennis Kundisch & Jianqing Chen, 2018. "Design of Review Systems - A Strategic Instrument to shape Online Review Behavior and Economic Outcomes," Working Papers Dissertations 42, Paderborn University, Faculty of Business Administration and Economics.
    4. Jorge Mejia & Shawn Mankad & Anandasivam Gopal, 2019. "A for Effort? Using the Crowd to Identify Moral Hazard in New York City Restaurant Hygiene Inspections," Information Systems Research, INFORMS, vol. 30(4), pages 1363-1386, December.
    5. Anindya Ghose & Panagiotis G. Ipeirotis & Beibei Li, 2012. "Designing Ranking Systems for Hotels on Travel Search Engines by Mining User-Generated and Crowdsourced Content," Marketing Science, INFORMS, vol. 31(3), pages 493-520, May.
    6. Balázs Kovács, 2024. "The Turing test of online reviews: Can we tell the difference between human-written and GPT-4-written online reviews?," Marketing Letters, Springer, vol. 35(4), pages 651-666, December.
    7. Weijia (Daisy) Dai & Ginger Jin & Jungmin Lee & Michael Luca, 2018. "Aggregation of consumer ratings: an application to Yelp.com," Quantitative Marketing and Economics (QME), Springer, vol. 16(3), pages 289-339, September.
    8. Xiao Liu & Param Vir Singh & Kannan Srinivasan, 2016. "A Structured Analysis of Unstructured Big Data by Leveraging Cloud Computing," Marketing Science, INFORMS, vol. 35(3), pages 363-388, May.
    9. Christof Naumzik & Stefan Feuerriegel & Markus Weinmann, 2022. "I Will Survive: Predicting Business Failures from Customer Ratings," Marketing Science, INFORMS, vol. 41(1), pages 188-207, January.
    10. Saridakis, Charalampos & Katsikeas, Constantine S. & Angelidou, Sofia & Oikonomidou, Maria & Pratikakis, Polyvios, 2023. "Mining Twitter lists to extract brand-related associative information for celebrity endorsement," European Journal of Operational Research, Elsevier, vol. 311(1), pages 316-332.
    11. Bitty Balducci & Detelina Marinova, 2018. "Unstructured data in marketing," Journal of the Academy of Marketing Science, Springer, vol. 46(4), pages 557-590, July.
    12. Juan Feng & Xin Li & Xiaoquan (Michael) Zhang, 2019. "Online Product Reviews-Triggered Dynamic Pricing: Theory and Evidence," Information Systems Research, INFORMS, vol. 30(4), pages 1107-1123, December.
    13. Colmekcioglu, Nazan & Marvi, Reza & Foroudi, Pantea & Okumus, Fevzi, 2022. "Generation, susceptibility, and response regarding negativity: An in-depth analysis on negative online reviews," Journal of Business Research, Elsevier, vol. 153(C), pages 235-250.
    14. Tao Lu & May Yuan & Chong (Alex) Wang & Xiaoquan (Michael) Zhang, 2022. "Histogram Distortion Bias in Consumer Choices," Management Science, INFORMS, vol. 68(12), pages 8963-8978, December.
    15. Heeseung Andrew Lee & Angela Aerry Choi & Tianshu Sun & Wonseok Oh, 2021. "Reviewing Before Reading? An Empirical Investigation of Book-Consumption Patterns and Their Effects on Reviews and Sales," Information Systems Research, INFORMS, vol. 32(4), pages 1368-1389, December.
    16. Sungsik Park & Woochoel Shin & Jinhong Xie, 2021. "The Fateful First Consumer Review," Marketing Science, INFORMS, vol. 40(3), pages 481-507, May.
    17. Pei-Yu Chen & Yili Hong & Ying Liu, 2018. "The Value of Multidimensional Rating Systems: Evidence from a Natural Experiment and Randomized Experiments," Management Science, INFORMS, vol. 64(10), pages 4629-4647, October.
    18. Erfan Rezvani & Christian Rojas, 2022. "Firm responsiveness to consumers' reviews: The effect on online reputation," Journal of Economics & Management Strategy, Wiley Blackwell, vol. 31(4), pages 898-922, November.
    19. Cheng Zhao & Chong Alex Wang, 2023. "A cross-site comparison of online review manipulation using Benford’s law," Electronic Commerce Research, Springer, vol. 23(1), pages 365-406, March.
    20. Brett Hollenbeck & Sridhar Moorthy & Davide Proserpio, 2019. "Advertising Strategy in the Presence of Reviews: An Empirical Analysis," Marketing Science, INFORMS, vol. 38(5), pages 793-811, September.

    More about this item

    Keywords

    Text mining; Natural language processing (NLP); Convolutional neural networks (CNN); Long-short term memory (LSTM) Networks; Deep learning; Lexicons; Endogeneity; Self-selection; Online reviews; Online ratings; Customer satisfaction;
    All these keywords.

    JEL classification:

    • M1 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Business Administration
    • M3 - Business Administration and Business Economics; Marketing; Accounting; Personnel Economics - - Marketing and Advertising
    • C8 - Mathematical and Quantitative Methods - - Data Collection and Data Estimation Methodology; Computer Programs
    • C5 - Mathematical and Quantitative Methods - - Econometric Modeling

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:cwl:cwldpp:2176r2. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Brittany Ladd (email available below). General contact details of provider: https://edirc.repec.org/data/cowleus.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.