IDEAS home Printed from https://ideas.repec.org/a/inm/ormksc/v30y2011i3p532-549.html
   My bibliography  Save this article

Efficient Methods for Sampling Responses from Large-Scale Qualitative Data

Author

Listed:
  • Surendra N. Singh

    (School of Business, University of Kansas, Lawrence, Kansas 66045)

  • Steve Hillmer

    (School of Business, University of Kansas, Lawrence, Kansas 66045)

  • Ze Wang

    (College of Business Administration, University of Central Florida, Orlando, Florida 32816)

Abstract

The World Wide Web contains a vast corpus of consumer-generated content that holds invaluable insights for improving the product and service offerings of firms. Yet the typical method for extracting diagnostic information from online content--text mining--has limitations. As a starting point, we propose analyzing a sample of comments before initiating text mining. Using a combination of real data and simulations, we demonstrate that a sampling procedure that selects respondents whose comments contain a large amount of information is superior to the two most popular sampling methods--simple random sampling and stratified random sampling---in gaining insights from the data. In addition, we derive a method that determines the probability of observing diagnostic information repeated a specific number of times in the population, which will enable managers to base sample size decisions on the trade-off between obtaining additional diagnostic information and the added expense of a larger sample. We provide an illustration of one of the methods using a real data set from a website containing qualitative comments about staying at a hotel and demonstrate how sampling qualitative comments can be a useful first step in text mining.

Suggested Citation

  • Surendra N. Singh & Steve Hillmer & Ze Wang, 2011. "Efficient Methods for Sampling Responses from Large-Scale Qualitative Data," Marketing Science, INFORMS, vol. 30(3), pages 532-549, 05-06.
  • Handle: RePEc:inm:ormksc:v:30:y:2011:i:3:p:532-549
    DOI: 10.1287/mksc.1100.0632
    as

    Download full text from publisher

    File URL: http://dx.doi.org/10.1287/mksc.1100.0632
    Download Restriction: no

    File URL: https://libkey.io/10.1287/mksc.1100.0632?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Dina Mayzlin, 2006. "Promotional Chat on the Internet," Marketing Science, INFORMS, vol. 25(2), pages 155-163, 03-04.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Seung Hyun Lee & Jaeyong Lee, 2018. "Nickel and dime guests for amenities: exploring guests’ perceptions of resort fees," Journal of Revenue and Pricing Management, Palgrave Macmillan, vol. 17(1), pages 20-31, February.
    2. Sheng, Jie & Amankwah-Amoah, Joseph & Wang, Xiaojun, 2019. "Technology in the 21st century: New challenges and opportunities," Technological Forecasting and Social Change, Elsevier, vol. 143(C), pages 321-335.
    3. Sheng, Jie & Amankwah-Amoah, Joseph & Wang, Xiaojun, 2017. "A multidisciplinary perspective of big data in management research," International Journal of Production Economics, Elsevier, vol. 191(C), pages 97-112.
    4. Carlson, Keith & Kopalle, Praveen K. & Riddell, Allen & Rockmore, Daniel & Vana, Prasad, 2023. "Complementing human effort in online reviews: A deep learning approach to automatic content generation and review synthesis," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 54-74.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Edgardo Arturo Ayala Gaytán, 2009. "Social network externalities and price dispersion in online markets," Ensayos Revista de Economia, Universidad Autonoma de Nuevo Leon, Facultad de Economia, vol. 0(2), pages 1-28, November.
    2. René Algesheimer & Sharad Borle & Utpal M. Dholakia & Siddharth S. Singh, 2010. "The Impact of Customer Community Participation on Customer Behaviors: An Empirical Investigation," Marketing Science, INFORMS, vol. 29(4), pages 756-769, 07-08.
    3. Khim-Yong Goh & Cheng-Suang Heng & Zhijie Lin, 2013. "Social Media Brand Community and Consumer Behavior: Quantifying the Relative Impact of User- and Marketer-Generated Content," Information Systems Research, INFORMS, vol. 24(1), pages 88-107, March.
    4. Salvatore Piccolo & Piero Tedeschi & Giovanni Ursino, 2018. "Deceptive Advertising with Rational Buyers," Management Science, INFORMS, vol. 64(3), pages 1291-1310, March.
    5. Rohit Aggarwal & Ram Gopal & Ramesh Sankaranarayanan, 2007. "Negative Blogs, Positive Outcomes: When should Firms Permit Employees to Blog Honestly?," Working Papers 07-32, NET Institute, revised Sep 2007.
    6. Smith, Ronn J. & Knuff, David C. & Sprott, David E. & Spangenberg, Eric R., 2013. "The influence of negative marketplace information on consumer attitudes toward a service establishment," Journal of Retailing and Consumer Services, Elsevier, vol. 20(3), pages 358-364.
    7. Caldieraro, Fabio & Cunha, Marcus, 2022. "Consumers’ response to weak unique selling propositions: Implications for optimal product recommendation strategy," International Journal of Research in Marketing, Elsevier, vol. 39(3), pages 724-744.
    8. Yabing Jiang & Hong Guo, 2012. "Design of Consumer Review Systems and Product Pricing," Working Papers 12-10, NET Institute.
    9. Nguyen, Cathy & Romaniuk, Jenni, 2013. "Factors moderating the impact of word of mouth for TV and film broadcasts," Australasian marketing journal, Elsevier, vol. 21(1), pages 25-29.
    10. Jianqing Chen & Hong Xu & Andrew B. Whinston, 2009. "Moderated Online Communities and User-Generated Content," Working Papers 09-11, NET Institute.
    11. Jungju Yu, 2021. "A Model of Brand Architecture Choice: A House of Brands vs. A Branded House," Marketing Science, INFORMS, vol. 40(1), pages 147-167, January.
    12. Joshua S. Gans & Avi Goldfarb & Mara Lederman, 2021. "Exit, Tweets, and Loyalty," American Economic Journal: Microeconomics, American Economic Association, vol. 13(2), pages 68-112, May.
    13. Chrysanthos Dellarocas, 2006. "Strategic Manipulation of Internet Opinion Forums: Implications for Consumers and Firms," Management Science, INFORMS, vol. 52(10), pages 1577-1593, October.
    14. Zhuang, Mengzhou & Cui, Geng & Peng, Ling, 2018. "Manufactured opinions: The effect of manipulating online product reviews," Journal of Business Research, Elsevier, vol. 87(C), pages 24-35.
    15. Jonah Berger & Michaela Draganska & Itamar Simonson, 2007. "The Influence of Product Variety on Brand Perception and Choice," Marketing Science, INFORMS, vol. 26(4), pages 460-472, 07-08.
    16. Elvira Ismagilova & Emma L. Slade & Nripendra P. Rana & Yogesh K. Dwivedi, 2020. "The Effect of Electronic Word of Mouth Communications on Intention to Buy: A Meta-Analysis," Information Systems Frontiers, Springer, vol. 22(5), pages 1203-1226, October.
    17. Chakravarty, Anindita & Liu, Yong & Mazumdar, Tridib, 2010. "The Differential Effects of Online Word-of-Mouth and Critics' Reviews on Pre-release Movie Evaluation," Journal of Interactive Marketing, Elsevier, vol. 24(3), pages 185-197.
    18. Jurui Zhang & Yong Liu & Yubo Chen, 2015. "Social Learning in Networks of Friends versus Strangers," Marketing Science, INFORMS, vol. 34(4), pages 573-589, July.
    19. Weijia (Daisy) Dai & Ginger Jin & Jungmin Lee & Michael Luca, 2018. "Aggregation of consumer ratings: an application to Yelp.com," Quantitative Marketing and Economics (QME), Springer, vol. 16(3), pages 289-339, September.
    20. Chuan He & Johan Marklund & Thomas Vossen, 2008. "—Vertical Information Sharing in a Volatile Market," Marketing Science, INFORMS, vol. 27(3), pages 513-530, 05-06.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:inm:ormksc:v:30:y:2011:i:3:p:532-549. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Chris Asher (email available below). General contact details of provider: https://edirc.repec.org/data/inforea.html .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.