IDEAS home Printed from https://ideas.repec.org/a/spr/joamsc/v50y2022i6d10.1007_s11747-022-00840-3.html
   My bibliography  Save this article

An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing

Author

Listed:
  • Venkatesh Shankar

    (Mays Business School)

  • Sohil Parsana

    (Mays Business School
    Oracle)

Abstract

With artificial intelligence permeating conversations and marketing interactions through digital technologies and media, machine learning models, in particular, natural language processing (NLP) models, have surged in popularity for analyzing unstructured data in marketing. Yet, we do not fully understand which NLP models are appropriate for which marketing applications and what insights can be best derived from them. We review different NLP models and their applications in marketing. We layout the advantages and disadvantages of these models and highlight the conditions under which different models are appropriate in the marketing context. We introduce the latest neural autoencoder NLP models, demonstrate these models to analyze new product announcements and news articles, and provide an empirical comparison of the different autoencoder models along with the statistical NLP models. We discuss the insights from the comparison and offer guidelines for researchers. We outline future extensions of NLP models in marketing.

Suggested Citation

  • Venkatesh Shankar & Sohil Parsana, 2022. "An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1324-1350, November.
  • Handle: RePEc:spr:joamsc:v:50:y:2022:i:6:d:10.1007_s11747-022-00840-3
    DOI: 10.1007/s11747-022-00840-3
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1007/s11747-022-00840-3
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1007/s11747-022-00840-3?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Mantian (Mandy) Hu & Chu (Ivy) Dang & Pradeep K. Chintagunta, 2019. "Search and Learning at a Daily Deals Website," Marketing Science, INFORMS, vol. 38(4), pages 609-642, July.
    2. Vermeer, Susan A.M. & Araujo, Theo & Bernritter, Stefan F. & van Noort, Guda, 2019. "Seeing the wood for the trees: How machine learning can help firms in identifying relevant electronic word-of-mouth in social media," International Journal of Research in Marketing, Elsevier, vol. 36(3), pages 492-508.
    3. Yang Pan & Peng Huang & Anandasivam Gopal, 2019. "Storm Clouds on the Horizon? New Entry Threats and R&D Investments in the U.S. IT Industry," Information Systems Research, INFORMS, vol. 30(2), pages 540-562, June.
    4. Ashlee Humphreys & Rebecca Jen-Hui Wang & Eileen FischerEditor & Linda PriceAssociate Editor, 2018. "Automated Text Analysis for Consumer Research," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 44(6), pages 1274-1306.
    5. Manuel Hermosilla & Fernanda Gutiérrez-Navratil & Juan Prieto-Rodríguez, 2018. "Can Emerging Markets Tilt Global Product Design? Impacts of Chinese Colorism on Hollywood Castings," Marketing Science, INFORMS, vol. 37(3), pages 356-381, May.
    6. Patricia M. West & Patrick L. Brockett & Linda L. Golden, 1997. "A Comparative Analysis of Neural Networks and Statistical Methods for Predicting Consumer Choice," Marketing Science, INFORMS, vol. 16(4), pages 370-391.
    7. Dokyun Lee & Kartik Hosanagar & Harikesh S. Nair, 2018. "Advertising Content and Consumer Engagement on Social Media: Evidence from Facebook," Management Science, INFORMS, vol. 64(11), pages 5105-5131, November.
    8. Dirk Hovy & Shiri Melumad & J Jeffrey Inman & Richard J Lutz & Charles F Hofacker, 2021. "Wordify: A Tool for Discovering and Differentiating Consumer Vocabularies," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 48(3), pages 394-414.
    9. Ning Zhong & David A. Schweidel, 2020. "Capturing Changes in Social Media Content: A Multiple Latent Changepoint Topic Model," Marketing Science, INFORMS, vol. 39(4), pages 827-846, July.
    10. Nikolay Archak & Anindya Ghose & Panagiotis G. Ipeirotis, 2011. "Deriving the Pricing Power of Product Features by Mining Consumer Reviews," Management Science, INFORMS, vol. 57(8), pages 1485-1509, August.
    11. Jehoshua Eliashberg & Sam K. Hui & Z. John Zhang, 2007. "From Story Line to Box Office: A New Approach for Green-Lighting Movie Scripts," Management Science, INFORMS, vol. 53(6), pages 881-893, June.
    12. Hartmann, Jochen & Huppertz, Juliana & Schamp, Christina & Heitmann, Mark, 2019. "Comparing automated text classification methods," International Journal of Research in Marketing, Elsevier, vol. 36(1), pages 20-38.
    13. Bitty Balducci & Detelina Marinova, 2018. "Unstructured data in marketing," Journal of the Academy of Marketing Science, Springer, vol. 46(4), pages 557-590, July.
    14. Joachim Büschken & Greg M. Allenby, 2020. "Improving Text Analysis Using Sentence Conjunctions and Punctuation," Marketing Science, INFORMS, vol. 39(4), pages 727-742, July.
    15. Xiao Liu & Param Vir Singh & Kannan Srinivasan, 2016. "A Structured Analysis of Unstructured Big Data by Leveraging Cloud Computing," Marketing Science, INFORMS, vol. 35(3), pages 363-388, May.
    16. Bruno J.D. Jacobs & Bas Donkers & Dennis Fok, 2016. "Model-Based Purchase Predictions for Large Assortments," Marketing Science, INFORMS, vol. 35(3), pages 389-404, May.
    17. Oded Netzer & Ronen Feldman & Jacob Goldenberg & Moshe Fresko, 2012. "Mine Your Own Business: Market-Structure Surveillance Through Text Mining," Marketing Science, INFORMS, vol. 31(3), pages 521-543, May.
    18. Martin Reisenbichler & Thomas Reutterer, 2019. "Topic modeling in marketing: recent advances and research opportunities," Journal of Business Economics, Springer, vol. 89(3), pages 327-356, April.
    19. Jia Liu & Olivier Toubia, 2018. "A Semantic Approach for Estimating Consumer Content Preferences from Online Search Queries," Marketing Science, INFORMS, vol. 37(6), pages 930-952, November.
    20. Joachim Büschken & Greg M. Allenby, 2016. "Sentence-Based Text Analysis for Customer Reviews," Marketing Science, INFORMS, vol. 35(6), pages 953-975, November.
    21. Anindya Ghose & Panagiotis G. Ipeirotis & Beibei Li, 2019. "Modeling Consumer Footprints on Search Engines: An Interplay with Social Media," Management Science, INFORMS, vol. 65(3), pages 1363-1385, March.
    22. Lemmens, A. & Croux, C., 2006. "Bagging and boosting classification trees to predict churn," Other publications TiSEM d5cb664d-5859-44db-a621-e, Tilburg University, School of Economics and Management.
    23. Guiyang Xiong & Sundar Bharadwaj, 2014. "Prerelease Buzz Evolution Patterns and New Product Performance," Marketing Science, INFORMS, vol. 33(3), pages 401-421, May.
    24. João Guerreiro & Paulo Rita & Duarte Trigueiros, 2016. "A Text Mining-Based Review of Cause-Related Marketing Literature," Journal of Business Ethics, Springer, vol. 139(1), pages 111-128, November.
    25. Dapeng Cui & David Curry, 2005. "Prediction in Marketing Using the Support Vector Machine," Marketing Science, INFORMS, vol. 24(4), pages 595-615, January.
    26. Jalali, Nima Y. & Papatla, Purushottam, 2019. "Composing tweets to increase retweets," International Journal of Research in Marketing, Elsevier, vol. 36(4), pages 647-668.
    27. Moro, Sérgio & Pires, Guilherme & Rita, Paulo & Cortez, Paulo, 2019. "A text mining and topic modelling perspective of ethnic marketing research," Journal of Business Research, Elsevier, vol. 103(C), pages 275-285.
    28. Geng Cui & Man Leung Wong & Hon-Kwong Lui, 2006. "Machine Learning for Direct Marketing Response Models: Bayesian Networks with Evolutionary Programming," Management Science, INFORMS, vol. 52(4), pages 597-612, April.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Kirk Plangger & Dhruv Grewal & Ko Ruyter & Catherine Tucker, 2022. "The future of digital technologies in marketing: A conceptual framework and an overview," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1125-1134, November.
    2. Alin-Gabriel Vaduva & Simona-Vasilica Oprea & Dragos-Catalin Barbu, 2023. "Understanding Customers' Opinion using Web Scraping and Natural Language Processing," Ovidius University Annals, Economic Sciences Series, Ovidius University of Constantza, Faculty of Economic Sciences, vol. 0(1), pages 537-544, August.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Ma, Liye & Sun, Baohong, 2020. "Machine learning and AI in marketing – Connecting computing power to human insights," International Journal of Research in Marketing, Elsevier, vol. 37(3), pages 481-504.
    2. Alantari, Huwail J. & Currim, Imran S. & Deng, Yiting & Singh, Sameer, 2022. "An empirical comparison of machine learning methods for text-based sentiment analysis of online consumer reviews," International Journal of Research in Marketing, Elsevier, vol. 39(1), pages 1-19.
    3. Grewal, Dhruv & Herhausen, Dennis & Ludwig, Stephan & Villarroel Ordenes, Francisco, 2022. "The Future of Digital Communication Research: Considering Dynamics and Multimodality," Journal of Retailing, Elsevier, vol. 98(2), pages 224-240.
    4. Li, Xi & Shi, Mengze & Wang, Xin (Shane), 2019. "Video mining: Measuring visual information using automatic methods," International Journal of Research in Marketing, Elsevier, vol. 36(2), pages 216-231.
    5. Bitty Balducci & Detelina Marinova, 2018. "Unstructured data in marketing," Journal of the Academy of Marketing Science, Springer, vol. 46(4), pages 557-590, July.
    6. Ming-Hui Huang & Roland T. Rust, 2021. "A strategic framework for artificial intelligence in marketing," Journal of the Academy of Marketing Science, Springer, vol. 49(1), pages 30-50, January.
    7. Ngai, Eric W.T. & Wu, Yuanyuan, 2022. "Machine learning in marketing: A literature review, conceptual framework, and research agenda," Journal of Business Research, Elsevier, vol. 145(C), pages 35-48.
    8. Wang, Xin (Shane) & Ryoo, Jun Hyun (Joseph) & Bendle, Neil & Kopalle, Praveen K., 2021. "The role of machine learning analytics and metrics in retailing research," Journal of Retailing, Elsevier, vol. 97(4), pages 658-675.
    9. Kübler, Raoul V. & Colicev, Anatoli & Pauwels, Koen H., 2020. "Social Media's Impact on the Consumer Mindset: When to Use Which Sentiment Extraction Tool?," Journal of Interactive Marketing, Elsevier, vol. 50(C), pages 136-155.
    10. Huang, Ming-Hui & Rust, Roland T., 2022. "A Framework for Collaborative Artificial Intelligence in Marketing," Journal of Retailing, Elsevier, vol. 98(2), pages 209-223.
    11. Laura Toschi & Elisa Ughetto & Andrea Fronzetti Colladon, 2023. "The identity of social impact venture capitalists: exploring social linguistic positioning and linguistic distinctiveness through text mining," Small Business Economics, Springer, vol. 60(3), pages 1249-1280, March.
    12. Hyowon Kim & Greg M. Allenby, 2022. "Integrating Textual Information into Models of Choice and Scaled Response Data," Marketing Science, INFORMS, vol. 41(4), pages 815-830, July.
    13. Bruno Jacobs & Dennis Fok & Bas Donkers, 2021. "Understanding Large-Scale Dynamic Purchase Behavior," Marketing Science, INFORMS, vol. 40(5), pages 844-870, September.
    14. Ratchford, Brian & Soysal, Gonca & Zentner, Alejandro & Gauri, Dinesh K., 2022. "Online and offline retailing: What we know and directions for future research," Journal of Retailing, Elsevier, vol. 98(1), pages 152-177.
    15. Ning Zhong & David A. Schweidel, 2020. "Capturing Changes in Social Media Content: A Multiple Latent Changepoint Topic Model," Marketing Science, INFORMS, vol. 39(4), pages 827-846, July.
    16. Sheng, Jie & Amankwah-Amoah, Joseph & Wang, Xiaojun, 2017. "A multidisciplinary perspective of big data in management research," International Journal of Production Economics, Elsevier, vol. 191(C), pages 97-112.
    17. Jiyeon Hong & Paul R. Hoban, 2022. "Writing More Compelling Creative Appeals: A Deep Learning-Based Approach," Marketing Science, INFORMS, vol. 41(5), pages 941-965, September.
    18. Carlson, Keith & Kopalle, Praveen K. & Riddell, Allen & Rockmore, Daniel & Vana, Prasad, 2023. "Complementing human effort in online reviews: A deep learning approach to automatic content generation and review synthesis," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 54-74.
    19. Oliver Schaer & Nikolaos Kourentzes & Robert Fildes, 2022. "Predictive competitive intelligence with prerelease online search traffic," Production and Operations Management, Production and Operations Management Society, vol. 31(10), pages 3823-3839, October.
    20. Gandhi, Mohina & Kar, Arpan Kumar, 2022. "How do Fortune firms build a social presence on social media platforms? Insights from multi-modal analytics," Technological Forecasting and Social Change, Elsevier, vol. 182(C).

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:spr:joamsc:v:50:y:2022:i:6:d:10.1007_s11747-022-00840-3. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.springer.com .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.