IDEAS home Printed from https://ideas.repec.org/a/pal/jmarka/v9y2021i2d10.1057_s41270-021-00109-8.html
   My bibliography  Save this article

BERT: a sentiment analysis odyssey

Author

Listed:
  • Shivaji Alaparthi

    (CenturyLink)

  • Manit Mishra

    (International Management Institute Bhubaneswar)

Abstract

The study investigates relative effectiveness of four sentiment analysis techniques: (1) unsupervised lexicon-based model using SentiWordNet, (2) traditional supervised machine learning model using logistic regression, (3) supervised deep learning model using Long Short-Term Memory (LSTM), and (4) advanced supervised deep learning model using Bidirectional Encoder Representations from Transformers (BERT). Publicly available labeled corpora of 50,000 movie reviews originally posted on Internet movie database (IMDB) were analyzed. Sentiment classification performance was calibrated on accuracy, precision, recall, and F1 score. The study puts forth two key insights: (1) relative efficacy of four sentiment analysis algorithms and (2) undisputed superiority of pre-trained advanced supervised deep learning algorithm BERT in sentiment classification from text. The study is of value to analytics professionals and academicians working on text analysis as it offers critical insight regarding sentiment classification performance of key algorithms, including the recently developed BERT.

Suggested Citation

  • Shivaji Alaparthi & Manit Mishra, 2021. "BERT: a sentiment analysis odyssey," Journal of Marketing Analytics, Palgrave Macmillan, vol. 9(2), pages 118-126, June.
  • Handle: RePEc:pal:jmarka:v:9:y:2021:i:2:d:10.1057_s41270-021-00109-8
    DOI: 10.1057/s41270-021-00109-8
    as

    Download full text from publisher

    File URL: http://link.springer.com/10.1057/s41270-021-00109-8
    File Function: Abstract
    Download Restriction: Access to the full text of the articles in this series is restricted.

    File URL: https://libkey.io/10.1057/s41270-021-00109-8?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to search for a different version of it.

    References listed on IDEAS

    as
    1. Sanjiv R. Das & Mike Y. Chen, 2007. "Yahoo! for Amazon: Sentiment Extraction from Small Talk on the Web," Management Science, INFORMS, vol. 53(9), pages 1375-1388, September.
    2. Sivarajah, Uthayasankar & Kamal, Muhammad Mustafa & Irani, Zahir & Weerakkody, Vishanth, 2017. "Critical analysis of Big Data challenges and analytical methods," Journal of Business Research, Elsevier, vol. 70(C), pages 263-286.
    3. Nikolay Archak & Anindya Ghose & Panagiotis G. Ipeirotis, 2011. "Deriving the Pricing Power of Product Features by Mining Consumer Reviews," Management Science, INFORMS, vol. 57(8), pages 1485-1509, August.
    4. Bradlow, Eric T. & Gangwar, Manish & Kopalle, Praveen & Voleti, Sudhir, 2017. "The Role of Big Data and Predictive Analytics in Retailing," Journal of Retailing, Elsevier, vol. 93(1), pages 79-95.
    5. Subarno Pal & Soumadip Ghosh & Amitava Nag, 2018. "Sentiment Analysis in the Light of LSTM Recurrent Neural Networks," International Journal of Synthetic Emotions (IJSE), IGI Global, vol. 9(1), pages 33-39, January.
    6. Seshadri Tirunillai & Gerard J. Tellis, 2012. "Does Chatter Really Matter? Dynamics of User-Generated Content and Stock Performance," Marketing Science, INFORMS, vol. 31(2), pages 198-215, March.
    7. Oded Netzer & Ronen Feldman & Jacob Goldenberg & Moshe Fresko, 2012. "Mine Your Own Business: Market-Structure Surveillance Through Text Mining," Marketing Science, INFORMS, vol. 31(3), pages 521-543, May.
    8. Joachim Büschken & Greg M. Allenby, 2016. "Sentence-Based Text Analysis for Customer Reviews," Marketing Science, INFORMS, vol. 35(6), pages 953-975, November.
    9. Anjala S. Krishen & Maria Petrescu, 2017. "The world of analytics: interdisciplinary, inclusive, insightful, and influential," Journal of Marketing Analytics, Palgrave Macmillan, vol. 5(1), pages 1-4, March.
    10. Pradeep K. Chintagunta & Shyam Gopinath & Sriram Venkataraman, 2010. "The Effects of Online User Reviews on Movie Box Office Performance: Accounting for Sequential Rollout and Aggregation Across Local Markets," Marketing Science, INFORMS, vol. 29(5), pages 944-957, 09-10.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Djonata Schiessl & Helison Bertoli Alves Dias & José Carlos Korelo, 2022. "Artificial intelligence in marketing: a network analysis and future agenda," Journal of Marketing Analytics, Palgrave Macmillan, vol. 10(3), pages 207-218, September.
    2. Samir Poudel & Rajendra Paudyal & Burak Cankaya & Naomi Sterlingsdottir & Marissa Murphy & Shital Pandey & Jorge Vargas & Khem Poudel, 2023. "Cryptocurrency price and volatility predictions with machine learning," Journal of Marketing Analytics, Palgrave Macmillan, vol. 11(4), pages 642-660, December.
    3. Villarroel Ordenes, Francisco & Silipo, Rosaria, 2021. "Machine learning for marketing on the KNIME Hub: The development of a live repository for marketing applications," Journal of Business Research, Elsevier, vol. 137(C), pages 393-410.
    4. Zecong Ma & Sergio Palacios, 2021. "Image-mining: exploring the impact of video content on the success of crowdfunding," Journal of Marketing Analytics, Palgrave Macmillan, vol. 9(4), pages 265-285, December.
    5. Daesik Kim & Chung Joo Chung & Kihong Eom, 2022. "Measuring Online Public Opinion for Decision Making: Application of Deep Learning on Political Context," Sustainability, MDPI, vol. 14(7), pages 1-16, March.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Xiao Liu & Param Vir Singh & Kannan Srinivasan, 2016. "A Structured Analysis of Unstructured Big Data by Leveraging Cloud Computing," Marketing Science, INFORMS, vol. 35(3), pages 363-388, May.
    2. Sheng, Jie & Amankwah-Amoah, Joseph & Wang, Xiaojun, 2017. "A multidisciplinary perspective of big data in management research," International Journal of Production Economics, Elsevier, vol. 191(C), pages 97-112.
    3. Oded Netzer & Ronen Feldman & Jacob Goldenberg & Moshe Fresko, 2012. "Mine Your Own Business: Market-Structure Surveillance Through Text Mining," Marketing Science, INFORMS, vol. 31(3), pages 521-543, May.
    4. Christof Naumzik & Stefan Feuerriegel & Markus Weinmann, 2022. "I Will Survive: Predicting Business Failures from Customer Ratings," Marketing Science, INFORMS, vol. 41(1), pages 188-207, January.
    5. Supriyo Mandal & Abyayananda Maiti, 2022. "Network promoter score (NePS): An indicator of product sales in E-commerce retailing sector," Electronic Markets, Springer;IIM University of St. Gallen, vol. 32(3), pages 1327-1349, September.
    6. Xin (Shane) Wang & Feng Mai & Roger H. L. Chiang, 2014. "Database Submission ---Market Dynamics and User-Generated Content About Tablet Computers," Marketing Science, INFORMS, vol. 33(3), pages 449-458, May.
    7. Aron Culotta & Jennifer Cutler, 2016. "Mining Brand Perceptions from Twitter Social Networks," Marketing Science, INFORMS, vol. 35(3), pages 343-362, May.
    8. Khim-Yong Goh & Cheng-Suang Heng & Zhijie Lin, 2013. "Social Media Brand Community and Consumer Behavior: Quantifying the Relative Impact of User- and Marketer-Generated Content," Information Systems Research, INFORMS, vol. 24(1), pages 88-107, March.
    9. Li, Xi & Shi, Mengze & Wang, Xin (Shane), 2019. "Video mining: Measuring visual information using automatic methods," International Journal of Research in Marketing, Elsevier, vol. 36(2), pages 216-231.
    10. Agnieszka Zablocki & Bodo Schlegelmilch & Michael J. Houston, 2019. "How valence, volume and variance of online reviews influence brand attitudes," AMS Review, Springer;Academy of Marketing Science, vol. 9(1), pages 61-77, June.
    11. Laura Toschi & Elisa Ughetto & Andrea Fronzetti Colladon, 2023. "The identity of social impact venture capitalists: exploring social linguistic positioning and linguistic distinctiveness through text mining," Small Business Economics, Springer, vol. 60(3), pages 1249-1280, March.
    12. Roelen-Blasberg, Tobias & Habel, Johannes & Klarmann, Martin, 2023. "Automated inference of product attributes and their importance from user-generated content: Can we replace traditional market research?," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 164-188.
    13. Boegershausen, Johannes & Datta, Hannes & Borah, Abhishek & Stephen, Andrew, 2022. "Fields of Gold: Web Scraping and APIs for Impactful Marketing Insights," Other publications TiSEM 5f1ed70a-48c3-422c-bc10-0, Tilburg University, School of Economics and Management.
    14. Peters, Kay & Chen, Yubo & Kaplan, Andreas M. & Ognibeni, Björn & Pauwels, Koen, 2013. "Social Media Metrics — A Framework and Guidelines for Managing Social Media," Journal of Interactive Marketing, Elsevier, vol. 27(4), pages 281-298.
    15. Venkatesh Shankar & Sohil Parsana, 2022. "An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1324-1350, November.
    16. Michael Nofer & Oliver Hinz, 2015. "Using Twitter to Predict the Stock Market," Business & Information Systems Engineering: The International Journal of WIRTSCHAFTSINFORMATIK, Springer;Gesellschaft für Informatik e.V. (GI), vol. 57(4), pages 229-242, August.
    17. Dawn Iacobucci & Maria Petrescu & Anjala Krishen & Michael Bendixen, 2019. "The state of marketing analytics in research and practice," Journal of Marketing Analytics, Palgrave Macmillan, vol. 7(3), pages 152-181, September.
    18. Bitty Balducci & Detelina Marinova, 2018. "Unstructured data in marketing," Journal of the Academy of Marketing Science, Springer, vol. 46(4), pages 557-590, July.
    19. Hartmann, Jochen & Huppertz, Juliana & Schamp, Christina & Heitmann, Mark, 2019. "Comparing automated text classification methods," International Journal of Research in Marketing, Elsevier, vol. 36(1), pages 20-38.
    20. Blasco-Arcas, Lorena & Lee, Hsin-Hsuan Meg & Kastanakis, Minas N. & Alcañiz, Mariano & Reyes-Menendez, Ana, 2022. "The role of consumer data in marketing: A research agenda," Journal of Business Research, Elsevier, vol. 146(C), pages 436-452.

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:pal:jmarka:v:9:y:2021:i:2:d:10.1057_s41270-021-00109-8. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Sonal Shukla or Springer Nature Abstracting and Indexing (email available below). General contact details of provider: http://www.palgrave-journals.com/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.