IDEAS home Printed from https://ideas.repec.org/a/eee/jbrese/v198y2025ics0148296325003145.html

From words to insights: Text analysis in business research

Author

Listed:
  • Herhausen, Dennis
  • Ludwig, Stephan
  • Abedin, Ehsan
  • Haque, Nasim Ul
  • de Jong, David

Abstract

Business success relies on effective stakeholder communication, much of which occurs via or can be transcribed into text. Yet, business researchers often lack coherent frameworks to conceptualize business-relevant communication and its underlying logics. We thus consider business research from a message design logic lens to offer a conceptual foundation for research seeking to understand the content, style, and structure of business communication. Business researchers also underutilize modern tools for analyzing text data. Hence, our comparison of current methodologies for analyzing text (i.e., topic models, dictionaries, supervised machine learning, and large language models) points out their respective advantages, limitations, and applications. An overview of recent studies in the Journal of Business Research identifies how these methods are used to extract insights from business communication. We offer practical guidelines for authors and reviewers on method selection, implementation, and evaluation, and conclude by proposing future directions for business research using text data.

Suggested Citation

  • Herhausen, Dennis & Ludwig, Stephan & Abedin, Ehsan & Haque, Nasim Ul & de Jong, David, 2025. "From words to insights: Text analysis in business research," Journal of Business Research, Elsevier, vol. 198(C).
  • Handle: RePEc:eee:jbrese:v:198:y:2025:i:c:s0148296325003145
    DOI: 10.1016/j.jbusres.2025.115491
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0148296325003145
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jbusres.2025.115491?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. R. Filieri & S. Alguezaui & F. Galati & E. Raguseo, 2023. "Customer experience with standard and premium Peer-To-Peer offerings: a mixed-method combining text analytics and qualitative analysis," Post-Print hal-04474167, HAL.
    2. Syrdal, Holly A. & Myers, Susan & Sen, Sandipan & Woodroof, Parker J. & McDowell, William C., 2023. "Influencer marketing and the growth of affiliates: The effects of language features on engagement behavior," Journal of Business Research, Elsevier, vol. 163(C).
    3. Hajek, Petr & Hikkerova, Lubica & Sahut, Jean-Michel, 2023. "Fake review detection in e-Commerce platforms using aspect-based sentiment analysis," Journal of Business Research, Elsevier, vol. 167(C).
    4. Lee, Michael T. & Theokary, Carol, 2021. "The superstar social media influencer: Exploiting linguistic style and emotional contagion over content?," Journal of Business Research, Elsevier, vol. 132(C), pages 860-871.
    5. Cleveland, Mark & Laroche, Michel & Papadopoulos, Nicolas, 2015. "You are what you speak? Globalization, multilingualism, consumer dispositions and consumption," Journal of Business Research, Elsevier, vol. 68(3), pages 542-552.
    6. Ghaddar, Bissan & Naoum-Sawaya, Joe, 2018. "High dimensional data classification and feature selection using support vector machines," European Journal of Operational Research, Elsevier, vol. 265(3), pages 993-1004.
    7. Ashlee Humphreys & Rebecca Jen-Hui Wang & Eileen FischerEditor & Linda PriceAssociate Editor, 2018. "Automated Text Analysis for Consumer Research," Journal of Consumer Research, Journal of Consumer Research Inc., vol. 44(6), pages 1274-1306.
    8. Hamby, Anne & Kim, Hongmin & Spezzano, Francesca, 2024. "Sensational stories: The role of narrative characteristics in distinguishing real and fake news and predicting their spread," Journal of Business Research, Elsevier, vol. 170(C).
    9. Jiang, Cuiqing & Yin, Chang & Tang, Qian & Wang, Zhao, 2023. "The value of official website information in the credit risk evaluation of SMEs," Journal of Business Research, Elsevier, vol. 169(C).
    10. Sun, Bing & Zhang, Yanfeng & Zhu, Kunyan & Mao, Hongying & Liang, Tian, 2024. "Is faster really better? The impact of digital transformation speed on firm financial distress: Based on the cost-benefit perspective," Journal of Business Research, Elsevier, vol. 179(C).
    11. Ali Goli & Amandeep Singh, 2024. "Frontiers: Can Large Language Models Capture Human Preferences?," Marketing Science, INFORMS, vol. 43(4), pages 709-722, July.
    12. Patel, Pankaj C. & Oghazi, Pejvak & Arunachalam, S., 2023. "Does consumer privacy act influence firm performance in the retail industry? Evidence from a US state-level law change," Journal of Business Research, Elsevier, vol. 162(C).
    13. Kumar, Vivek & Srivastava, Arpita, 2022. "Trends in the thematic landscape of corporate social responsibility research: A structural topic modeling approach," Journal of Business Research, Elsevier, vol. 150(C), pages 26-37.
    14. Shan, Wei & Wang, Jiaxuan & Shi, Xiaoxiao & David Evans, Richard, 2024. "The impact of electronic word-of-mouth on patients’ choices in online health communities: A cross-media perspective," Journal of Business Research, Elsevier, vol. 173(C).
    15. Wu, Jie & Zhao, Narisa & Yang, Tong, 2024. "Wisdom of crowds: SWOT analysis based on hybrid text mining methods using online reviews," Journal of Business Research, Elsevier, vol. 171(C).
    16. Liao, Miyan & Fang, Jiaming & Han, Lintong & Wen, Ling & Zheng, Qiqi & Xia, Guoen, 2023. "Boosting eCommerce sales with livestreaming in B2B marketplace: A perspective on live streamers’ competencies," Journal of Business Research, Elsevier, vol. 167(C).
    17. Jia Liu & Olivier Toubia, 2018. "A Semantic Approach for Estimating Consumer Content Preferences from Online Search Queries," Marketing Science, INFORMS, vol. 37(6), pages 930-952, November.
    18. Pan, Xiongfeng & Wang, Yuqing & Zhang, Chengming & Shen, Zhiyang & Song, Malin, 2023. "The impact of platform economy on enterprise value mediated by technological innovation," Journal of Business Research, Elsevier, vol. 165(C).
    19. Yoganathan, Vignesh & Osburg, Victoria-Sophie, 2024. "The mind in the machine: Estimating mind perception's effect on user satisfaction with voice-based conversational agents," Journal of Business Research, Elsevier, vol. 175(C).
    20. Peiyao Li & Noah Castelo & Zsolt Katona & Miklos Sarvary, 2024. "Frontiers: Determining the Validity of Large Language Models for Automated Perceptual Analysis," Marketing Science, INFORMS, vol. 43(2), pages 254-266, March.
    21. Culasso, Francesca & Gavurova, Beata & Crocco, Edoardo & Giacosa, Elisa, 2023. "Empirical identification of the chief digital officer role: A latent Dirichlet allocation approach," Journal of Business Research, Elsevier, vol. 154(C).
    22. Ning Zhong & David A. Schweidel, 2020. "Capturing Changes in Social Media Content: A Multiple Latent Changepoint Topic Model," Marketing Science, INFORMS, vol. 39(4), pages 827-846, July.
    23. Ray, Arghya & Bala, Pradip Kumar & Rana, Nripendra P., 2021. "Exploring the drivers of customers’ brand attitudes of online travel agency services: A text-mining based approach," Journal of Business Research, Elsevier, vol. 128(C), pages 391-404.
    24. Setten, Eric & Chen, Steven, 2024. "Playing with emotions: Text analysis of emotional tones in gender-casted Children’s media," Journal of Business Research, Elsevier, vol. 175(C).
    25. Lee, Inyoung & Yi, Jisu & Kim, Sang-Hoon, 2023. "Standing the test of time: What makes a book survive on the bestseller list?," Journal of Business Research, Elsevier, vol. 164(C).
    26. Dhakate, Nikhil & Joshi, Rohit, 2023. "Classification of reviews of e-healthcare services to improve patient satisfaction: Insights from an emerging economy," Journal of Business Research, Elsevier, vol. 164(C).
    27. Hartmann, Jochen & Heitmann, Mark & Siebert, Christian & Schamp, Christina, 2023. "More than a Feeling: Accuracy and Application of Sentiment Analysis," International Journal of Research in Marketing, Elsevier, vol. 40(1), pages 75-87.
    28. Bitty Balducci & Detelina Marinova, 2018. "Unstructured data in marketing," Journal of the Academy of Marketing Science, Springer, vol. 46(4), pages 557-590, July.
    29. Hartmann, Jochen & Huppertz, Juliana & Schamp, Christina & Heitmann, Mark, 2019. "Comparing automated text classification methods," International Journal of Research in Marketing, Elsevier, vol. 36(1), pages 20-38.
    30. Joachim Büschken & Greg M. Allenby, 2020. "Improving Text Analysis Using Sentence Conjunctions and Punctuation," Marketing Science, INFORMS, vol. 39(4), pages 727-742, July.
    31. Zelin Zhang & Kejia Yang & Jonathan Z. Zhang & Robert W. Palmatier, 2023. "Uncovering Synergy and Dysergy in Consumer Reviews: A Machine Learning Approach," Management Science, INFORMS, vol. 69(4), pages 2339-2360, April.
    32. Zikun Ye & Hema Yoganarasimhan & Yufeng Zheng, 2025. "LOLA: LLM-Assisted Online Learning Algorithm for Content Experiments," Marketing Science, INFORMS, vol. 44(5), pages 995-1016, September.
    33. Wang, Qiping & Yiu Keung Lau, Raymond & Xie, Haoran & Liu, Hongyan & Guo, Xunhua, 2024. "Social Executives’ emotions and firm value: An empirical study enhanced by cognitive analytics," Journal of Business Research, Elsevier, vol. 175(C).
    34. Alaa Alslaity & Rita Orji, 2024. "Machine learning techniques for emotion detection and sentiment analysis: current state, challenges, and future directions," Behaviour and Information Technology, Taylor & Francis Journals, vol. 43(1), pages 139-164, January.
    35. Garner, Benjamin & Thornton, Corliss & Luo Pawluk, Anita & Mora Cortez, Roberto & Johnston, Wesley & Ayala, Cesar, 2022. "Utilizing text-mining to explore consumer happiness within tourism destinations," Journal of Business Research, Elsevier, vol. 139(C), pages 1366-1377.
    36. Allen H. Huang & Reuven Lehavy & Amy Y. Zang & Rong Zheng, 2018. "Analyst Information Discovery and Interpretation Roles: A Topic Modeling Approach," Management Science, INFORMS, vol. 64(6), pages 2833-2855, June.
    37. Fronzetti Colladon, Andrea & Toschi, Laura & Ughetto, Elisa & Greco, Francesca, 2023. "The language and social behavior of innovators," Journal of Business Research, Elsevier, vol. 154(C).
    38. Filieri, Raffaele & Alguezaui, Salma & Galati, Francesco & Raguseo, Elisabetta, 2023. "Customer experience with standard and premium Peer-To-Peer offerings: A mixed-method combining text analytics and qualitative analysis," Journal of Business Research, Elsevier, vol. 167(C).
    39. Wang, Shaoda & Ye, Dezhu & Liao, Junyun, 2024. "Politeness matters: The role of polite languages in online peer-to-peer lending," Journal of Business Research, Elsevier, vol. 171(C).
    40. Venkatesh Shankar & Sohil Parsana, 2022. "An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1324-1350, November.
    41. Mangiò, Federico & Mismetti, Marco & Lissana, Elena & Andreini, Daniela, 2023. "That's the Press, Baby! How journalists co-create family business brands meanings: A mixed method analysis," Journal of Business Research, Elsevier, vol. 161(C).
    42. Lu, Lin & Xu, Pei & Wang, Yen-Yao & Wang, Yu, 2023. "Measuring service quality with text analytics: Considering both importance and performance of consumer opinions on social and non-social online platforms," Journal of Business Research, Elsevier, vol. 169(C).
    43. Martin Reisenbichler & Thomas Reutterer & David A. Schweidel & Daniel Dan, 2022. "Frontiers: Supporting Content Marketing with Natural Language Generation," Marketing Science, INFORMS, vol. 41(3), pages 441-452, May.
    Full references (including those not matched with items on IDEAS)

    Citations

    Citations are extracted by the CitEc Project, subscribe to its RSS feed for this item.
    as


    Cited by:

    1. Thomas, Llewellyn D.W. & Romasanta, Angelo Kenneth G. & Pujol Priego, Laia, 2026. "Jagged competencies: Measuring the reliability of generative AI in academic research," Journal of Business Research, Elsevier, vol. 203(C).
    2. Liu, Zhen-yuan Ralph & Dong, Shuqi Kyra & Zeng, Wenjuan & Wang, Yu-ting & Niu, Dong-fang, 2025. "Exploring the impact of human-centred AI on firms’ social and operational performance: A large language model approach," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 203(C).
    3. Serhii Lehenchuk & Olha Fedorova, 2025. "New Methods for Accounting Research of the Narrative Component of Sustainable Reporting: Challenges and Opportunities," Oblik i finansi, Institute of Accounting and Finance, issue 3, pages 31-40, September.
    4. Wobst, Janice & Lueg, Rainer, 2025. "Avoiding algorithm errors in textual analysis: A guide to selecting software, and a research agenda toward generative artificial intelligence," Journal of Business Research, Elsevier, vol. 199(C).
    5. Al-Sharafi, Mohammed A. & Alzaeemi, Shehab Abdulhabib & Albashrawi, Mousa & Dwivedi, Yogesh K. & Alahmad, Rasha & Chae, Inyoung, 2026. "A longitudinal consumer feedback analytics approach for theory building and testing: Examining consumer satisfaction and loyalty in buy now, pay later (BNPL) platforms," Journal of Retailing and Consumer Services, Elsevier, vol. 88(C).
    6. Stephan Ludwig & Peter J. Danaher & Xiaohao Yang, 2026. "A Neural Topic Method Using a Large-Language-Model-in-the-Loop for Business Research," Papers 2603.03623, arXiv.org.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Venkatesh Shankar & Sohil Parsana, 2022. "An overview and empirical comparison of natural language processing (NLP) models and an introduction to and empirical application of autoencoder models in marketing," Journal of the Academy of Marketing Science, Springer, vol. 50(6), pages 1324-1350, November.
    2. Jan Ole Krugmann & Jochen Hartmann, 2024. "Sentiment Analysis in the Age of Generative AI," Customer Needs and Solutions, Springer;Institute for Sustainable Innovation and Growth (iSIG), vol. 11(1), pages 1-19, December.
    3. Vidaurreta-Apesteguia, Paula & Alzate, Miriam & Arce-Urriza, Marta & Armendáriz-Iñigo, José Enrique & D’Acunto, David, 2025. "When reviews speak through pictures: Visual content and its influence on helpfulness," Journal of Business Research, Elsevier, vol. 199(C).
    4. Grewal, Dhruv & Herhausen, Dennis & Ludwig, Stephan & Villarroel Ordenes, Francisco, 2022. "The Future of Digital Communication Research: Considering Dynamics and Multimodality," Journal of Retailing, Elsevier, vol. 98(2), pages 224-240.
    5. Daniel Baier & Reinhold Decker & Yana Asenova, 2025. "Collecting and Analyzing User-Generated Content for Decision Support in Marketing Management: An Overview of Methods and Use Cases," Schmalenbach Journal of Business Research, Springer, vol. 77(3), pages 419-455, September.
    6. Huang, Ming-Hui & Rust, Roland T., 2022. "A Framework for Collaborative Artificial Intelligence in Marketing," Journal of Retailing, Elsevier, vol. 98(2), pages 209-223.
    7. Hongshen Sun & Juanjuan Zhang, 2025. "From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research," Papers 2512.23184, arXiv.org.
    8. Gensler, Sonja & Oehring, Karlo & Wiesel, Thorsten, 2024. "Reported and communicated shifts in strategic emphasis and firm performance," International Journal of Research in Marketing, Elsevier, vol. 41(2), pages 220-240.
    9. Hartmann, Jochen & Exner, Yannick & Domdey, Samuel, 2025. "The power of generative marketing: Can generative AI create superhuman visual marketing content?," International Journal of Research in Marketing, Elsevier, vol. 42(1), pages 13-31.
    10. Xinyu Liu & Yezheng Liu & Yang Qian & Yuanchun Jiang & Haifeng Ling, 2025. "Learning consumer preferences through textual and visual data: a multi-modal approach," Electronic Commerce Research, Springer, vol. 25(4), pages 2955-2984, August.
    11. Jonah Berger & Grant Packard & Reihane Boghrati & Ming Hsu & Ashlee Humphreys & Andrea Luangrath & Sarah Moore & Gideon Nave & Christopher Olivola & Matthew Rocklage, 2022. "Marketing insights from text analysis," Marketing Letters, Springer, vol. 33(3), pages 365-377, September.
    12. Ming-Hui Huang & Roland T. Rust, 2021. "A strategic framework for artificial intelligence in marketing," Journal of the Academy of Marketing Science, Springer, vol. 49(1), pages 30-50, January.
    13. Christopher Gerling & Stefan Lessmann, 2024. "Leveraging AI and NLP for Bank Marketing: A Systematic Review and Gap Analysis," Papers 2411.14463, arXiv.org.
    14. Zhang, Mingli & Xie, Kesheng & Cai, Shensheng & Wang, Yu, 2025. "How sentiment volatility of influencer product recommendation posts affects customer engagement," Journal of Business Research, Elsevier, vol. 200(C).
    15. Jürgensmeier, Lukas & Skiera, Bernd, 2024. "Generative AI for scalable feedback to multimodal exercises," International Journal of Research in Marketing, Elsevier, vol. 41(3), pages 468-488.
    16. Yoganathan, Vignesh & Osburg, Victoria-Sophie, 2024. "The mind in the machine: Estimating mind perception's effect on user satisfaction with voice-based conversational agents," Journal of Business Research, Elsevier, vol. 175(C).
    17. Xin (Shane) Wang & Neil Bendle & Yinjie Pan, 2024. "Beyond text: Marketing strategy in a world turned upside down," Journal of the Academy of Marketing Science, Springer, vol. 52(4), pages 939-954, July.
    18. Kadić-Maglajlić, Selma & Lages, Cristiana R. & Pantano, Eleonora, 2024. "No time to lie: Examining the identity of pro-vaccination and anti-vaccination supporters through user-generated content," Social Science & Medicine, Elsevier, vol. 347(C).
    19. Madathil, Johnson Clement, 2025. "Generative AI advertisements and Human–AI collaboration: The role of humans as gatekeepers of humanity," Journal of Retailing and Consumer Services, Elsevier, vol. 87(C).
    20. Jiang, Kan & Zheng, Junyuan & Luo, Shaohua, 2024. "Green power of virtual influencer: The role of virtual influencer image, emotional appeal, and product involvement," Journal of Retailing and Consumer Services, Elsevier, vol. 77(C).

    More about this item

    Keywords

    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jbrese:v:198:y:2025:i:c:s0148296325003145. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jbusres .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.