IDEAS home Printed from https://ideas.repec.org/a/eee/jbrese/v203y2026ics0148296325006277.html

Jagged competencies: Measuring the reliability of generative AI in academic research

Author

Listed:
  • Thomas, Llewellyn D.W.
  • Romasanta, Angelo Kenneth G.
  • Pujol Priego, Laia

Abstract

Large Language Models (LLMs) are increasingly viewed as a valuable tool for academic research. While LLMs have some benefits, a ‘crisis of replicability’ in management scholarship mitigates against unrestrained use. In this paper we investigate the reproducibility of LLM analyses. We analyze three LLMs—ChatGPT, Claude and Mistral—over fifteen weeks, testing the consistency, accuracy and their interaction using the same prompts on the same data corpus. While our results demonstrate significant variations in reliability and consistency across the three LLMs, we also show that LLMs can exhibit deterministic and reliable behavior under specific, well-defined constraints. We argue that replicable LLM-based research will rely on understanding and validating the task-specific operational boundaries of the LLM. To ensure the responsible integration of LLMs into management research, we highlight a need for robust frameworks, transparency, ethical guidelines, and ongoing evaluation. We conclude with actionable guidance for management researchers.

Suggested Citation

  • Thomas, Llewellyn D.W. & Romasanta, Angelo Kenneth G. & Pujol Priego, Laia, 2026. "Jagged competencies: Measuring the reliability of generative AI in academic research," Journal of Business Research, Elsevier, vol. 203(C).
  • Handle: RePEc:eee:jbrese:v:203:y:2026:i:c:s0148296325006277
    DOI: 10.1016/j.jbusres.2025.115804
    as

    Download full text from publisher

    File URL: http://www.sciencedirect.com/science/article/pii/S0148296325006277
    Download Restriction: Full text for ScienceDirect subscribers only

    File URL: https://libkey.io/10.1016/j.jbusres.2025.115804?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    As the access to this document is restricted, you may want to

    for a different version of it.

    References listed on IDEAS

    as
    1. Miloš Fišar & Ben Greiner & Christoph Huber & Elena Katok & Ali I. Ozkes, 2024. "Reproducibility in Management Science," Management Science, INFORMS, vol. 70(3), pages 1343-1356, March.
    2. Shaw, Steven D. & Nave, Gideon, 2023. "Don't hate the player, hate the game: Realigning incentive structures to promote robust science and better scientific practices in marketing," Journal of Business Research, Elsevier, vol. 167(C).
    3. Adler, Susanne Jana & Röseler, Lukas & Schöniger, Martina Katharina, 2023. "A toolbox to evaluate the trustworthiness of published findings," Journal of Business Research, Elsevier, vol. 167(C).
    4. Dashkevych, Oleg & Portnov, Boris A., 2024. "How can generative AI help in different parts of research? An experiment study on smart cities’ definitions and characteristics," Technology in Society, Elsevier, vol. 77(C).
    5. Wobst, Janice & Lueg, Rainer, 2025. "Avoiding algorithm errors in textual analysis: A guide to selecting software, and a research agenda toward generative artificial intelligence," Journal of Business Research, Elsevier, vol. 199(C).
    6. Anil R. Doshi & J. Jason Bell & Emil Mirzayev & Bart S. Vanneste, 2025. "Generative artificial intelligence and evaluating strategic decisions," Strategic Management Journal, Wiley Blackwell, vol. 46(3), pages 583-610, March.
    7. Hubbard, Raymond & Lindsay, R. Murray, 2013. "From significant difference to significant sameness: Proposing a paradigm shift in business research," Journal of Business Research, Elsevier, vol. 66(9), pages 1377-1388.
    8. Adler, Susanne Jana & Röseler, Lukas & Schöniger, Martina Katharina, 2023. "A Toolbox to Evaluate the Trustworthiness of Published Findings," OSF Preprints s5mzp, Center for Open Science.
    9. Jörn H. Block & Christian Fisch & Narmeen Kanwal & Solvej Lorenzen & Anna Schulze, 2023. "Replication studies in top management journals: An empirical investigation of prevalence, types, outcomes, and impact," Management Review Quarterly, Springer, vol. 73(3), pages 1109-1134, September.
    10. Olavo B. Amaral & Kleber Neves, 2021. "Reproducibility: expect less of the scientific paper," Nature, Nature, vol. 597(7876), pages 329-331, September.
    11. Easley, Richard W. & Madden, Charles S. & Gray, Van, 2013. "A tale of two cultures: Revisiting journal editors' views of replication research," Journal of Business Research, Elsevier, vol. 66(9), pages 1457-1459.
    12. repec:osf:osfxxx:mydzv_v1 is not listed on IDEAS
    13. repec:osf:osfxxx:s5mzp_v1 is not listed on IDEAS
    14. Justin Frake & Andreas Hagemann & Jose Uribe, 2024. "Collider bias in strategy and management research: An illustration using women CEO's effect on other women's career outcomes," Strategic Management Journal, Wiley Blackwell, vol. 45(7), pages 1393-1419, July.
    15. Barry Babin & Jean-Luc Herrmann & Carmen Lopez & David J. Ortinau, 2021. "Science is about corroborating empirical evidence, even in academic business research journals," Post-Print hal-04948655, HAL.
    16. Herhausen, Dennis & Ludwig, Stephan & Abedin, Ehsan & Haque, Nasim Ul & de Jong, David, 2025. "From words to insights: Text analysis in business research," Journal of Business Research, Elsevier, vol. 198(C).
    17. Easley, Richard W. & Madden, Charles S. & Dunn, Mark G., 2000. "Conducting Marketing Science: The Role of Replication in the Research Process," Journal of Business Research, Elsevier, vol. 48(1), pages 83-92, April.
    18. Ojelanki Ngwenyama & Frantz Rowe, 2024. "Should We Collaborate with AI to Conduct Literature Reviews? Changing Epistemic Values in a Flattening World," Post-Print hal-04820922, HAL.
    19. Ryan, James C. & A Tipu, Syed A., 2022. "Business and management research: Low instances of replication studies and a lack of author independence in replications," Research Policy, Elsevier, vol. 51(1).
    20. Mina Bissell, 2013. "Reproducibility: The risks of the replication drive," Nature, Nature, vol. 503(7476), pages 333-334, November.
    21. Francesco Ferrati & Phillip H. Kim & Moreno Muffatto, 2024. "Generative AI in Entrepreneurship Research: Principles and Practical Guidance for Intelligence Augmentation," Foundations and Trends(R) in Entrepreneurship, now publishers, vol. 20(3), pages 245-383, July.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Chai, Daniel & Ali, Searat & Brosnan, Mark & Hasso, Tim, 2024. "Understanding researchers' perceptions and experiences in finance research replication studies: A pre-registered report," Pacific-Basin Finance Journal, Elsevier, vol. 86(C).
    2. Francisco J. Conejo & Lawrence F. Cunningham & Clifford E. Young, 2020. "Revisiting the Brand Luxury Index: new empirical evidence and future directions," Journal of Brand Management, Palgrave Macmillan, vol. 27(1), pages 108-122, January.
    3. Ryan, James C. & A Tipu, Syed A., 2022. "Business and management research: Low instances of replication studies and a lack of author independence in replications," Research Policy, Elsevier, vol. 51(1).
    4. Jun-Hwa Cheah (Jacky) & Francesca Magno & Fabio Cassia, 2024. "Reviewing the SmartPLS 4 software: the latest features and enhancements," Journal of Marketing Analytics, Palgrave Macmillan, vol. 12(1), pages 97-107, March.
    5. Sadok El Ghoul & Omrane Guedhami & Robert Nash & Ajay Patel, 2019. "New Evidence on the Role of the Media in Corporate Social Responsibility," Journal of Business Ethics, Springer, vol. 154(4), pages 1051-1079, February.
    6. Magno, Francesca & Cassia, Fabio, 2024. "Predicting restaurants’ surplus food platform continuance: Insights from the combined use of PLS-SEM and NCA and predictive model comparisons," Journal of Retailing and Consumer Services, Elsevier, vol. 79(C).
    7. Juquelier, Antoine & Poncin, Ingrid & Hazée, Simon, 2025. "Empathic chatbots: A double-edged sword in customer experiences," Journal of Business Research, Elsevier, vol. 188(C).
    8. Brinkerink, Jasper & De Massis, Alfredo & Kellermanns, Franz, 2022. "One finding is no finding: Toward a replication culture in family business research," Journal of Family Business Strategy, Elsevier, vol. 13(4).
    9. Liu, Zhen-yuan Ralph & Dong, Shuqi Kyra & Zeng, Wenjuan & Wang, Yu-ting & Niu, Dong-fang, 2025. "Exploring the impact of human-centred AI on firms’ social and operational performance: A large language model approach," Transportation Research Part E: Logistics and Transportation Review, Elsevier, vol. 203(C).
    10. Meikel Soliman & Marko Sarstedt & Susanne J. Adler & Doreen Siegfried & Oliver Genschow & Monika Imschloss, 2025. "A Tale of Open Science: Emergence of a New Normal," Schmalenbach Journal of Business Research, Springer, vol. 77(4), pages 763-792, December.
    11. Rakesh Sambharya & Martina Musteen, 2014. "Institutional environment and entrepreneurship: An empirical study across countries," Journal of International Entrepreneurship, Springer, vol. 12(4), pages 314-330, December.
    12. Anna Kusetogullari & Huseyin Kusetogullari & Martin Andersson & Tony Gorschek, 2025. "GenAI in Entrepreneurship: a systematic review of generative artificial intelligence in entrepreneurship research: current issues and future directions," Papers 2505.05523, arXiv.org.
    13. Hensel, Przemysław G., 2019. "Supporting replication research in management journals: Qualitative analysis of editorials published between 1970 and 2015," European Management Journal, Elsevier, vol. 37(1), pages 45-57.
    14. Rahal, Rima-Maria, 2025. "Advancing openness in economic research through the lens of behavioral and experimental economics," Journal of Behavioral and Experimental Economics (formerly The Journal of Socio-Economics), Elsevier, vol. 119(C).
    15. Hussain, Walayat & Merigó, José M. & Rahimi, Iman & Lev, Benjamin, 2025. "Half a century of Omega – The International Journal of Management Science: A bibliometric analysis," Omega, Elsevier, vol. 133(C).
    16. Kapferer, Jean-Noël & Valette-Florence, Pierre, 2019. "How self-success drives luxury demand: An integrated model of luxury growth and country comparisons," Journal of Business Research, Elsevier, vol. 102(C), pages 273-287.
    17. Kenworthy, Thomas P. & Sparks, John R., 2016. "A scientific realism perspective on scientific progress in marketing: An analysis of theory testing in marketing's major journals," European Management Journal, Elsevier, vol. 34(5), pages 466-474.
    18. James, Victoria K., 2010. "A socio-cultural approach to exploring consumer boycott intelligence: A commentary essay," Journal of Business Research, Elsevier, vol. 63(4), pages 363-365, April.
    19. Franck Biétry & Anne-Laure Gatignon-Turnau & David Giauque & Silvester Ivanaj & Alain Lacroux & Nicolas Raineri & Elen Riot & David M. Wasieleski, 2024. "Editorial - RIPCO: Between Continuity and Disruption(s)," Post-Print hal-04659242, HAL.
    20. Angela Altmeier & Christian Fisch, 2025. "To syndicate or not: a replication and extension about the influence of business angels’ personality traits on syndication using Twitter data," Management Review Quarterly, Springer, vol. 75(2), pages 1357-1391, June.

    More about this item

    Keywords

    ;
    ;
    ;
    ;
    ;
    ;

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:eee:jbrese:v:203:y:2026:i:c:s0148296325006277. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: Catherine Liu (email available below). General contact details of provider: http://www.elsevier.com/locate/jbusres .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.