Intelligence Without Integrity: Why Capable LLMs May Undermine Reliability

Intelligence Without Integrity: Why Capable LLMs May Undermine Reliability

Author

Listed:

Ryan Allen
Aticus Peterson

Abstract

As LLMs become embedded in research workflows and organizational decision processes, their effect on analytical reliability remains uncertain. We distinguish two dimensions of analytical reliability -- intelligence (the capacity to reach correct conclusions) and integrity (the stability of conclusions when analytically irrelevant cues about desired outcomes are introduced) -- and ask whether frontier LLMs possess both. Whether these dimensions trade off is theoretically ambiguous: the sophistication enabling accurate analysis may also enable responsiveness to non-evidential cues, or alternatively, greater capability may confer protection through better calibration and discernment. Using synthetically generated data with embedded ground truth, we evaluate fourteen models on a task simulating empirical analysis of hospital merger effects. We find that intelligence and integrity trade off: frontier models most likely to reach correct conclusions under neutral conditions are often most susceptible to shifting conclusions under motivated framing. We extend work on sycophancy by introducing goal-conditioned analytical sycophancy: sensitivity of inference to cues about desired outcomes, even when no belief is asserted and evidence is held constant. Unlike simple prompt sensitivity, models shift conclusions away from objective evidence in response to analytically irrelevant framing. This finding has important implications for empirical research and organizations. Selecting tools based on capability benchmarks may inadvertently select against the stability needed for reliable and replicable analysis.

Suggested Citation

Ryan Allen & Aticus Peterson, 2026. "Intelligence Without Integrity: Why Capable LLMs May Undermine Reliability," Papers 2602.20440, arXiv.org, revised Feb 2026.

Handle: RePEc:arx:papers:2602.20440

Download full text from publisher

References listed on IDEAS

Herbert A. Simon, 1955. "A Behavioral Model of Rational Choice," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 69(1), pages 99-118.
Ashish Arora & Michelle Gittelman & Sarah Kaplan & John Lynch & Will Mitchell & Nicolaj Siggelkow & Brent Goldfarb & Andrew A. King, 2016. "Scientific apophenia in strategic management research: Significance tests & mistaken inference," Strategic Management Journal, Wiley Blackwell, vol. 37(1), pages 167-176, January.
Jonathan Roth, 2022. "Pretest with Caution: Event-Study Estimates after Testing for Parallel Trends," American Economic Review: Insights, American Economic Association, vol. 4(3), pages 305-322, September.
Ashesh Rambachan & Jonathan Roth, 2023. "A More Credible Approach to Parallel Trends," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 90(5), pages 2555-2591.
King, Gary & Nielsen, Richard, 2019. "Why Propensity Scores Should Not Be Used for Matching," Political Analysis, Cambridge University Press, vol. 27(4), pages 435-454, October.
Felipe A. Csaszar & Aticus Peterson & Daniel Wilde, 2026. "The Strategic Foresight of LLMs: Evidence from a Fully Prospective Venture Tournament," Papers 2602.01684, arXiv.org.
Klaus E Meyer & Arjen Witteloostuijn & Sjoerd Beugelsdijk, 2017. "What’s in a p? Reassessing best practices for conducting and reporting hypothesis-testing research," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 48(5), pages 535-551, July.
Sandeep Devanatha Pillai & Brent Goldfarb & David Kirsch, 2024. "Lovely and likely: Using historical methods to improve inference to the best explanation in strategy," Strategic Management Journal, Wiley Blackwell, vol. 45(8), pages 1539-1566, August.
Miguel, E & Camerer, C & Casey, K & Cohen, J & Esterling, KM & Gerber, A & Glennerster, R & Green, DP & Humphreys, M & Imbens, G & Laitin, D & Madon, T & Nelson, L & Nosek, BA & Petersen, M & Sedlmayr, 2014. "Promoting Transparency in Social Science Research," Department of Economics, Working Paper Series qt0wt4q2q8, Department of Economics, Institute for Business and Economic Research, UC Berkeley.
Richard A. Bettis, 2012. "The search for asterisks: Compromised statistical tests and flawed theories," Strategic Management Journal, Wiley Blackwell, vol. 33(1), pages 108-113, January.
Prithwiraj Choudhury & Evan Starr & Rajshree Agarwal, 2020. "Machine learning and human capital complementarities: Experimental evidence on bias mitigation," Strategic Management Journal, Wiley Blackwell, vol. 41(8), pages 1381-1411, August.
Sarah Lebovitz & Hila Lifshitz-Assaf & Natalia Levina, 2022. "To Engage or Not to Engage with AI for Critical Judgments: How Professionals Deal with Opacity When Using AI for Medical Diagnosis," Organization Science, INFORMS, vol. 33(1), pages 126-148, January.

Full references (including those not matched with items on IDEAS)

Most related items

These are the items that most often cite the same works as this one and are cited by the same works as this one.

Arne Henningsen & Guy Low & David Wuepper & Tobias Dalhaus & Hugo Storm & Dagim Belay & Stefan Hirsch, 2024. "Estimating Causal Effects with Observational Data: Guidelines for Agricultural and Applied Economists," IFRO Working Paper 2024/03, University of Copenhagen, Department of Food and Resource Economics.
- Arne Henningsen & Guy Low & David Wuepper & Tobias Dalhaus & Hugo Storm & Dagim Belay & Stefan Hirsch, 2025. "Estimating Causal Effects with Observational Data: Guidelines for Agricultural and Applied Economists," Papers 2508.02310, arXiv.org.
Rottner, Elisa, 2023. "Do climate policies lead to outsourcing? Evidence from firm-level imports," ZEW Discussion Papers 23-070, ZEW - Leibniz Centre for European Economic Research.
Anderson, Brian S. & Wennberg, Karl & McMullen, Jeffery S., 2019. "Editorial: Enhancing quantitative theory-testing entrepreneurship research," Journal of Business Venturing, Elsevier, vol. 34(5), pages 1-1.
Wennberg, Karl & Anderson, Brian S. & McMullen, Jeffrey, 2019. "2 Editorial: Enhancing Quantitative Theory-Testing Entrepreneurship Research," Ratio Working Papers 323, The Ratio Institute.
Boehme, Hunter M. & Tregle, Brandon & Olson, Marc & Fulmer, Cannon, 2025. "Leveraging code enforcement units to reduce crime: A difference-in-difference analysis of a targeted crime prevention intervention," Journal of Criminal Justice, Elsevier, vol. 99(C).
Anderson, Brian S, 2018. "Endogeneity and Entrepreneurship Research," OSF Preprints 75tn8, Center for Open Science.
repec:osf:osfxxx:75tn8_v1 is not listed on IDEAS
Tanner Jones & Ryan Quandt, 2025. "An iridescent sunset: An empirical analysis of sunset legislation," Journal of Regulatory Economics, Springer, vol. 68(2), pages 85-123, October.
Boungou, Whelsy & Dufau, Bastien, 2025. "Shareholder wealth effects of corporate sustainability reporting regulations," Finance Research Letters, Elsevier, vol. 80(C).
Omry Yoresh & Weijian Zou, 2026. "The lifecycle of judicial bias," IFS Working Papers W26/12, Institute for Fiscal Studies.
Mark D. Packard & Per L. Bylund, 2025. "Truth, knowledge, and entrepreneurship theory: arguments for a rationalist scientific epistemology," Small Business Economics, Springer, vol. 65(1), pages 381-405, June.
Prem, Mounu & Purroy, Miguel E. & Vargas, Juan F., 2025. "Landmines: The local effects of demining," Journal of Public Economics, Elsevier, vol. 247(C).
- Mounu Prem & Miguel Purroy & Juan F. Vargas, 2021. "Landmines: The local effects of demining," Documentos de Trabajo 19733, The Latin American and Caribbean Economic Association (LACEA).
- Prem, Mounu & Purroy, Miguel E. & Vargas, Juan F., 2021. "Landmines: The Local Effects of Demining," SocArXiv 3jzk6, Center for Open Science.
- Prem, Mounu & Purroy, Miguel E. & Vargas, Juan F., 2022. "Landmines: the Local Effects of Demining," IAST Working Papers 22-132, Institute for Advanced Study in Toulouse (IAST).
- M Prem & M. E. Purroy & J. F. Vargas, 2021. "Landmines: the local effects of demining," Documentos de Trabajo 19588, Universidad del Rosario.
- Prem, Mounu & Purroy, Miguel E. & Vargas, Juan F., 2022. "Landmines: the Local Effects of Demining," TSE Working Papers 22-1305, Toulouse School of Economics (TSE).
- Mounu Prem & Juan Vargas & Miguel E. Purroy, 2021. "Landmines: The Local Effects of Demining," Empirical Studies of Conflict Project (ESOC) Working Papers 28, Empirical Studies of Conflict Project.
- Prem, Mounu & Purroy, Miguel & Vargas, Juan F., 2024. "Landmines: The local effects of demining," CEPR Discussion Papers 18975, C.E.P.R. Discussion Papers.
- Mounu Prem & Miguel E. Purroy & Juan F. Vargas, 2021. "Landmines: The Local Effects of Demining," HiCN Working Papers 360, Households in Conflict Network.
- Prem, Mounu & Purroy, Miguel E & Vargas, Juan F., 2021. "Landmines: The Local Effects of Demining," Working papers 86, Red Investigadores de Economía.
Sandeep Devanatha Pillai & Brent Goldfarb & David Kirsch, 2024. "Lovely and likely: Using historical methods to improve inference to the best explanation in strategy," Strategic Management Journal, Wiley Blackwell, vol. 45(8), pages 1539-1566, August.
Lin, Pengsheng & Pan, Yinghao & Wang, Yuan & Hu, Longhai, 2024. "Reshaping unfairness perceptions: Evidence from China's Hukou reform," China Economic Review, Elsevier, vol. 83(C).
Ebersberger, Bernd & Galia, Fabrice & Laursen, Keld & Salter, Ammon, 2021. "Inbound Open Innovation and Innovation Performance: A Robustness Study," Research Policy, Elsevier, vol. 50(7).
Christian Alemán-Pericón & Alexander Ludwig & Christopher Busch & Raül Santaeulàlia-Llopis, 2022. "A Stage-Based Identification of Policy Effects," Working Papers 1369, Barcelona School of Economics.
- Alemán, Christian & Busch, Christopher & Ludwig, Alexander & Santaeulàlia-Llopis, Raül, 2023. "Stage-based identification of policy effects," ICIR Working Paper Series 52/23, Goethe University Frankfurt, International Center for Insurance Regulation (ICIR).
- Christian Alemán & Christopher Busch & Alexander Ludwig & Raül Santaeulàlia-Llopis, 2023. "Stage-Based Identification of Policy Effects," CESifo Working Paper Series 10722, CESifo.
- Christian AlemÃ¡n & Christopher Busch & Alexander Ludwig & RaÃ¼l SantaeulÃ lia-Llopis, 2023. "Stage-Based Identification of Policy Effects," Working Papers 2023-026, Human Capital and Economic Opportunity Working Group.
- Christian Aleman & Christopher Busch & Alexander Ludwig & Raul Santaeulalia-Llopis, 2022. "A Stage-Based Identification of Policy Effects," PIER Working Paper Archive 22-026, Penn Institute for Economic Research, Department of Economics, University of Pennsylvania.
- Alemán, Christian & Busch, Christopher & Ludwig, Alexander & Santaeulàlia-Llopis, Raül, 2023. "Stage-based identification of policy effects," SAFE Working Paper Series 404, Leibniz Institute for Financial Research SAFE.
- Aleman, Christian & Busch, Christopher & Ludwig, Alexander & SantaeulÃ lia-Llopis, RaÃ¼l, 2023. "Stage Based Identification of Policy Effects," CEPR Discussion Papers 18681, C.E.P.R. Discussion Papers.
Jiang, Hui & Peng, Cheng & Ren, Daling, 2024. "Supply-chain finance digitalization and corporate financial fraud: Evidence from China," Economic Modelling, Elsevier, vol. 139(C).
Kim, Dongin & Steinbach, Sandro & Zurita, Carlos, 2024. "Deep trade agreements and agri-food global value chain integration," Food Policy, Elsevier, vol. 127(C).
Yesol Huh & Matthew Kling, 2025. "Parallel Trends Forest: Data-Driven Control Sample Selection in Difference-in-Differences," Finance and Economics Discussion Series 2025-091, Board of Governors of the Federal Reserve System (U.S.).
Emmott, Emily H. & Ihara, Yasuo & Tokumasu, Yudai & Nozaki, Mari & Saito, Atsuko & Kawamoto, Tetsuya & Ito, Shingo & Hassan, Anushé & Brown, Laura J. & Dennett, Thea & Crane, Andrew & Borra, Catherine, 2025. "Adolescence as a key period of identity development and connectedness: a comparative autophotography study in England and Japan," LSE Research Online Documents on Economics 129137, London School of Economics and Political Science, LSE Library.
Federico N Daverio-Occhini & María Montoya-Aguirre & Máximo Ponce de León & L Guillermo Woo-Mora, 2024. "Moral Force: Leaders' Actions and Public Health Compliance in Crisis," PSE Working Papers halshs-04721932, HAL.

More about this item

Statistics

Access and download statistics

Corrections

All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2602.20440. See general information about how to correct material in RePEc.

If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

Please note that corrections may take a couple of weeks to filter through the various RePEc services.

IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.

Browse Econ Literature

More features

Intelligence Without Integrity: Why Capable LLMs May Undermine Reliability

Author

Abstract

Suggested Citation

Download full text from publisher

References listed on IDEAS

Most related items

More about this item

Statistics

Corrections

More services and features

MyIDEAS

Author registration

Rankings

RePEc Genealogy

RePEc Biblio

MPRA

New papers by email

EconAcademics

Plagiarism

About RePEc

RePEc home

Blog

Help/FAQ

RePEc team

Participating archives

Privacy statement

Help us

Corrections

Volunteers

Get papers listed

Open a RePEc archive

Get RePEc data