IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2602.20440.html

Intelligence Without Integrity: Why Capable LLMs May Undermine Reliability

Author

Listed:
  • Ryan Allen
  • Aticus Peterson

Abstract

As LLMs become embedded in research workflows and organizational decision processes, their effect on analytical reliability remains uncertain. We distinguish two dimensions of analytical reliability -- intelligence (the capacity to reach correct conclusions) and integrity (the stability of conclusions when analytically irrelevant cues about desired outcomes are introduced) -- and ask whether frontier LLMs possess both. Whether these dimensions trade off is theoretically ambiguous: the sophistication enabling accurate analysis may also enable responsiveness to non-evidential cues, or alternatively, greater capability may confer protection through better calibration and discernment. Using synthetically generated data with embedded ground truth, we evaluate fourteen models on a task simulating empirical analysis of hospital merger effects. We find that intelligence and integrity trade off: frontier models most likely to reach correct conclusions under neutral conditions are often most susceptible to shifting conclusions under motivated framing. We extend work on sycophancy by introducing goal-conditioned analytical sycophancy: sensitivity of inference to cues about desired outcomes, even when no belief is asserted and evidence is held constant. Unlike simple prompt sensitivity, models shift conclusions away from objective evidence in response to analytically irrelevant framing. This finding has important implications for empirical research and organizations. Selecting tools based on capability benchmarks may inadvertently select against the stability needed for reliable and replicable analysis.

Suggested Citation

  • Ryan Allen & Aticus Peterson, 2026. "Intelligence Without Integrity: Why Capable LLMs May Undermine Reliability," Papers 2602.20440, arXiv.org, revised Feb 2026.
  • Handle: RePEc:arx:papers:2602.20440
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2602.20440
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Herbert A. Simon, 1955. "A Behavioral Model of Rational Choice," The Quarterly Journal of Economics, President and Fellows of Harvard College, vol. 69(1), pages 99-118.
    2. Ashish Arora & Michelle Gittelman & Sarah Kaplan & John Lynch & Will Mitchell & Nicolaj Siggelkow & Brent Goldfarb & Andrew A. King, 2016. "Scientific apophenia in strategic management research: Significance tests & mistaken inference," Strategic Management Journal, Wiley Blackwell, vol. 37(1), pages 167-176, January.
    3. Jonathan Roth, 2022. "Pretest with Caution: Event-Study Estimates after Testing for Parallel Trends," American Economic Review: Insights, American Economic Association, vol. 4(3), pages 305-322, September.
    4. Ashesh Rambachan & Jonathan Roth, 2023. "A More Credible Approach to Parallel Trends," The Review of Economic Studies, Review of Economic Studies Ltd, vol. 90(5), pages 2555-2591.
    5. King, Gary & Nielsen, Richard, 2019. "Why Propensity Scores Should Not Be Used for Matching," Political Analysis, Cambridge University Press, vol. 27(4), pages 435-454, October.
    6. Felipe A. Csaszar & Aticus Peterson & Daniel Wilde, 2026. "The Strategic Foresight of LLMs: Evidence from a Fully Prospective Venture Tournament," Papers 2602.01684, arXiv.org.
    7. Klaus E Meyer & Arjen Witteloostuijn & Sjoerd Beugelsdijk, 2017. "What’s in a p? Reassessing best practices for conducting and reporting hypothesis-testing research," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 48(5), pages 535-551, July.
    8. Sandeep Devanatha Pillai & Brent Goldfarb & David Kirsch, 2024. "Lovely and likely: Using historical methods to improve inference to the best explanation in strategy," Strategic Management Journal, Wiley Blackwell, vol. 45(8), pages 1539-1566, August.
    9. Miguel, E & Camerer, C & Casey, K & Cohen, J & Esterling, KM & Gerber, A & Glennerster, R & Green, DP & Humphreys, M & Imbens, G & Laitin, D & Madon, T & Nelson, L & Nosek, BA & Petersen, M & Sedlmayr, 2014. "Promoting Transparency in Social Science Research," Department of Economics, Working Paper Series qt0wt4q2q8, Department of Economics, Institute for Business and Economic Research, UC Berkeley.
    10. Richard A. Bettis, 2012. "The search for asterisks: Compromised statistical tests and flawed theories," Strategic Management Journal, Wiley Blackwell, vol. 33(1), pages 108-113, January.
    11. Prithwiraj Choudhury & Evan Starr & Rajshree Agarwal, 2020. "Machine learning and human capital complementarities: Experimental evidence on bias mitigation," Strategic Management Journal, Wiley Blackwell, vol. 41(8), pages 1381-1411, August.
    12. Sarah Lebovitz & Hila Lifshitz-Assaf & Natalia Levina, 2022. "To Engage or Not to Engage with AI for Critical Judgments: How Professionals Deal with Opacity When Using AI for Medical Diagnosis," Organization Science, INFORMS, vol. 33(1), pages 126-148, January.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Arne Henningsen & Guy Low & David Wuepper & Tobias Dalhaus & Hugo Storm & Dagim Belay & Stefan Hirsch, 2024. "Estimating Causal Effects with Observational Data: Guidelines for Agricultural and Applied Economists," IFRO Working Paper 2024/03, University of Copenhagen, Department of Food and Resource Economics.
    2. Rottner, Elisa, 2023. "Do climate policies lead to outsourcing? Evidence from firm-level imports," ZEW Discussion Papers 23-070, ZEW - Leibniz Centre for European Economic Research.
    3. Anderson, Brian S. & Wennberg, Karl & McMullen, Jeffery S., 2019. "Editorial: Enhancing quantitative theory-testing entrepreneurship research," Journal of Business Venturing, Elsevier, vol. 34(5), pages 1-1.
    4. Wennberg, Karl & Anderson, Brian S. & McMullen, Jeffrey, 2019. "2 Editorial: Enhancing Quantitative Theory-Testing Entrepreneurship Research," Ratio Working Papers 323, The Ratio Institute.
    5. Boehme, Hunter M. & Tregle, Brandon & Olson, Marc & Fulmer, Cannon, 2025. "Leveraging code enforcement units to reduce crime: A difference-in-difference analysis of a targeted crime prevention intervention," Journal of Criminal Justice, Elsevier, vol. 99(C).
    6. Anderson, Brian S, 2018. "Endogeneity and Entrepreneurship Research," OSF Preprints 75tn8, Center for Open Science.
    7. repec:osf:osfxxx:75tn8_v1 is not listed on IDEAS
    8. Tanner Jones & Ryan Quandt, 2025. "An iridescent sunset: An empirical analysis of sunset legislation," Journal of Regulatory Economics, Springer, vol. 68(2), pages 85-123, October.
    9. Boungou, Whelsy & Dufau, Bastien, 2025. "Shareholder wealth effects of corporate sustainability reporting regulations," Finance Research Letters, Elsevier, vol. 80(C).
    10. Omry Yoresh & Weijian Zou, 2026. "The lifecycle of judicial bias," IFS Working Papers W26/12, Institute for Fiscal Studies.
    11. Mark D. Packard & Per L. Bylund, 2025. "Truth, knowledge, and entrepreneurship theory: arguments for a rationalist scientific epistemology," Small Business Economics, Springer, vol. 65(1), pages 381-405, June.
    12. Prem, Mounu & Purroy, Miguel E. & Vargas, Juan F., 2025. "Landmines: The local effects of demining," Journal of Public Economics, Elsevier, vol. 247(C).
    13. Sandeep Devanatha Pillai & Brent Goldfarb & David Kirsch, 2024. "Lovely and likely: Using historical methods to improve inference to the best explanation in strategy," Strategic Management Journal, Wiley Blackwell, vol. 45(8), pages 1539-1566, August.
    14. Lin, Pengsheng & Pan, Yinghao & Wang, Yuan & Hu, Longhai, 2024. "Reshaping unfairness perceptions: Evidence from China's Hukou reform," China Economic Review, Elsevier, vol. 83(C).
    15. Ebersberger, Bernd & Galia, Fabrice & Laursen, Keld & Salter, Ammon, 2021. "Inbound Open Innovation and Innovation Performance: A Robustness Study," Research Policy, Elsevier, vol. 50(7).
    16. Christian Alemán-Pericón & Alexander Ludwig & Christopher Busch & Raül Santaeulàlia-Llopis, 2022. "A Stage-Based Identification of Policy Effects," Working Papers 1369, Barcelona School of Economics.
    17. Jiang, Hui & Peng, Cheng & Ren, Daling, 2024. "Supply-chain finance digitalization and corporate financial fraud: Evidence from China," Economic Modelling, Elsevier, vol. 139(C).
    18. Kim, Dongin & Steinbach, Sandro & Zurita, Carlos, 2024. "Deep trade agreements and agri-food global value chain integration," Food Policy, Elsevier, vol. 127(C).
    19. Yesol Huh & Matthew Kling, 2025. "Parallel Trends Forest: Data-Driven Control Sample Selection in Difference-in-Differences," Finance and Economics Discussion Series 2025-091, Board of Governors of the Federal Reserve System (U.S.).
    20. Emmott, Emily H. & Ihara, Yasuo & Tokumasu, Yudai & Nozaki, Mari & Saito, Atsuko & Kawamoto, Tetsuya & Ito, Shingo & Hassan, Anushé & Brown, Laura J. & Dennett, Thea & Crane, Andrew & Borra, Catherine, 2025. "Adolescence as a key period of identity development and connectedness: a comparative autophotography study in England and Japan," LSE Research Online Documents on Economics 129137, London School of Economics and Political Science, LSE Library.
    21. Federico N Daverio-Occhini & María Montoya-Aguirre & Máximo Ponce de León & L Guillermo Woo-Mora, 2024. "Moral Force: Leaders' Actions and Public Health Compliance in Crisis," PSE Working Papers halshs-04721932, HAL.

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2602.20440. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.