Printed from https://ideas.repec.org/p/nbr/nberwo/34082.html

Can Author Manipulation of AI Referees be Welfare Improving?

Author

Listed:
  • Joshua S. Gans

Abstract

This paper examines a new moral hazard in delegated decision-making: authors can embed hidden instructions—known as prompt injections—to bias AI referees in academic peer review, thereby hijacking machine recommendations. Because AI reviews are relatively inexpensive compared to manual assessments, referees would otherwise delegate fully, which undermines quality. The paper shows that moderate detection of manipulation can paradoxically improve welfare. With intermediate detection probabilities, only low-quality authors undertake manipulation, and detection becomes informative about quality, inducing referees to mix between manual and AI reviews. This partially separating equilibrium preserves the value of peer review when AI quality is intermediate. When detection is too low, all bad papers are manipulated and the market unravels; when detection is perfect, referees use only AI and acceptance collapses. Thus, some prompt injection must be tolerated to sustain the market: it disciplines referees and generates information. The results caution against zero-tolerance enforcement and highlight how prompt injection can, counterintuitively, play a welfare-enhancing role when AI reviews are easily produced.

Suggested Citation

  • Joshua S. Gans, 2025. "Can Author Manipulation of AI Referees be Welfare Improving?," NBER Working Papers 34082, National Bureau of Economic Research, Inc.
  • Handle: RePEc:nbr:nberwo:34082
    Note: PR

    Download full text from publisher

    File URL: http://www.nber.org/papers/w34082.pdf
    Download Restriction: Access to the full text is generally limited to series subscribers; however, if the top-level domain of the client browser is in a developing country or transition economy, free access is provided. More information about subscriptions and free access is available at http://www.nber.org/wwphelp.html. Free access is also available for older working papers.

    As access to this document is restricted, you may want to search for a different version of it.

    More about this item

    JEL classification:

    • D82 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Asymmetric and Private Information; Mechanism Design
    • D86 - Microeconomics - - Information, Knowledge, and Uncertainty - - - Economics of Contract Law
    • O33 - Economic Development, Innovation, Technological Change, and Growth - - Innovation; Research and Development; Technological Change; Intellectual Property Rights - - - Technological Change: Choices and Consequences; Diffusion Processes

