Author
Listed:
- Nils Köbis
(Universität Duisburg-Essen = University of Duisburg-Essen [Essen], Max Planck Institute for Human Development - Max-Planck-Gesellschaft)
- Zoe Rahwan
(Max Planck Institute for Human Development - Max-Planck-Gesellschaft)
- Raluca Rilla
(Max Planck Institute for Human Development - Max-Planck-Gesellschaft)
- Bramantyo Ibrahim Supriyatno
(Max Planck Institute for Human Development - Max-Planck-Gesellschaft)
- Clara Bersch
(Max Planck Institute for Human Development - Max-Planck-Gesellschaft)
- Tamer Ajaj
(Max Planck Institute for Human Development - Max-Planck-Gesellschaft)
- Jean-François Bonnefon
(TSE-R - Toulouse School of Economics - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - EHESS - École des hautes études en sciences sociales - CNRS - Centre National de la Recherche Scientifique - INRAE - Institut National de Recherche pour l’Agriculture, l’Alimentation et l’Environnement, TSM - Toulouse School of Management Research - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse - CNRS - Centre National de la Recherche Scientifique - TSM - Toulouse School of Management - UT Capitole - Université Toulouse Capitole - Comue de Toulouse - Communauté d'universités et établissements de Toulouse, CNRS - Centre National de la Recherche Scientifique)
- Iyad Rahwan
(Max Planck Institute for Human Development - Max-Planck-Gesellschaft)
Abstract
Although artificial intelligence enables productivity gains from delegating tasks to machines1, it may facilitate the delegation of unethical behaviour2. This risk is highly relevant amid the rapid rise of ‘agentic' artificial intelligence systems3,4. Here we demonstrate this risk by having human principals instruct machine agents to perform tasks with incentives to cheat. Requests for cheating increased when principals could induce machine dishonesty without telling the machine precisely what to do, through supervised learning or high-level goal setting. These effects held whether delegation was voluntary or mandatory. We also examined delegation via natural language to large language models5. Although the cheating requests by principals were not always higher for machine agents than for human agents, compliance diverged sharply: machines were far more likely than human agents to carry out fully unethical instructions. This compliance could be curbed, but usually not eliminated, with the injection of prohibitive, task-specific guardrails. Our results highlight ethical risks in the context of increasingly accessible and powerful machine delegation, and suggest design and policy strategies to mitigate them.
Suggested Citation
Nils Köbis & Zoe Rahwan & Raluca Rilla & Bramantyo Ibrahim Supriyatno & Clara Bersch & Tamer Ajaj & Jean-François Bonnefon & Iyad Rahwan, 2025.
"Delegation to Artificial Intelligence can increase dishonest behaviour,"
Post-Print
hal-05277822, HAL.
Handle:
RePEc:hal:journl:hal-05277822
DOI: 10.1038/s41586-025-09505-x
Download full text from publisher
To our knowledge, this item is not available for
download. To find whether it is available, there are three
options:
1. Check below whether another version of this item is available online.
2. Check on the provider's
web page
whether it is in fact available.
3. Perform a
for a similarly titled item that would be
available.
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:hal:journl:hal-05277822. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: CCSD (email available below). General contact details of provider: https://hal.archives-ouvertes.fr/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.