IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2503.20848.html
   My bibliography  Save this paper

The Backfiring Effect of Weak AI Safety Regulation

Author

Listed:
  • Benjamin Laufer
  • Jon Kleinberg
  • Hoda Heidari

Abstract

Recent policy proposals aim to improve the safety of general-purpose AI, but there is little understanding of the efficacy of different regulatory approaches to AI safety. We present a strategic model that explores the interactions between safety regulation, the general-purpose AI creators, and domain specialists--those who adapt the technology for specific applications. Our analysis examines how different regulatory measures, targeting different parts of the AI development chain, affect the outcome of this game. In particular, we assume AI technology is characterized by two key attributes: safety and performance. The regulator first sets a minimum safety standard that applies to one or both players, with strict penalties for non-compliance. The general-purpose creator then invests in the technology, establishing its initial safety and performance levels. Next, domain specialists refine the AI for their specific use cases, updating the safety and performance levels and taking the product to market. The resulting revenue is then distributed between the specialist and generalist through a revenue-sharing parameter. Our analysis reveals two key insights: First, weak safety regulation imposed predominantly on domain specialists can backfire. While it might seem logical to regulate AI use cases, our analysis shows that weak regulations targeting domain specialists alone can unintentionally reduce safety. This effect persists across a wide range of settings. Second, in sharp contrast to the previous finding, we observe that stronger, well-placed regulation can in fact mutually benefit all players subjected to it. When regulators impose appropriate safety standards on both general-purpose AI creators and domain specialists, the regulation functions as a commitment device, leading to safety and performance gains, surpassing what is achieved under no regulation or regulating one player alone.

Suggested Citation

  • Benjamin Laufer & Jon Kleinberg & Hoda Heidari, 2025. "The Backfiring Effect of Weak AI Safety Regulation," Papers 2503.20848, arXiv.org, revised Jun 2025.
  • Handle: RePEc:arx:papers:2503.20848
    as

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2503.20848
    File Function: Latest version
    Download Restriction: no
    ---><---

    References listed on IDEAS

    as
    1. Kalai, Ehud & Smorodinsky, Meir, 1975. "Other Solutions to Nash's Bargaining Problem," Econometrica, Econometric Society, vol. 43(3), pages 513-518, May.
    2. Viscusi, W Kip & Moore, Michael J, 1993. "Product Liability, Research and Development, and Innovation," Journal of Political Economy, University of Chicago Press, vol. 101(1), pages 161-184, February.
    3. Tyna Eloundou & Sam Manning & Pamela Mishkin & Daniel Rock, 2023. "GPTs are GPTs: An Early Look at the Labor Market Impact Potential of Large Language Models," Papers 2303.10130, arXiv.org, revised Aug 2023.
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Benjamin Laufer & Jon Kleinberg & Hoda Heidari, 2023. "Fine-Tuning Games: Bargaining and Adaptation for General-Purpose Models," Papers 2308.04399, arXiv.org, revised Dec 2024.
    2. Alberto Galasso & Hong Luo, 2018. "Punishing Robots: Issues in the Economics of Tort Liability and Innovation in Artificial Intelligence," NBER Chapters, in: The Economics of Artificial Intelligence: An Agenda, pages 493-504, National Bureau of Economic Research, Inc.
    3. Omer F. Baris, 2018. "Timing effect in bargaining and ex ante efficiency of the relative utilitarian solution," Theory and Decision, Springer, vol. 84(4), pages 547-556, June.
    4. Bergantiños, Gustavo & Moreno-Ternero, Juan D., 2022. "Monotonicity in sharing the revenues from broadcasting sports leagues," European Journal of Operational Research, Elsevier, vol. 297(1), pages 338-346.
    5. Etienne Pfister & Bruno Deffains & Myriam Doriat-Duban & Stéphane Saussier, 2006. "Institutions and contracts: Franchising," European Journal of Law and Economics, Springer, vol. 21(1), pages 53-78, January.
    6. Yoshihara, Naoki, 2003. "Characterizations of bargaining solutions in production economies with unequal skills," Journal of Economic Theory, Elsevier, vol. 108(2), pages 256-285, February.
    7. Roma Paolo & Perrone Giovanni, 2010. "Generic Advertising, Brand Advertising and Price Competition: An Analysis of Free-Riding Effects and Coordination Mechanisms," Review of Marketing Science, De Gruyter, vol. 8(1), pages 1-29, October.
    8. Jamie Fairbrother & Konstantinos G. Zografos & Kevin D. Glazebrook, 2020. "A Slot-Scheduling Mechanism at Congested Airports that Incorporates Efficiency, Fairness, and Airline Preferences," Transportation Science, INFORMS, vol. 54(1), pages 115-138, January.
    9. Christoph Riedl & Eric Bogert, 2024. "Effects of AI Feedback on Learning, the Skill Gap, and Intellectual Diversity," Papers 2409.18660, arXiv.org.
    10. Lea Melnikovová, 2017. "Can Game Theory Help to Mitigate Water Conflicts in the Syrdarya Basin?," Acta Universitatis Agriculturae et Silviculturae Mendelianae Brunensis, Mendel University Press, vol. 65(4), pages 1393-1401.
    11. Carvajal, Daniel & Franco, Catalina & Isaksson, Siri, 2024. "Will Artificial Intelligence Get in the Way of Achieving Gender Equality?," Discussion Paper Series in Economics 3/2024, Norwegian School of Economics, Department of Economics, revised 28 Apr 2025.
    12. Naomi Hausman & Oren Rigbi & Sarit Weisburd, 2025. "Generative AI’s Impact on Student Achievement and Implications for Worker Productivity," CESifo Working Paper Series 11843, CESifo.
    13. Daniele Cassese & Paolo Pin, 2025. "Decentralized pure exchange processes on networks," Social Choice and Welfare, Springer;The Society for Social Choice and Welfare, vol. 64(3), pages 427-463, May.
    14. Gérard Mondello & Evens Salies, 2016. "Tort law under oligopolistic competition," SciencePo Working papers Main hal-03459225, HAL.
    15. Han, Han & Julien, Benoît & Petursdottir, Asgerdur & Wang, Liang, 2019. "Asset liquidity and indivisibility," European Economic Review, Elsevier, vol. 119(C), pages 236-250.
    16. Evangelos Katsamakas & Oleg V. Pavlov & Ryan Saklad, 2024. "Artificial intelligence and the transformation of higher education institutions," Papers 2402.08143, arXiv.org.
    17. Carrillo-Tudela, Carlos & Graber, Michael & Waelde, Klaus, 2018. "Unemployment and vacancy dynamics with imperfect financial markets," Labour Economics, Elsevier, vol. 50(C), pages 128-143.
    18. Ursula F Ott & Pervez N Ghauri, 2019. "Brexit negotiations: From negotiation space to agreement zones," Journal of International Business Studies, Palgrave Macmillan;Academy of International Business, vol. 50(1), pages 137-149, February.
    19. José-Manuel Giménez-Gómez & António Osório & Josep E. Peris, 2015. "From Bargaining Solutions to Claims Rules: A Proportional Approach," Games, MDPI, vol. 6(1), pages 1-7, March.
    20. Yang Shen, 2024. "Future jobs: analyzing the impact of artificial intelligence on employment and its mechanisms," Economic Change and Restructuring, Springer, vol. 57(2), pages 1-33, April.

    More about this item

    NEP fields

    This paper has been announced in the following NEP Reports:

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:arx:papers:2503.20848. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: arXiv administrators (email available below). General contact details of provider: http://arxiv.org/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.