IDEAS home Printed from https://ideas.repec.org/p/arx/papers/2503.20986.html

MAD Chairs: A new tool to evaluate AI

Author

Listed:
  • Chris Santos-Lang
  • Christopher M. Homan

Abstract

This paper contributes a new way to evaluate AI. Much as one might evaluate a machine in terms of its performance at chess, this approach involves evaluating a machine in terms of its performance at a game called "MAD Chairs". At the time of writing, evaluation with this game exposed opportunities to improve Claude, Gemini, ChatGPT, Qwen and DeepSeek. Furthermore, this paper sets a stage for future innovation in game theory and AI safety by providing an example of success with non-standard approaches to each: studying a game beyond the scope of previous game theoretic tools and mitigating a serious AI safety risk in a way that requires neither determination of values nor their enforcement.

Suggested Citation

  • Chris Santos-Lang & Christopher M. Homan, 2025. "MAD Chairs: A new tool to evaluate AI," Papers 2503.20986, arXiv.org, revised May 2025.
  • Handle: RePEc:arx:papers:2503.20986

    Download full text from publisher

    File URL: http://arxiv.org/pdf/2503.20986
    File Function: Latest version
    Download Restriction: no

    References listed on IDEAS

    1. Kalliopi Kastampolidou & Christos Papalitsas & Theodore Andronikos, 2022. "The Distributed Kolkata Paise Restaurant Game," Games, MDPI, vol. 13(3), pages 1-21, April.
    2. Abreu, Dilip & Dutta, Prajit K & Smith, Lones, 1994. "The Folk Theorem for Repeated Games: A NEU Condition," Econometrica, Econometric Society, vol. 62(4), pages 939-948, July.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Drew Fudenberg & David K. Levine & Satoru Takahashi, 2008. "Perfect public equilibrium when players are patient," World Scientific Book Chapters, in: Drew Fudenberg & David K Levine (ed.), A Long-Run Collaboration On Long-Run Games, chapter 16, pages 345-367, World Scientific Publishing Co. Pte. Ltd.
    2. Mehmet Barlo & Guilherme Carmona, 2007. "One-memory in repeated games," Nova SBE Working Paper Series wp500, Universidade Nova de Lisboa, Nova School of Business and Economics.
    3. Pedro Bó, 2007. "Social norms, cooperation and inequality," Economic Theory, Springer;Society for the Advancement of Economic Theory (SAET), vol. 30(1), pages 89-105, January.
    4. Quan Wen, 2002. "Repeated Games with Asynchronous Moves," Vanderbilt University Department of Economics Working Papers 0204, Vanderbilt University Department of Economics.
    5. Asen Kochov & Yangwei Song, 2023. "Intertemporal Hedging and Trade in Repeated Games With Recursive Utility," Econometrica, Econometric Society, vol. 91(6), pages 2333-2369, November.
    6. Liu, Ce, 2023. "Stability in repeated matching markets," Theoretical Economics, Econometric Society, vol. 18(4), November.
    7. Iskakov, A. & Iskakov, M., 2017. "In Search of a Generalized Concept of Rationality," Journal of the New Economic Association, New Economic Association, vol. 34(2), pages 181-189.
    8. Sambuddha Ghosh & Seungjin Han, 2012. "Repeated Contracting in Decentralised Markets," Department of Economics Working Papers 2012-03, McMaster University, revised May 2013.
    9. Stähler, Frank & Wagner, Friedrich, 1998. "Cooperation in a resource extraction game," Kiel Working Papers 846, Kiel Institute for the World Economy (IfW Kiel).
    10. Bernergård, Axel, 2011. "Folk Theorems for Present-Biased Players," SSE/EFI Working Paper Series in Economics and Finance 736, Stockholm School of Economics.
    11. Aramendia, Miguel, 2006. "Asymmetric finite punishments in repeated games," Economics Letters, Elsevier, vol. 92(2), pages 234-239, August.
    12. Luca Anderlini & Dino Gerardi & Roger Lagunoff, 2004. "The Folk Theorem in Dynastic Repeated Games," Levine's Bibliography 122247000000000577, UCLA Department of Economics.
    13. David Levine, 2000. "The Castle on the Hill," Review of Economic Dynamics, Elsevier for the Society for Economic Dynamics, vol. 3(2), pages 330-337, April.
    14. Johannes Hörner & Satoru Takahashi & Nicolas Vieille, 2015. "Truthful Equilibria in Dynamic Bayesian Games," Econometrica, Econometric Society, vol. 83(5), pages 1795-1848, September.
    15. Goldlücke, Susanne & Kranz, Sebastian, 2012. "Infinitely repeated games with public monitoring and monetary transfers," Journal of Economic Theory, Elsevier, vol. 147(3), pages 1191-1221.
    16. Lipman, Barton L. & Wang, Ruqu, 2009. "Switching costs in infinitely repeated games," Games and Economic Behavior, Elsevier, vol. 66(1), pages 292-314, May.
    17. Committee, Nobel Prize, 2005. "Robert Aumann's and Thomas Schelling's Contributions to Game Theory: Analyses of Conflict and Cooperation," Nobel Prize in Economics documents 2005-1, Nobel Prize Committee.
    18. Gonzalez-Diaz, Julio, 2006. "Finitely repeated games: A generalized Nash folk theorem," Games and Economic Behavior, Elsevier, vol. 55(1), pages 100-111, April.
    19. Laclau, M., 2013. "Repeated games with local monitoring and private communication," Economics Letters, Elsevier, vol. 120(2), pages 332-337.
    20. Ghislain-Herman Demeze-Jouatsa, 2020. "A complete folk theorem for finitely repeated games," International Journal of Game Theory, Springer;Game Theory Society, vol. 49(4), pages 1129-1142, December.
