Author
Listed:
- Nikoleta E Glynatsi
- Vincent Knight
- Marc Harper
Abstract
Researchers have explored the performance of Iterated Prisoner’s Dilemma strategies for decades, from the celebrated performance of Tit for Tat to the introduction of the zero-determinant strategies and the use of sophisticated learning structures such as neural networks. Many new strategies have been introduced and tested in a variety of tournaments and population dynamics. Typical results in the literature, however, rely on performance against a small number of somewhat arbitrarily selected strategies, casting doubt on the generalizability of conclusions. In this work, we analyze a large collection of 195 strategies in thousands of computer tournaments, present the top performing strategies across multiple tournament types, and distill their salient features. The results show that there is not yet a single strategy that performs well in diverse Iterated Prisoner’s Dilemma scenarios, nevertheless there are several properties that heavily influence the best performing strategies. This refines the properties described by Axelrod in light of recent and more diverse opponent populations to: be nice, be provocable and generous, be a little envious, be clever, and adapt to the environment. More precisely, we find that strategies perform best when their probability of cooperation matches the total tournament population’s aggregate cooperation probabilities. The features of high performing strategies help cast some light on why strategies such as Tit For Tat performed historically well in tournaments and why zero-determinant strategies typically do not fare well in tournament settings.Author summary: In 1980, political scientist Robert Axelrod ran one of the most famous computer tournaments of the Iterated Prisoner’s Dilemma (IPD). The winner? The now-famous strategy, Tit for Tat. Axelrod attributed its success to simple properties such as: do not be envious, avoid being the first to defect, and do not be overly clever. Yet the tournament design, using only a small, selected set of strategies, not including random noise, and having fixed game lengths, raises questions about the generalizability of these results. Many researchers have continued to make similar assumptions in their own IPD experiments, limiting the insights that can be applied to more complex, realistic settings.
Suggested Citation
Nikoleta E Glynatsi & Vincent Knight & Marc Harper, 2024.
"Properties of winning Iterated Prisoner’s Dilemma strategies,"
PLOS Computational Biology, Public Library of Science, vol. 20(12), pages 1-19, December.
Handle:
RePEc:plo:pcbi00:1012644
DOI: 10.1371/journal.pcbi.1012644
Download full text from publisher
Corrections
All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pcbi00:1012644. See general information about how to correct material in RePEc.
If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.
We have no bibliographic references for this item. You can help adding them by using this form .
If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.
For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: ploscompbiol (email available below). General contact details of provider: https://journals.plos.org/ploscompbiol/ .
Please note that corrections may take a couple of weeks to filter through
the various RePEc services.