IDEAS home Printed from https://ideas.repec.org/a/plo/pone00/0338261.html

The impact of intransitivity on the Elo rating system

Author

Listed:
  • Adam H Hamilton
  • Anna Kalenkova
  • Matthew Roughan

Abstract

This paper studies the behaviour of the Elo rating system when the underlying modelling assumptions are not met. The Elo rating system is a popular statistical technique used to analyse pairwise comparison data. It is perhaps best known for rating chess players. The crucial assumption behind the Elo rating system is that the probability of which players wins a chess game depends primarily on a real-valued parameter, that quantifies the player’s “skill". This implicitly assumes that the binary relation “more likely to win against" is transitive. This paper studies how the Elo rating system behaves when this assumption is relaxed. First, we prove that once the assumption of transitivity is relaxed, the Elo ratings exhibits the undesirable property that estimated ratings are dependent on who plays who. Second, we prove that even when the assumption of transitivity is relaxed, for a given distribution with which players are selected, there is a unique point where the expected change of Elo ratings at that point is zero. This point represents the maximum likelihood estimator of the Elo ratings, given the observed data. Finally, we introduce a statistic that can be used to measure the intransitivity present in a game. We derive this measurement, and demonstrate on simulated data that it satisfies some useful sanity tests.

Suggested Citation

  • Adam H Hamilton & Anna Kalenkova & Matthew Roughan, 2025. "The impact of intransitivity on the Elo rating system," PLOS ONE, Public Library of Science, vol. 20(12), pages 1-25, December.
  • Handle: RePEc:plo:pone00:0338261
    DOI: 10.1371/journal.pone.0338261
    as

    Download full text from publisher

    File URL: https://journals.plos.org/plosone/article?id=10.1371/journal.pone.0338261
    Download Restriction: no

    File URL: https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0338261&type=printable
    Download Restriction: no

    File URL: https://libkey.io/10.1371/journal.pone.0338261?utm_source=ideas
    LibKey link: if access is restricted and if your library uses this service, LibKey will redirect you to where you can use your library subscription to access this item
    ---><---

    References listed on IDEAS

    as
    1. Firth, David, 2005. "Bradley-Terry Models in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 12(i01).
    Full references (including those not matched with items on IDEAS)

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Seonghyeon Kim & Jaesik Yang, 2024. "Toward better drivability: Investigating user preferences for tip-in acceleration profiles in electric vehicles," PLOS ONE, Public Library of Science, vol. 19(12), pages 1-26, December.
    2. McHale, Ian & Morton, Alex, 2011. "A Bradley-Terry type model for forecasting tennis match results," International Journal of Forecasting, Elsevier, vol. 27(2), pages 619-630, April.
    3. repec:jss:jstsof:32:i10 is not listed on IDEAS
    4. Anna Gottard & Giorgio Calzolari, 2014. "Alternative estimating procedures for multiple membership logit models with mixed effects: indirect inference and data cloning," Econometrics Working Papers Archive 2014_07, Universita' degli Studi di Firenze, Dipartimento di Statistica, Informatica, Applicazioni "G. Parenti".
    5. Jones, Ian, 2025. "Comparative judgement without the fancy statistics," OSF Preprints r487u_v1, Center for Open Science.
    6. Vicente Rodríguez Montequín & Joaquín Manuel Villanueva Balsera & Marina Díaz Piloñeta & César Álvarez Pérez, 2020. "A Bradley-Terry Model-Based Approach to Prioritize the Balance Scorecard Driving Factors: The Case Study of a Financial Software Factory," Mathematics, MDPI, vol. 8(2), pages 1-15, February.
    7. McHale, Ian & Morton, Alex, 2011. "A Bradley-Terry type model for forecasting tennis match results," International Journal of Forecasting, Elsevier, vol. 27(2), pages 619-630.
    8. Wickelmaier, Florian & Strobl, Carolin & Zeileis, Achim, 2012. "Psychoco: Psychometric Computing in R," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i01).
    9. David G. Hamilton & Martin J. Whiting & Sarah R. Pryke, 2013. "Fiery frills: carotenoid-based coloration predicts contest success in frillneck lizards," Behavioral Ecology, International Society for Behavioral Ecology, vol. 24(5), pages 1138-1149.
    10. Turner, Heather & Firth, David, 2012. "Bradley-Terry Models in R: The BradleyTerry2 Package," Journal of Statistical Software, Foundation for Open Access Statistics, vol. 48(i09).

    More about this item

    Statistics

    Access and download statistics

    Corrections

    All material on this site has been provided by the respective publishers and authors. You can help correct errors and omissions. When requesting a correction, please mention this item's handle: RePEc:plo:pone00:0338261. See general information about how to correct material in RePEc.

    If you have authored this item and are not yet registered with RePEc, we encourage you to do it here. This allows to link your profile to this item. It also allows you to accept potential citations to this item that we are uncertain about.

    If CitEc recognized a bibliographic reference but did not link an item in RePEc to it, you can help with this form .

    If you know of missing items citing this one, you can help us creating those links by adding the relevant references in the same way as above, for each refering item. If you are a registered author of this item, you may also want to check the "citations" tab in your RePEc Author Service profile, as there may be some citations waiting for confirmation.

    For technical questions regarding this item, or to correct its authors, title, abstract, bibliographic or download information, contact: plosone (email available below). General contact details of provider: https://journals.plos.org/plosone/ .

    Please note that corrections may take a couple of weeks to filter through the various RePEc services.

    IDEAS is a RePEc service. RePEc uses bibliographic data supplied by the respective publishers.