Route identification in the National Football League: An application of model-based curve clustering using the EM algorithm

Author

Listed:
  • Chu Dani

    (Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada)

  • Reyers Matthew

    (Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada)

  • Thomson James

    (Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada)

  • Wu Lucas Yifan

    (Department of Statistics and Actuarial Science, Simon Fraser University, Burnaby, BC, V5A 1S6, Canada)

Abstract

Tracking data in the National Football League (NFL) is a sequence of spatial-temporal measurements that varies in length depending on the duration of the play. In this paper, we demonstrate how model-based curve clustering of observed player trajectories can be used to identify the routes run by eligible receivers on offensive passing plays. We use a Bernstein polynomial basis function to represent cluster centers, and the Expectation Maximization algorithm to learn the route labels for each of the 33,967 routes run on the 6,963 passing plays in the data set. With few assumptions and no pre-existing labels, we are able to closely recreate the standard route tree from our algorithm. We go on to suggest ideas for new potential receiver metrics that account for receiver deployment and movement common throughout the league. The resulting route labels can also be paired with film to enable streamlined queries of game film.
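
The approach described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the cubic degree, the rescaling of each trajectory's time axis to [0, 1], the isotropic Gaussian noise model, the farthest-point initialisation, and the names `bernstein_design` and `fit_route_clusters` are all assumptions made for illustration. Each cluster center is a Bernstein polynomial curve, and EM alternates between soft route assignments and weighted least-squares updates of the control points.

```python
from math import comb

import numpy as np


def bernstein_design(n_points, degree):
    """Bernstein basis evaluated at n_points evenly spaced times in [0, 1]."""
    t = np.linspace(0.0, 1.0, n_points)[:, None]
    k = np.arange(degree + 1)[None, :]
    binom = np.array([comb(degree, j) for j in range(degree + 1)], dtype=float)
    return binom * t**k * (1.0 - t) ** (degree - k)


def fit_route_clusters(curves, n_clusters, degree=3, n_iter=30):
    """EM for a mixture of Bernstein curves (illustrative assumptions only).

    curves: list of (m_i, 2) arrays of x, y positions; m_i may differ per play.
    Returns hard labels, per-cluster control points, and responsibilities.
    """
    # Rescaling time to [0, 1] lets trajectories of different length share a basis.
    designs = [bernstein_design(len(c), degree) for c in curves]

    # Initialisation (assumed): per-curve least-squares fits, then pick
    # mutually distant curves as the starting cluster centers.
    coefs = [np.linalg.lstsq(B, c, rcond=None)[0] for B, c in zip(designs, curves)]
    centers = [coefs[0].copy()]
    while len(centers) < n_clusters:
        dists = [min(np.sum((p - q) ** 2) for q in centers) for p in coefs]
        centers.append(coefs[int(np.argmax(dists))].copy())
    var = np.full(n_clusters, np.var(np.vstack(curves)))
    weights = np.full(n_clusters, 1.0 / n_clusters)

    for _ in range(n_iter):
        # E-step: log posterior (up to a constant) of each cluster per curve.
        log_r = np.empty((len(curves), n_clusters))
        for i, (B, c) in enumerate(zip(designs, curves)):
            for k in range(n_clusters):
                sse = np.sum((c - B @ centers[k]) ** 2)
                log_r[i, k] = (np.log(weights[k])
                               - 0.5 * c.size * np.log(2 * np.pi * var[k])
                               - 0.5 * sse / var[k])
        log_r -= log_r.max(axis=1, keepdims=True)  # avoid underflow in exp
        resp = np.exp(log_r)
        resp /= resp.sum(axis=1, keepdims=True)

        # M-step: mixing weights, weighted least squares, noise variance.
        weights = np.clip(resp.mean(axis=0), 1e-12, None)
        for k in range(n_clusters):
            r = resp[:, k]
            A = sum(w * B.T @ B for w, B in zip(r, designs))
            b = sum(w * B.T @ c for w, B, c in zip(r, designs, curves))
            centers[k] = np.linalg.solve(A + 1e-8 * np.eye(degree + 1), b)
            sse = sum(w * np.sum((c - B @ centers[k]) ** 2)
                      for w, B, c in zip(r, designs, curves))
            n_obs = sum(w * c.size for w, c in zip(r, curves))
            var[k] = max(sse / max(n_obs, 1e-12), 1e-6)

    return resp.argmax(axis=1), centers, resp


# Toy usage on synthetic data (not NFL tracking data): straight lines
# of differing lengths heading in two different directions.
rng = np.random.default_rng(0)
toy = [np.column_stack([np.zeros(m), np.linspace(0, 20, m)]) for m in (20, 25, 30)]
toy += [np.column_stack([np.linspace(0, 15, m), np.linspace(0, 5, m)]) for m in (22, 28)]
toy = [c + rng.normal(0.0, 0.3, c.shape) for c in toy]
labels, centers, resp = fit_route_clusters(toy, n_clusters=2)
```

Because the basis is evaluated on a normalised time grid, a 20-frame play and a 30-frame play are compared against the same cluster curve without resampling, which is one simple way to handle the variable-length sequences the abstract mentions.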

Suggested Citation

  • Chu Dani & Reyers Matthew & Thomson James & Wu Lucas Yifan, 2020. "Route identification in the National Football League: An application of model-based curve clustering using the EM algorithm," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 16(2), pages 121-132, June.
  • Handle: RePEc:bpj:jqsprt:v:16:y:2020:i:2:p:121-132:n:1
    DOI: 10.1515/jqas-2019-0047

    Download full text from publisher

    File URL: https://doi.org/10.1515/jqas-2019-0047
    Download Restriction: For access to full text, subscription to the journal or payment for the individual article is required.

    Most related items

    These are the items that most often cite the same works as this one and are cited by the same works as this one.
    1. Franks Alexander M. & D’Amour Alexander & Cervone Daniel & Bornn Luke, 2016. "Meta-analytics: tools for understanding the statistical properties of sports metrics," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 12(4), pages 151-165, December.
    2. Mallepalle Sarah & Yurko Ronald & Pelechrinis Konstantinos & Ventura Samuel L., 2020. "Extracting NFL tracking data from images to evaluate quarterbacks and pass defenses," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 16(2), pages 95-120, June.
    3. Yurko Ronald & Ventura Samuel & Horowitz Maksim, 2019. "nflWAR: a reproducible method for offensive player evaluation in football," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 15(3), pages 163-183, September.
    4. Jyh-How Huang & Yu-Chia Hsu, 2021. "A Multidisciplinary Perspective on Publicly Available Sports Data in the Era of Big Data: A Scoping Review of the Literature on Major League Baseball," SAGE Open, vol. 11(4), pages 21582440211, November.
    5. Shane Sanders & Joel Potter & Justin Ehrlich & Justin Perline & Christopher Boudreaux, 2021. "Informed voters and electoral outcomes: a natural experiment stemming from a fundamental information-technological shift," Public Choice, Springer, vol. 189(1), pages 257-277, October.
    6. Vock David Michael & Vock Laura Frances Boehm, 2018. "Estimating the effect of plate discipline using a causal inference framework: an application of the G-computation algorithm," Journal of Quantitative Analysis in Sports, De Gruyter, vol. 14(2), pages 37-56, June.
