Loading…

Route identification in the National Football League: An application of model-based curve clustering using the EM algorithm

Tracking data in the National Football League (NFL) is a sequence of spatial-temporal measurements that varies in length depending on the duration of the play. In this paper, we demonstrate how model-based curve clustering of observed player trajectories can be used to identify the routes run by eli...

Full description

Saved in:
Bibliographic Details
Published in:Journal of quantitative analysis in sports 2020-06, Vol.16 (2), p.121-132
Main Authors: Chu, Dani, Reyers, Matthew, Thomson, James, Wu, Lucas Yifan
Format: Article
Language:English
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Tracking data in the National Football League (NFL) is a sequence of spatial-temporal measurements that varies in length depending on the duration of the play. In this paper, we demonstrate how model-based curve clustering of observed player trajectories can be used to identify the routes run by eligible receivers on offensive passing plays. We use a Bernstein polynomial basis function to represent cluster centers, and the Expectation Maximization algorithm to learn the route labels for each of the 33,967 routes run on the 6963 passing plays in the data set. With few assumptions and no pre-existing labels, we are able to closely recreate the standard route tree from our algorithm. We go on to suggest ideas for new potential receiver metrics that account for receiver deployment and movement common throughout the league. The resulting route labels can also be paired with film to enable streamlined queries of game film.
ISSN:2194-6388
1559-0410
DOI:10.1515/jqas-2019-0047