The use of machine learning in sport outcome prediction: A review

Bibliographic Details
Published in: Wiley Interdisciplinary Reviews: Data Mining and Knowledge Discovery, 2020-09, Vol. 10 (5), e1380
Main Authors: Horvat, Tomislav; Job, Josip
Format: Article
Language: English
Description
Summary: The growing volume of structured and unstructured data, related to more than just sport events, has driven the development and wider use of techniques that extract information and apply machine-learning algorithms to predict process outcomes based on input, but not necessarily output, data. In sports, predicting outcomes and extracting valuable information has become appealing not only to sports professionals but also to a wider audience, particularly in team management and sports betting. This article reviews existing machine learning (ML) algorithms for predicting sport outcomes. Over 100 papers were analyzed, of which only some were taken into consideration. Almost all of the analyzed papers use some form of feature selection and feature extraction, most often before applying the ML algorithm. To evaluate ML algorithms, researchers in most cases use data segmentation with the data ordered chronologically; some also use k-fold cross-validation. Sport prediction is usually treated as a classification problem in which one class is predicted; in rare cases, numerical values are predicted. The most commonly used ML models are neural networks evaluated with data segmentation.
This article is categorized under: Technologies > Machine Learning; Technologies > Prediction.
Figure: Boxplot of achieved maximum accuracies related to analyzed sports.
ISSN: 1942-4787, 1942-4795
DOI: 10.1002/widm.1380
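
The summary names two evaluation schemes: chronologically ordered data segmentation and k-fold cross-validation. The following is a minimal sketch of how each might partition sample indices. It is not taken from the article: the function names `chronological_split` and `k_fold_indices`, the 80/20 default, and the use of contiguous, unshuffled folds are all assumptions made for illustration.

```python
from typing import List, Tuple


def chronological_split(
    n_samples: int, train_fraction: float = 0.8
) -> Tuple[List[int], List[int]]:
    """Split indices 0..n_samples-1 into (train, test).

    Assumes samples are already sorted from oldest to newest, so the
    model is trained on earlier matches and tested on later ones.
    The 0.8 default is an illustrative assumption.
    """
    if not 0.0 < train_fraction < 1.0:
        raise ValueError("train_fraction must be strictly between 0 and 1")
    cut = int(n_samples * train_fraction)
    return list(range(cut)), list(range(cut, n_samples))


def k_fold_indices(n_samples: int, k: int) -> List[Tuple[List[int], List[int]]]:
    """Return k (train, test) index pairs using contiguous, unshuffled folds.

    Each sample appears in exactly one test fold. When n_samples is not
    divisible by k, the first folds receive one extra sample each.
    """
    if not 2 <= k <= n_samples:
        raise ValueError("k must satisfy 2 <= k <= n_samples")
    base, extra = divmod(n_samples, k)
    folds = []
    start = 0
    for i in range(k):
        size = base + (1 if i < extra else 0)
        stop = start + size
        test = list(range(start, stop))
        train = list(range(0, start)) + list(range(stop, n_samples))
        folds.append((train, test))
        start = stop
    return folds


if __name__ == "__main__":
    train, test = chronological_split(10)
    print("chronological:", train, test)
    for train, test in k_fold_indices(10, 3):
        print("fold test indices:", test)
```

Note that ordinary k-fold cross-validation lets a model train on matches played after the ones it is tested on, which is one reason a chronological split may be preferred for sport data.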