Loading…

An exhaustive comparison of distance measures in the classification of time series with 1NN method

Time series classification is an important and challenging problem in data analysis. With the increase in time series data availability, hundreds of algorithms have been proposed. A huge effort over the past two decades caused a significant improvement in both the efficiency and effectiveness of tim...

Full description

Saved in:
Bibliographic Details
Published in:Journal of computational science 2024-03, Vol.76, p.102235, Article 102235
Main Authors: Górecki, Tomasz, Łuczak, Maciej, Piasecki, Paweł
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Time series classification is an important and challenging problem in data analysis. With the increase in time series data availability, hundreds of algorithms have been proposed. A huge effort over the past two decades caused a significant improvement in both the efficiency and effectiveness of time series classification. There is a belief in the community that the best method is a surprisingly simple one. Even though there exist many algorithms outperforming the nearest neighbor (NN) classifier, the popularity of the latter remains stable — due to its simplicity and high performance in many domains, especially with dynamic time warping (DTW) as the distance measure. In the paper, we present an exhaustive study in which we compare the performance of different similarity measures relying on the 1NN classifier. We used the most highly cited time series distance measures used in classification (in total we compared 56 distance measures). We evaluate methods on all datasets from the UCR Time Series Classification Archive. Additionally, we perform extensive statistical comparison of the examined methods. We show that none of the distance measures is the best for all datasets, however, there is a group performing statistically significantly better than the others. •An extensive comparison of distance measures for time series classification.•Comprehensive statistical analysis.•The biggest available study considering both the number of data sets (128) and the number of distances (56).
ISSN:1877-7503
1877-7511
DOI:10.1016/j.jocs.2024.102235