Loading…

An Ensemble of Deep Clustering Models With Autoencoders to Mine Travel Patterns From Smart Card Data

In recent research, clustering algorithms have been utilized to analyze smart card travel patterns. However, these methods often struggle due to the need for a predetermined number of clusters and the challenges posed by the curse of dimensionality. To address these issues and improve travel pattern...

Full description

Saved in:
Bibliographic Details
Published in:IEEE transactions on intelligent transportation systems 2024-12, Vol.25 (12), p.20960-20969
Main Authors: Saronian, Sharon, Yousefimehr, Behnam, Ghatee, Mehdi, Bejani, Mohammad Mahdi
Format: Article
Language:English
Subjects:
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:In recent research, clustering algorithms have been utilized to analyze smart card travel patterns. However, these methods often struggle due to the need for a predetermined number of clusters and the challenges posed by the curse of dimensionality. To address these issues and improve travel pattern recognition from smart-card transactions, a new framework has been proposed. This framework uses autoencoders to extract low-dimensional representations from transactions and an ensemble of deep clustering models to identify patterns. Additionally, it compares geodetic and network distances between stations for feature extraction. A key advantage of this approach is its ability to automatically determine the number of clusters and effectively mitigate overfitting using the Sharon index based on Rademacher complexity. When applied to the London metro smart card benchmark with more than 10,000 samples, the framework identified 7 clusters of travel patterns, compared to 3 clusters by baseline methods. Each cluster corresponds to a specific day of the week, uncovering distinct travel behaviors across the week. Furthermore, within each cluster, the framework detected three daily sub-patterns: morning peak, midday, and evening peak. This resulted in 21 unique travel patterns, allowing a detailed analysis of both weekday and weekend travel dynamics. These findings were validated using the Davies-Bouldin, Silhouette, and Calinski-Harabasz indices. The impact of the starting time for daily services has also been analyzed, showing that daily travel patterns are highly sensitive to this parameter.
ISSN:1524-9050
1558-0016
DOI:10.1109/TITS.2024.3475295