Loading…

Identification of Representative Patterns of Time Use Activity Through Fuzzy C-Means Clustering

Analysis of the time use activity patterns of urbanites will contribute greatly to the modeling of urban transportation demands by linking activity generation and activity scheduling modules in the overall activity-based modeling framework. This paper develops a framework for novel pattern recogniti...

Full description

Saved in:
Bibliographic Details
Published in:Transportation research record 2017, Vol.2668 (1), p.38-50
Main Authors: Hafezi, Mohammad Hesam, Liu, Lei, Millward, Hugh
Format: Article
Language:English
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Analysis of the time use activity patterns of urbanites will contribute greatly to the modeling of urban transportation demands by linking activity generation and activity scheduling modules in the overall activity-based modeling framework. This paper develops a framework for novel pattern recognition modeling to identify groups of individuals with homogeneous daily activity patterns. The framework consists of four modules: initialization of the total cluster number and cluster centroids, identification of individuals with homogeneous activity patterns and grouping of them into clusters, identification of sets of representative activity patterns, and exploration of interdependencies among the attributes in each identified cluster. Numerous new machine-learning techniques, such as the fuzzy C-means clustering algorithm and the classification and regression tree classifier, are employed in the process of pattern recognition. The 24-h activity patterns are split into 288 intervals of 5-min duration. Each interval includes information on activity types, duration, start time, location, and travel mode, if applicable. Aggregated statistical evaluation and Kolmogorov–Smirnov tests are performed to determine statistical significance of clustered data. Results show a heterogeneous diversity in eight identified clusters in relation to temporal distribution and significant differences in a variety of sociodemographic variables. The insights gained from this study include important information on activities—such as activity type, start time, duration, location, and travel distance—that are essential for the scheduling phase of the activity-based model. Finally, the results of this paper are expected to be implemented within the activity-based travel demand model for Halifax, Nova Scotia.
ISSN:0361-1981
2169-4052
DOI:10.3141/2668-05