Loading…

From traces to trajectories: How well can we guess activity locations from mobile phone traces?

•We develop a procedure to generate a simulated mobile phone dataset containing ground truth information on locations visited.•We develop improved models to identify locations visited.•The distribution of the locations visited matches the ground truth well.•70% and 97% of identified homes are within...

Full description

Saved in:
Bibliographic Details
Published in:Transportation research. Part C, Emerging technologies Emerging technologies, 2014-09, Vol.46, p.326-337
Main Authors: Chen, Cynthia, Bian, Ling, Ma, Jingtao
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•We develop a procedure to generate a simulated mobile phone dataset containing ground truth information on locations visited.•We develop improved models to identify locations visited.•The distribution of the locations visited matches the ground truth well.•70% and 97% of identified homes are within 100 and 1000m from the truth.•65% and 86% of identified workplaces are within 100 and 1000m from the truth. Passively generated mobile phone dataset is emerging as a new data source for research in human mobility patterns. Information on individuals’ trajectories is not directly available from such data; they must be inferred. Many questions remain in terms how well we can capture human mobility patterns from these datasets. Only one study has compared the results from a mobile phone dataset to those from the National Household Travel Survey (NHTS), though the comparison is on two different populations and samples. This study is a very first attempt that develops a procedure to generate a simulated mobile phone dataset containing the ground truth information. This procedure can be used by other researchers and practitioners who are interested in using mobile phone data and want to formally evaluate the effectiveness of an algorithm. To identify activity locations from mobile phone traces, we develop an ensemble of methods: a model-based clustering method to identify clusters, a logistic regression model to distinguish between activity and travel clusters, and a set of behavior-based algorithms to detect types of locations visited. We show that the distribution of the activity locations identified from the simulated mobile phone dataset resembles the ground truth better than the existing studies. For home locations, 70% and 97% of identified homes are within 100 and 1000m from the truth, respectively. For work places, 65% and 86% of the identified work places are within 100 and 1000m from the true ones, respectively. These results point to the possibility of using these passively generated mobile phone datasets to supplement or even replace household travel surveys in transportation planning in the future.
ISSN:0968-090X
1879-2359
DOI:10.1016/j.trc.2014.07.001