Loading…

Exploring multiple evidence to infer users’ location in Twitter

Online social networks are valuable sources of information to monitor real-time events, such as earthquakes and epidemics. For this type of surveillance, users’ location is an essential piece of information, but a substantial number of users choose not to disclose their geographical location. Howeve...

Full description

Saved in:
Bibliographic Details
Published in:Neurocomputing (Amsterdam) 2016-01, Vol.171, p.30-38
Main Authors: Rodrigues, Erica, Assunção, Renato, Pappa, Gisele L., Renno, Diogo, Meira Jr, Wagner
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:Online social networks are valuable sources of information to monitor real-time events, such as earthquakes and epidemics. For this type of surveillance, users’ location is an essential piece of information, but a substantial number of users choose not to disclose their geographical location. However, characteristics of the users׳ behavior, such as the friends they associate with and the types of messages published may hint on their spatial location. In this paper, we propose a method to infer the spatial location of Twitter users. Unlike the approaches proposed so far, it incorporates two sources of information to learn geographical position: the text posted by users and their friendship network. We propose a probabilistic approach that jointly models the geographical labels and Twitter texts of users organized in the form of a graph representing the friendship network. We use the Markov random field probability model to represent the network, and learning is carried out through a Markov Chain Monte Carlo simulation technique to approximate the posterior probability distribution of the missing geographical labels. We show the accuracy of the algorithm in a large dataset of Twitter users, where the ground truth is the location given by GPS. The method presents promising results, with little sensitivity to parameters and high values of precision.
ISSN:0925-2312
1872-8286
DOI:10.1016/j.neucom.2015.05.066