https://doi.org/10.1140/epjds/s13688-018-0147-7
Regular article
Travelers or locals? Identifying meaningful sub-populations from human movement data in the absence of ground truth
1
Department of Geography, University of Zurich, Zurich, Switzerland
2
Department of Infrastructure Engineering, The University of Melbourne, Victoria, Australia
* e-mail: lscherrer7@gmail.com
Received:
25
January
2018
Accepted:
25
June
2018
Published online:
4
July
2018
As users of mobile devices make phone calls, browse the web, or use an app, large volumes of data are routinely generated that are a potentially useful source for investigating human behavior in space. However, as such data are usually collected only as a by-product, they often lack stringent experimental design and ground truth, which makes interpretation and derivation of valid behavioral conclusions challenging. Here, we propose an unsupervised, data-driven approach to identify different user types based on high-resolution human movement data collected from a smartphone navigation app, in the absence of ground truth. We capture spatio-temporal footprints of users, characterized by meaningful summary statistics, which are then used in an unsupervised step to identify user types. Based on an extensive dataset of users of the mobile navigation app Sygic in Australia, we show how the proposed methodology allows to identify two distinct groups of users: ‘travelers’, visiting different areas with distinct, salient characteristics, and ‘locals’, covering shorter distances and revisiting many of their locations. We verify our approach by relating user types to space use: we find that travelers and locals prefer to visit distinct, different locations in the Australian cities Sydney and Melbourne, as suggested independently by other studies. Although we use high-resolution GPS data, the proposed methodology is potentially transferable to low-resolution movement data (e.g. Call Detail Records), since we rely only on summary statistics.
Key words: Human mobility / Clustering / PCA / User characterization / Unsupervised learning / Movement patterns
© The Author(s), 2018