Extracting Mobile Behavioral Patterns with the Distant N-Gram Topic Model

  • Katayoun Farrahi
  • , Daniel Gatica-Perez

Research output: Chapter in Book/Report/Conference proceedingConference proceedingspeer-review

Abstract

Mining patterns of human behavior from large-scale mobile phone data has potential to understand certain phenomena in society. The study of such human-centric massive datasets requires new mathematical models. In this paper, we propose a probabilistic topic model that we call the distant n-gram topic model (DNTM) to address the problem of learning long duration human location sequences. The DNTM is based on Latent Dirichlet Allocation (LDA). We define the generative process for the model, derive the inference procedure and evaluate our model on real mobile data. We consider two different real-life human datasets, collected by mobile phone locations, the first considering GPS locations and the second considering cell tower connections. The DNTM successfully discovers topics on the two datasets. Finally, the DNTM is compared to LDA by considering log-likelihood performance on unseen data, showing the predictive power of the model on unseen data. We find that the DNTM consistantly outperforms LDA as the sequence length increases.
Original languageEnglish
Title of host publicationInternational Symposium on Wearable Computers (ISWC)
Number of pages8
Publication statusPublished - Jun 2012

Fields of science

  • 102 Computer Sciences
  • 102009 Computer simulation
  • 102013 Human-computer interaction
  • 102019 Machine learning
  • 102020 Medical informatics
  • 102021 Pervasive computing
  • 102022 Software development
  • 102025 Distributed systems
  • 202017 Embedded systems
  • 211902 Assistive technologies
  • 211912 Product design

JKU Focus areas

  • Computation in Informatics and Mathematics
  • Engineering and Natural Sciences (in general)

Cite this