Abstract
In part due to the proliferation of GPS-equipped mobile devices, massive volumes of geo-tagged streaming text messages are becoming available on social media. It is of great interest to discover the most frequent nearby terms from such large-scale stream data. In this paper, we present novel indexing, updating, and query processing techniques that are capable of discovering the top-k most frequent nearby terms over a sliding window. Specifically, given a query location and a set of geo-tagged messages within a sliding window, we study the problem of searching for the top-k terms by considering term frequency, spatial proximity, and term freshness. We develop a novel and efficient mechanism to solve the problem, including a quad-tree based indexing structure, an index update technique, and a best-first search algorithm. An empirical study, conducted by varying a number of parameters, shows that our proposed techniques are efficient and meet users’ requirements.
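To make the query concrete, the following is a minimal brute-force sketch of a top-k nearby-term query over a sliding window. It is not the quad-tree index or best-first search described in the abstract; it only illustrates how frequency, proximity, and freshness might be combined into one score. The function name `top_k_nearby_terms`, the message tuple layout, the `max_dist` cutoff, and the linear proximity and freshness weights are all assumptions made for illustration and are not taken from the paper.

```python
import heapq
import math
from collections import defaultdict


def top_k_nearby_terms(messages, query_xy, now, window, k, max_dist):
    """Brute-force baseline: score every term seen in the current window.

    messages: iterable of (timestamp, (x, y), [terms]) tuples (assumed layout).
    The weighting below is an illustrative assumption, not the paper's formula.
    Assumes window > 0 and max_dist > 0.
    """
    scores = defaultdict(float)
    for ts, (x, y), terms in messages:
        age = now - ts
        if age < 0 or age >= window:
            continue  # message falls outside the sliding window
        dist = math.hypot(x - query_xy[0], y - query_xy[1])
        if dist > max_dist:
            continue  # too far from the query location to count as nearby
        proximity = 1.0 - dist / max_dist  # 1 at the query point, 0 at max_dist
        freshness = 1.0 - age / window     # 1 for a brand-new message, near 0 when old
        for term in set(terms):            # count each term once per message
            scores[term] += proximity * freshness  # summing captures frequency
    # Return the k highest-scoring (term, score) pairs, best first.
    return heapq.nlargest(k, scores.items(), key=lambda kv: kv[1])


messages = [
    (10, (0.0, 0.0), ["coffee", "rain"]),
    (9, (0.0, 0.0), ["coffee"]),
    (1, (5.0, 5.0), ["parade"]),
    (10, (100.0, 100.0), ["far"]),
]
result = top_k_nearby_terms(messages, (0.0, 0.0), now=10, window=10, k=2, max_dist=10.0)
```

This baseline scans every message for every query, which is the cost that an index with pruning is meant to avoid.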
Original language | English (US) |
---|---|
Pages (from-to) | 1953-1970 |
Number of pages | 18 |
Journal | World Wide Web |
Volume | 22 |
Issue number | 5 |
State | Published - Sep 15 2019 |
Externally published | Yes |
Bibliographical note
Publisher Copyright: © 2018, Springer Science+Business Media, LLC, part of Springer Nature.
Keywords
- Spatial
- Temporal
- Term
- Top-k
ASJC Scopus subject areas
- Software
- Hardware and Architecture
- Computer Networks and Communications