Abstract
The temporal data is ubiquitous, and massive amount of temporal data is generated nowadays. Management of big temporal data is important yet challenging. Processing big temporal data using a distributed system is a desired choice. However, existing distributed systems/methods either cannot support native queries, or are disk-based solutions, which could not well satisfy the requirements of high throughput and low latency. To alleviate this issue, this paper proposes an In-memory based Two-level Index Solution in Spark (ITISS) for processing big temporal data. The framework of our system is easy to understand and implement, but without loss of efficiency. We conduct extensive experiments to verify the performance of our solution. Experimental results based on both real and synthetic datasets consistently demonstrate that our solution is efficient and competitive.
Original language | English (US) |
---|---|
Title of host publication | Database Systems for Advanced Applications - 23rd International Conference, DASFAA 2018, Proceedings |
Editors | Yannis Manolopoulos, Jianxin Li, Shazia Sadiq, Jian Pei |
Publisher | Springer Verlag |
Pages | 549-565 |
Number of pages | 17 |
ISBN (Print) | 9783319914510 |
DOIs | |
State | Published - 2018 |
Event | 23rd International Conference on Database Systems for Advanced Applications, DASFAA 2018 - Gold Coast, Australia Duration: May 21 2018 → May 24 2018 |
Publication series
Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
---|---|
Volume | 10827 LNCS |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 23rd International Conference on Database Systems for Advanced Applications, DASFAA 2018 |
---|---|
Country/Territory | Australia |
City | Gold Coast |
Period | 05/21/18 → 05/24/18 |
Bibliographical note
Publisher Copyright:© Springer International Publishing AG, part of Springer Nature 2018.
Keywords
- Apache Spark
- Big temporal data
- Distributed in-memory analytics
- Temporal queries
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science