Abstract
In this paper, we introduce EXPress closED ITemset Enumeration (Expedite), a new frequent closed itemset (FCI) miner designed to speed up the process of FCIs extraction from a dataset of transactions. Compared to the state of the art, Expedite provides a CPU time saving of up to two orders of magnitude without compromising other dimensions of performance (e.g. memory). The reason why it is so fast is that Expedite wastes less time in mining intermediate item sets that are discarded in later phases of the algorithm. More specifically, it cuts down the number of both duplicate FCIs - those generated multiple times by the algorithm - and infrequent itemsets - those with low support or no supporting transactions. This feature, enjoyable by both sparse and dense datasets, is analytically motivated first, and then experimentally supported by extensive tests on real datasets. As a further contribution, we propose two alternative implementations of Expedite that perform even better than the basic version, although they rely on particular features of the input dataset.
Original language | English (US) |
---|---|
Pages (from-to) | 3933-3944 |
Number of pages | 12 |
Journal | Expert Systems with Applications |
Volume | 42 |
Issue number | 8 |
DOIs | |
State | Published - May 15 2015 |
Externally published | Yes |
Bibliographical note
Generated from Scopus record by KAUST IRTS on 2023-09-20ASJC Scopus subject areas
- Artificial Intelligence
- General Engineering
- Computer Science Applications