Abstract
Frequent pattern mining from uncertain data has been paid closed attention due to most of the real life databases contain data with uncertainty. Several approaches have been proposed for mining high significance frequent itemsets over uncertain data, however, previous algorithms yield many redundant frequent itemsets and require to set an appropriate user specified threshold which is difficult for users. In this paper, we formally define the problem of top-fc minimal redundancy probabilistic frequent pattern mining, which targets to identify top-fc patterns with high-significance and low-redundancy simultaneously from uncertain data. We first design uncertain pattern correlation based on Pearson correlation coefficient, which considers pattern uncertainty. Moreover, we present a new algorithm, UTFP, to mine top-fc minimal redundancy frequent patterns of length no less than minimum length mind without setting threshold. We further propose a set of strategies to prune and reduce search space. Experimental results demonstrate that the proposed algorithm achieves good performance in terms of finding top-fc frequent patterns with low redundancy on probabilistic data. Our method represents the first research endeavor for probabilistic data based top-fc correlated pattern mining.
Original language | English |
---|---|
Title of host publication | Neural Information Processing |
Subtitle of host publication | 22nd International Conference, ICONIP 2015 Istanbul, Turkey, November 9–12, 2015 Proceedings, Part IV |
Editors | Sabri Arik, Tingwen Huang, Weng Kin Lai, Qingshan Liu |
Place of Publication | Cham Switzerland |
Publisher | Springer |
Pages | 111-119 |
Number of pages | 9 |
ISBN (Electronic) | 9783319265612 |
ISBN (Print) | 9783319265605 |
DOIs | |
Publication status | Published - 2015 |
Externally published | Yes |
Event | International Conference on Neural Information Processing 2015 - Istanbul, Türkiye Duration: 9 Nov 2015 → 12 Nov 2015 Conference number: 22nd https://web.archive.org/web/20151210114427/http://www.iconip2015.org/ https://link.springer.com/chapter/10.1007%2F978-3-319-26535-3_5 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 9492 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | International Conference on Neural Information Processing 2015 |
---|---|
Abbreviated title | ICONIP 2015 |
Country/Territory | Türkiye |
City | Istanbul |
Period | 9/11/15 → 12/11/15 |
Internet address |
Keywords
- Frequent patterns
- Redundancy
- Top-k
- Uncertain