Abstract
Discovering patterns in long event sequences is an important data mining task. Most existing work focuses on frequency-based quality measures that allow algorithms to use the anti-monotonicity property to prune the search space and efficiently discover the most frequent patterns. In this work, we step away from such measures, and evaluate patterns using cohesion—a measure of how close to each other the items making up the pattern appear in the sequence on average. We tackle the fact that cohesion is not an anti-monotonic measure by developing a novel pruning technique in order to reduce the search space. By doing so, we are able to efficiently unearth rare, but strongly cohesive, patterns that existing methods often fail to discover. The data and software related to this paper are available at https://bitbucket.org/len feremans/ sequencepatternmining public.
Original language | English |
---|---|
Title of host publication | Machine Learning and Knowledge Discovery in Databases |
Subtitle of host publication | European Conference, ECML PKDD 2016, Riva del Garda, Italy, September 19–23, 2016, Proceedings, Part I |
Editors | Paolo Frasconi, Niels Landwehr, Giuseppe Manco, Jilles Vreeken |
Publisher | Springer |
Pages | 361-377 |
Number of pages | 17 |
ISBN (Electronic) | 9783319461281 |
ISBN (Print) | 9783319461274 |
DOIs | |
Publication status | Published - 2016 |
Event | European Conference on Machine Learning and Knowledge Discovery in Databases 2016 - Riva del Garda, Italy Duration: 19 Sept 2016 → 23 Sept 2016 Conference number: 15th http://www.ecmlpkdd2016.org/ https://link.springer.com/book/10.1007/978-3-319-46128-1 (Proceedings) |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 9851 LNAI |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | European Conference on Machine Learning and Knowledge Discovery in Databases 2016 |
---|---|
Abbreviated title | ECML PKDD 2016 |
Country/Territory | Italy |
City | Riva del Garda |
Period | 19/09/16 → 23/09/16 |
Internet address |