Abstract
Hierarchical Dirichlet processes (HDP) was originally designed and experimented for a single data channel. In this paper we enhanced its ability to model heterogeneous data using a richer structure for the base measure being a product-space. The enhanced model, called Product Space HDP (PS-HDP), can (1) simultaneously model heterogeneous data from multiple sources in a Bayesian nonparametric framework and (2) discover multilevel latent structures from data to result in different types of topics/latent structures that can be explained jointly. We experimented with the MDC dataset, a large and real-world data collected from mobile phones. Our goal was to discover identity–location– time (a.k.a who-where-when) patterns at different levels (globally for all groups and locally for each group). We provided analysis on the activities and patterns learned from our model, visualized, compared and contrasted with the ground-truth to demonstrate the merit of the proposed framework. We further quantitatively evaluated and reported its performance using standard metrics including F1-score, NMI, RI, and purity. We also compared the performance of the PS-HDP model with those of popular existing clustering methods (including K-Means, NNMF, GMM, DP-Means, and AP). Lastly, we demonstrate the ability of the model in learning activities with missing data, a common problem encountered in pervasive and ubiquitous computing applications.
Original language | English |
---|---|
Title of host publication | Trends and Applications in Knowledge Discovery and Data Mining |
Subtitle of host publication | PAKDD 2016 Workshops, BDM, MLSDA, PACC, WDMBF - Auckland, New Zealand, April 19, 2016 - Revised Selected Papers |
Editors | Huiping Cao, Jinyan Li, Ruili Wang |
Place of Publication | Cham Switzerland |
Publisher | Springer |
Pages | 128-140 |
Number of pages | 13 |
ISBN (Electronic) | 9783319429960 |
ISBN (Print) | 9783319429953 |
DOIs | |
Publication status | Published - 2016 |
Externally published | Yes |
Event | PAKDD 2016 Workshops: BDM, MLSDA, PACC, WDMBF - Auckland, New Zealand Duration: 19 Apr 2019 → 19 Apr 2019 http://pakdd16.wordpress.fos.auckland.ac.nz/ |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Publisher | Springer |
Volume | 9794 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | PAKDD 2016 Workshops |
---|---|
Country/Territory | New Zealand |
City | Auckland |
Period | 19/04/19 → 19/04/19 |
Internet address |