Abstract
Ranking over sets arise when users choose between groups of items. For example, a group may be of those movies deemed 5 stars to them, or a customized tour package. It turns out, to model this data type properly, we need to investigate the general combinatorics problem of partitioning a set and ordering the subsets. Here we construct a probabilistic log-linear model over a set of ordered subsets. Inference in this combinatorial space is highly challenging: The space size approaches (N!/2)6:93145N+1 as N approaches infinity. We propose a split-and-merge Metropolis-Hastings procedure that can explore the statespace efficiently. For discovering hidden aspects in the data, we enrich the model with latent binary variables so that the posteriors can be efficiently evaluated. Finally, we evaluate the proposed model on large-scale collaborative filtering tasks and demonstrate that it is competitive against state-of-the-art methods.
Original language | English |
---|---|
Title of host publication | 4th Asian Conference on Machine Learning, ACML 2012 |
Pages | 427-442 |
Number of pages | 16 |
Volume | 25 |
Publication status | Published - 1 Dec 2012 |
Externally published | Yes |
Event | Asian Conference on Machine Learning 2012 - Singapore, Singapore Duration: 4 Nov 2012 → 6 Nov 2012 Conference number: 4th http://proceedings.mlr.press/v25/ (Proceedings) |
Publication series
Name | Journal of Machine Learning Research |
---|---|
Publisher | Journal of Machine Learning Research (JMLR) |
ISSN (Print) | 1532-4435 |
Conference
Conference | Asian Conference on Machine Learning 2012 |
---|---|
Abbreviated title | ACML 2012 |
Country/Territory | Singapore |
City | Singapore |
Period | 4/11/12 → 6/11/12 |
Internet address |
|
Keywords
- Boltzmann machines
- Collaborative filtering
- Latent models
- MCMC
- Ordered sets
- Ranking with ties
- Split-merge