Abstract
Joint analysis of multiple data sources is becoming increasingly popular in transfer learning, multi-task learning and cross-domain data mining. One promising approach to model the data jointly is through learning the shared and individual factor subspaces. However, performance of this approach depends on the subspace dimensionalities and the level of sharing needs to be specified a priori. To this end, we propose a nonparametric joint factor analysis framework for modeling multiple related data sources. Our model utilizes the hierarchical beta process as a nonparametric prior to automatically infer the number of shared and individual factors. For posterior inference, we provide a Gibbs sampling scheme using auxiliary variables. The effectiveness of the proposed framework is validated through its application on two real world problems - Transfer learning in text and image retrieval.
Original language | English |
---|---|
Title of host publication | Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012 |
Pages | 200-211 |
Number of pages | 12 |
Publication status | Published - 1 Dec 2012 |
Externally published | Yes |
Event | SIAM International Conference on Data Mining 2012 - Anaheim, United States of America Duration: 26 Apr 2012 → 28 Apr 2012 Conference number: 12th |
Publication series
Name | Proceedings of the 12th SIAM International Conference on Data Mining, SDM 2012 |
---|
Conference
Conference | SIAM International Conference on Data Mining 2012 |
---|---|
Abbreviated title | SDM 2012 |
Country/Territory | United States of America |
City | Anaheim |
Period | 26/04/12 → 28/04/12 |