TY - JOUR
T1 - An empirical study on user-topic rating based collaborative filtering methods
AU - He, Tieke
AU - Chen, Zhenyu
AU - Liu, Jia
AU - Zhou, Xiaofang
AU - Du, Xingzhong
AU - Wang, Weiqing
PY - 2017/7
Y1 - 2017/7
N2 - User based collaborative filtering (CF) has been successfully applied into recommender system for years. The main idea of user based CF is to discover communities of users sharing similar interests, thus, in which, the measurement of user similarity is the foundation of CF. However, existing user based CF methods suffer from data sparsity, which means the user-item matrix is often too sparse to get ideal outcome in recommender systems. One possible way to alleviate this problem is to bring new data sources into user based CF. Thanks to the rapid development of social annotation systems, we turn to using tags as new sources. In these approaches, user-topic rating based CF is proposed to extract topics from tags using different topic model methods, based on which we compute the similarities between users by measuring their preferences on topics. In this paper, we conduct comparisons between three user-topic rating based CF methods, using PLSA, Hierarchical Clustering and LDA. All these three methods calculate user-topic preferences according to their ratings of items and topic weights. We conduct the experiments using the MovieLens dataset. The experimental results show that LDA based user-topic rating CF and Hierarchical Clustering outperforms the traditional user based CF in recommending accuracy, while the PLSA based user-topic rating CF performs worse than the traditional user based CF.
AB - User based collaborative filtering (CF) has been successfully applied into recommender system for years. The main idea of user based CF is to discover communities of users sharing similar interests, thus, in which, the measurement of user similarity is the foundation of CF. However, existing user based CF methods suffer from data sparsity, which means the user-item matrix is often too sparse to get ideal outcome in recommender systems. One possible way to alleviate this problem is to bring new data sources into user based CF. Thanks to the rapid development of social annotation systems, we turn to using tags as new sources. In these approaches, user-topic rating based CF is proposed to extract topics from tags using different topic model methods, based on which we compute the similarities between users by measuring their preferences on topics. In this paper, we conduct comparisons between three user-topic rating based CF methods, using PLSA, Hierarchical Clustering and LDA. All these three methods calculate user-topic preferences according to their ratings of items and topic weights. We conduct the experiments using the MovieLens dataset. The experimental results show that LDA based user-topic rating CF and Hierarchical Clustering outperforms the traditional user based CF in recommending accuracy, while the PLSA based user-topic rating CF performs worse than the traditional user based CF.
KW - Collaborative filtering
KW - Hierarchical clustering
KW - LDA
KW - PLSA
KW - Recommender systems
UR - http://www.scopus.com/inward/record.url?scp=84989182943&partnerID=8YFLogxK
U2 - 10.1007/s11280-016-0412-2
DO - 10.1007/s11280-016-0412-2
M3 - Article
AN - SCOPUS:84989182943
SN - 1386-145X
VL - 20
SP - 815
EP - 829
JO - World Wide Web-Internet and Web Information Systems
JF - World Wide Web-Internet and Web Information Systems
IS - 4
ER -