Evaluation of User-Based Collaborative Filtering Algorithms Using Regression

Article Preview

Abstract:

The explosive growth of world-wide-web and the emergence of e-commerce make the Collaborative Filtering (CF) algorithms widely used in recommender systems to identify a set of items that will be of interest for a certain user. However, with the rapid growth in the number of users and items, the quality of recommendation decreases. Improved CF algorithms are needed to provide a large number of users with the ability to efficiently obtain useful information. Different user-based approaches for computing similarities (e.g., correlation-based vs. adjusted-cosine) and different techniques for obtaining recommendations (e.g., weighted sum vs. regression model) are presented in the paper. The main steps of the algorithms are divided into two steps. The first is pre-computation of the similarities and pre-selecting of the N-nearest neighbors in the training phase, and the second is online prediction computation. All the algorithms require time linear in the number of items for online predicting. The experiments suggest that the user-based regression algorithms provide better qualities than those not using regression, and the average MAE drop is by 11.16%. Furthermore, the correlation-based regression gets the best MAE drop by 19.41%, and it provides better quality than the best available classic Pearson algorithms.

You might also be interested in these eBooks

Info:

Periodical:

Pages:

400-404

Citation:

Online since:

October 2012

Export:

Price:

Permissions CCC:

Permissions PLS:

Сopyright:

© 2012 Trans Tech Publications Ltd. All Rights Reserved

Share:

Citation:

[1] Xiaoyuan Su, and T. M. Khoshgoftaar: Advances in Artificial Intelligence, Vol. 2009 (2009) No. 1, pp.1-19.

Google Scholar

[2] B. M. Sarwar, G. Karypis, J. A. Konstan, and J. Riedl: Proceedings of the 10th International Conference on World Wide Web (Hong Kong, May 1-5, 2001), pp.285-295.

DOI: 10.1145/371920.372071

Google Scholar

[3] G. Linden, B. Smith, and J. York: IEEE Internet Computing, Vol. 7 (2003) No. 1, p.76–80.

Google Scholar

[4] S. Vucetic and Z. Obradovic: Knowledge and Information Systems, Vol. 7 (2005) No. 1, p.1–22.

Google Scholar

[5] J. Canny: Proceedings of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (Tampere, Finland, August, 2002), p.238–245.

Google Scholar