Choice of Metrics used in Collaborative Filtering and their Impact on Recommender Systems

Author: Jesus Bobadilla Sancho, Eduardo Martúnez Murciano, Francisco Serradilla Garcúa, J.L. Sánchez
Publisher: Institute of Electrical and Electronics Engineers (IEEE)

ABOUT BOOK

The capacity of recommender systems to make correct predictions is essentially determined by the quality and suitability of the collaborative filtering that implements them. The common memory-based metrics are Pearson correlation and cosine, however, their use is not always the most appropriate or sufficiently justified. In this paper, we analyze these two metrics together with the less common mean squared difference (MSD) to discover their advantages and drawbacks in very important aspects such as the impact when introducing different values of k-neighborhoods, minimization of the MAE error, capacity to carry out a sufficient number of predictions, percentage of correct and incorrect predictions and behavior when attempting to recommend the n-best items. The paper lists the results and practical conclusions that have been obtained after carrying out a comparative study of the metrics based on 135 experiments on the MovieLens database of 100,000 ratios

Powered by: