“The Wisdom of the Few: a Collaborative Filtering Approach Based on Expert Opinions from the Web”, Xavier Amatriain, Neal Lathia, Josep M. Pujol, Haewoon Kwak, Nuria Oliver2009-07-19 ()⁠:

Nearest-neighbor collaborative filtering provides a successful means of generating recommendations for web users. However, this approach suffers from several shortcomings, including data sparsity and noise, the cold-start problem, and scalability.

In this work, we present a novel method for recommending items to users based on expert opinions. Our method is a variation of traditional collaborative filtering: rather than applying a nearest neighbor algorithm to the user-rating data, predictions are computed using a set of expert neighbors from an independent dataset, whose opinions are weighted according to their similarity to the user. This wisdom of the few method promises to address some weaknesses in traditional collaborative filtering, while maintaining comparable accuracy.

We validate our approach by predicting a subset of the Netflix data set. We use ratings crawled from a web portal of expert reviews [Rotten Tomatoes], measuring results both in terms of prediction accuracy and recommendation list precision.

Finally, we explore the ability of our method to generate useful recommendations, by reporting the results of a user-study where users prefer the recommendations generated by our approach.

[Keywords: collaborative filtering, cosine similarity, experts, nearest-neighbors, recommender system, top-n recommendations]

4. Results: Based on the previously described data, we measure how well the 169 experts predict the ratings of the 10,000 Netflix users. In order to validate our approach, we set up two different experiments: in the first experiment, we measure the mean error and coverage of the predicted recommendations. In the second experiment, we measure the precision of the recommendation lists generated for the users.

Table 1: Summary of the MAE and Coverage in our Expert-based CF approach compared to Critics’ Choice and Neighbor CF.
Method MAE Coverage
Critics’ Choice 0.885 100%
Expert-CF 0.781 97.7%
Neighbor-CF 0.704 92.9%
Figure 5: (a) Mean absolute error (MAE) and (b) coverage of expert-predicted ratings as a function of the minimum similarity (δ) and confidence (τ) thresholds; (c) MAE versus confidence threshold.
Figure 6: Comparison between Expert CF and Nearest-Neighbor CF: (a) MAE and (b) per user error.

…As shown in §4, our approach does not outperform a “naive” neighbor-CF approach. However, our focus when using external data sources is not as much on prediction accuracy as it is on addressing some of the common problems found in traditional CF recommender systems. In the next paragraphs, we shall describe a few of these problems and discuss how they might be addressed by using a limited set of external experts to generate the predictions (ie. ‘wisdom of the few’):