Analysis of a Collaborative Filter Based on Popularity Amongst Neighbors

Kishor Barman; Onkar Dabeer

arXiv:1006.1772·cs.IT·November 18, 2016

TL;DR

This paper gives a theoretical analysis of a popularity-based collaborative filtering algorithm, identifies three regimes of operation, and validates the findings on real-world datasets such as MovieLens and Netflix.

Contribution

The paper introduces a theoretical framework for analyzing the bit error rate (BER) of a popularity-based collaborative filter, partly filling a gap in the theoretical understanding of its performance.

Findings

1. In the regime with a large number of samples and small degrees of freedom, the BER approaches zero.

2. In the regime with a large number of samples and large degrees of freedom, the BER is bounded away from 0 and 1/2.

3. In the regime with a small number of samples, the algorithm fails.

Abstract

In this paper, we analyze a collaborative filter that answers the simple question: What is popular amongst your friends? While this basic principle seems to be prevalent in many practical implementations, there does not appear to be much theoretical analysis of its performance. In this paper, we partly fill this gap. While recent works on this topic, such as the low-rank matrix completion literature, consider the probability of error in recovering the entire rating matrix, we consider the probability of an error in an individual recommendation (bit error rate (BER)). For a mathematical model introduced in [1],[2], we identify three regimes of operation for our algorithm (named Popularity Amongst Friends (PAF)) in the limit as the matrix size grows to infinity. In a regime characterized by a large number of samples and small degrees of freedom (defined precisely for the model in the paper),…
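
As a rough illustration of the "popular amongst your friends" principle described in the abstract, the Python sketch below predicts a missing ±1 rating by majority vote among the k users most similar to the target user, and reports the fraction of wrong individual predictions as an empirical BER. The similarity measure (agreements minus disagreements on commonly rated items), the parameter `k`, the tie-breaking rule, the toy data, and the function names `paf_predict` and `empirical_ber` are all assumptions made for illustration; the precise PAF algorithm and model are defined in the paper and may differ.

```python
import numpy as np


def paf_predict(ratings, user, item, k):
    """Predict a +1/-1 rating for (user, item) by neighbor majority vote.

    ratings: 2-D integer array with entries +1, -1, or 0 (0 = unobserved).
    The entry ratings[user, item] is assumed to be hidden (0).
    NOTE: the similarity and tie-breaking choices here are illustrative
    assumptions, not necessarily those of the paper.
    """
    # Agreements minus disagreements on commonly rated items.
    # Unobserved entries are 0, so they contribute nothing to the product.
    sims = (ratings @ ratings[user]).astype(float)
    sims[user] = -np.inf                      # a user is not their own neighbor
    sims[ratings[:, item] == 0] = -np.inf     # only users who rated the item vote
    order = np.argsort(-sims, kind="stable")[:k]
    neighbors = order[np.isfinite(sims[order])]
    vote = ratings[neighbors, item].sum()
    return 1 if vote >= 0 else -1             # ties broken toward +1 (arbitrary)


def empirical_ber(truth, observed, k):
    """Fraction of hidden entries of `observed` that are predicted wrongly."""
    hidden = np.argwhere(observed == 0)
    if len(hidden) == 0:
        return 0.0
    errors = sum(paf_predict(observed, u, i, k) != truth[u, i] for u, i in hidden)
    return errors / len(hidden)


# Toy data (hypothetical): two groups of users with opposite tastes.
truth = np.array([
    [1, 1, -1, -1],
    [1, 1, -1, -1],
    [1, 1, -1, -1],
    [-1, -1, 1, 1],
    [-1, -1, 1, 1],
    [-1, -1, 1, 1],
])
observed = truth.copy()
observed[0, 3] = 0  # hide one rating and try to recover it

prediction = paf_predict(observed, user=0, item=3, k=2)
ber = empirical_ber(truth, observed, k=2)
```

In this toy setting the two nearest neighbors of user 0 come from the same taste group, so their vote recovers the hidden rating. With few observed entries the neighbor selection becomes unreliable, which is the intuition behind the small-sample failure regime listed in the findings.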
