Collaborative Filtering
Motivation
Given a sparse matrix
A minimal assumption in matrix reconstruction is that entries within the same row or the same column are not independent
Recommender Systems and Collaborative Filtering
The goal of recommender systems is to recommend or pre-select relevant items in a personalized manner, based on a person's history or profile, which leads to sparse matrix completion, where the columns correspond to items on some ordinal rating scale (e.g. 1-5 star or 1-10 numerical), and the rows correspond to people or users. In recommender systems, data are usually highly sparse (e.g. only 1% of the entries are observed).
In collaborative filtering, we exploit the similarity between people's ratings to learn from the collective data.
Formalization
- We have a rating matrix
, where corresponds to the number of users, and corresponds to the number of items. - We have an observation matrix
, where each indicates that is observed.
Preprocessing
Centering
Centering makes rows or columns more comparable and subtract out rating bias
Variance Normalization
It may make sense to normalize the variance to 1: per row or per column.
Let