Generalization Bounds for Semi-supervised Matrix Completion with Distributional Side Information
Antoine Ledent, Mun Chong Soo, Nong Minh Hieu

TL;DR
This paper develops theoretical error bounds for semi-supervised matrix completion leveraging both labeled and unlabeled data, with applications to recommender systems using explicit and implicit feedback.
Contribution
It introduces a novel analysis combining low-rank subspace recovery with generalization bounds for semi-supervised matrix completion, accounting for distributional side information.
Findings
Error bounds scale as rac{rac{rac{rac{nd}{M}}}{rac{dr}{N}}
Synthetic experiments confirm independent error components for P and R estimation
Real-world experiments show improved performance over explicit-only baselines
Abstract
We study a matrix completion problem where both the ground truth matrix and the unknown sampling distribution over observed entries are low-rank matrices, and \textit{share a common subspace}. We assume that a large amount of \textit{unlabeled} data drawn from the sampling distribution is available, together with a small amount of labeled data drawn from the same distribution and noisy estimates of the corresponding ground truth entries. This setting is inspired by recommender systems scenarios where the unlabeled data corresponds to `implicit feedback' (consisting in interactions such as purchase, click, etc. ) and the labeled data corresponds to the `explicit feedback', consisting of interactions where the user has given an explicit rating to the item. Leveraging powerful results from the theory of low-rank subspace recovery, together with classic generalization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Stochastic Gradient Optimization Techniques · Tensor decomposition and applications
