Sparse Contrastive Learning for Content-Based Cold Item Recommendation
Gregor Meehan, Johan Pauwels

TL;DR
This paper introduces SEMCo, a content-based cold-start recommendation method using sparse contrastive learning and entmax activation, outperforming existing approaches in ranking accuracy.
Contribution
It proposes a novel purely content-based modeling approach with sparse contrastive learning and entmax, avoiding the information gap in traditional CF-based methods.
Findings
SEMCo outperforms existing cold-start methods in ranking accuracy.
The use of entmax activation enables sharper relevance estimation.
Knowledge distillation further enhances the model's performance.
Abstract
Item cold-start is a pervasive challenge for collaborative filtering (CF) recommender systems. Existing methods often train cold-start models by mapping auxiliary item content, such as images or text descriptions, into the embedding space of a CF model. However, such approaches can be limited by the fundamental information gap between CF signals and content features. In this work, we propose to avoid this limitation with purely content-based modeling of cold items, i.e. without alignment with CF user or item embeddings. We instead frame cold-start prediction in terms of item-item similarity, training a content encoder to project into a latent space where similarity correlates with user preferences. We define our training objective as a sparse generalization of sampled softmax loss with the -entmax family of activation functions, which allows for sharper estimation of item…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
