TL;DR
This paper introduces SiBraR, a single-branch multimodal embedding network for recommendation systems that effectively handles cold-start and missing modality scenarios by sharing weights across different data types.
Contribution
The paper proposes a novel single-branch embedding approach that encodes multiple modalities with shared weights, improving recommendation accuracy in cold-start and missing modality situations.
Findings
SiBraR outperforms collaborative filtering in cold-start scenarios.
The model is effective across music, movie, and e-commerce domains.
SiBraR reduces modality gaps by mapping different data types into a shared space.
Abstract
Most recommender systems adopt collaborative filtering (CF) and provide recommendations based on past collective interactions. Therefore, the performance of CF algorithms degrades when few or no interactions are available, a scenario referred to as cold-start. To address this issue, previous work relies on models leveraging both collaborative data and side information on the users or items. Similar to multimodal learning, these models aim at combining collaborative and content representations in a shared embedding space. In this work we propose a novel technique for multimodal recommendation, relying on a multimodal Single-Branch embedding network for Recommendation (SiBraR). Leveraging weight-sharing, SiBraR encodes interaction data as well as multimodal side information using the same single-branch embedding network on different modalities. This makes SiBraR effective in scenarios of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
