Multi-Modality is All You Need for Transferable Recommender Systems
Youhua Li, Hanwen Du, Yongxin Ni, Pengpeng Zhao, Qi Guo, Fajie Yuan,, Xiaofang Zhou

TL;DR
This paper introduces PMMRec, a multi-modality based recommender system that overcomes the limitations of ID-based methods, enabling transferability across domains and platforms through novel contrastive learning and self-supervised objectives.
Contribution
It proposes a flexible, multi-modal framework with new learning objectives that improve transferability and robustness in recommender systems, moving beyond traditional ID-based approaches.
Findings
Outperforms state-of-the-art recommenders in multiple datasets
Demonstrates strong transferability across domains and modalities
Achieves superior recommendation accuracy and robustness
Abstract
ID-based Recommender Systems (RecSys), where each item is assigned a unique identifier and subsequently converted into an embedding vector, have dominated the designing of RecSys. Though prevalent, such ID-based paradigm is not suitable for developing transferable RecSys and is also susceptible to the cold-start issue. In this paper, we unleash the boundaries of the ID-based paradigm and propose a Pure Multi-Modality based Recommender system (PMMRec), which relies solely on the multi-modal contents of the items (e.g., texts and images) and learns transition patterns general enough to transfer across domains and platforms. Specifically, we design a plug-and-play framework architecture consisting of multi-modal item encoders, a fusion module, and a user encoder. To align the cross-modal item representations, we propose a novel next-item enhanced cross-modal contrastive learning objective,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRecommender Systems and Techniques · Multimodal Machine Learning Applications · Topic Modeling
MethodsContrastive Learning · ALIGN
