Train Once, Deploy Anywhere: Matryoshka Representation Learning for Multimodal Recommendation
Yueqi Wang, Zhenrui Yue, Huimin Zeng, Dong Wang, Julian McAuley

TL;DR
This paper introduces fMRLRec, a lightweight, scalable multimodal recommendation framework that captures item features at multiple granularities, enabling efficient, one-time training for adaptable recommendation models across diverse datasets.
Contribution
The paper presents a novel Matryoshka representation learning framework that efficiently integrates multimodal features at different granularities for scalable recommendation.
Findings
fMRLRec outperforms state-of-the-art methods on benchmark datasets.
It reduces memory requirements through linear transformations.
The framework scales to various dimensions with one-time training.
Abstract
Despite recent advancements in language and vision modeling, integrating rich multimodal knowledge into recommender systems continues to pose significant challenges. This is primarily due to the need for efficient recommendation, which requires adaptive and interactive responses. In this study, we focus on sequential recommendation and introduce a lightweight framework called full-scale Matryoshka representation learning for multimodal recommendation (fMRLRec). Our fMRLRec captures item features at different granularities, learning informative representations for efficient recommendation across multiple dimensions. To integrate item features from diverse modalities, fMRLRec employs a simple mapping to project multimodal item features into an aligned feature space. Additionally, we design an efficient linear transformation that embeds smaller features into larger ones, substantially…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Speech and dialogue systems
MethodsFocus
