Molar: Multimodal LLMs with Collaborative Filtering Alignment for Enhanced Sequential Recommendation
Yucong Luo, Qitao Qin, Hao Zhang, Mingyue Cheng, Ruiran Yan, Kefan, Wang, Jie Ouyang

TL;DR
Molar introduces a multimodal LLM framework for sequential recommendation that effectively combines textual, non-textual, and collaborative filtering signals to improve personalization and accuracy.
Contribution
It presents a novel multimodal LLM-based recommendation framework that integrates content modalities with collaborative filtering through a post-alignment mechanism.
Findings
Molar outperforms traditional and LLM-based baselines in recommendation accuracy.
The framework effectively captures user interests and contextual semantics.
Multimodal content integration enhances recommendation robustness.
Abstract
Sequential recommendation (SR) systems have evolved significantly over the past decade, transitioning from traditional collaborative filtering to deep learning approaches and, more recently, to large language models (LLMs). While the adoption of LLMs has driven substantial advancements, these models inherently lack collaborative filtering information, relying primarily on textual content data neglecting other modalities and thus failing to achieve optimal recommendation performance. To address this limitation, we propose Molar, a Multimodal large language sequential recommendation framework that integrates multiple content modalities with ID information to capture collaborative signals effectively. Molar employs an MLLM to generate unified item representations from both textual and non-textual data, facilitating comprehensive multimodal modeling and enriching item embeddings.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Language Processing Techniques · Recommender Systems and Techniques · Semantic Web and Ontologies
