Collaborative Cross-modal Fusion with Large Language Model for Recommendation
Zhongzhou Liu, Hao Zhang, Kuicai Dong, Yuan Fang

TL;DR
This paper introduces CCF-LLM, a novel framework that fuses semantic knowledge and collaborative signals using large language models to improve recommendation accuracy, addressing limitations of prior methods.
Contribution
The paper proposes a cross-modal fusion framework that combines semantic and collaborative signals within large language models for recommendation systems.
Findings
CCF-LLM outperforms existing methods in recommendation tasks.
Effective fusion of semantic and collaborative signals improves recommendation quality.
Extensive experiments validate the superiority of the proposed framework.
Abstract
Despite the success of conventional collaborative filtering (CF) approaches for recommendation systems, they exhibit limitations in leveraging semantic knowledge within the textual attributes of users and items. Recent focus on the application of large language models for recommendation (LLM4Rec) has highlighted their capability for effective semantic knowledge capture. However, these methods often overlook the collaborative signals in user behaviors. Some simply instruct-tune a language model, while others directly inject the embeddings of a CF-based model, lacking a synergistic fusion of different modalities. To address these issues, we propose a framework of Collaborative Cross-modal Fusion with Large Language Models, termed CCF-LLM, for recommendation. In this framework, we translate the user-item interactions into a hybrid prompt to encode both semantic knowledge and collaborative…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsFocus
